spark-instrumented-optimizer

History

HyukjinKwon f984f6acfe Revert "[SPARK-27870][SQL][PYSPARK] Flush batch timely for pandas UDF (for improving pandas UDFs pipeline)" ## What changes were proposed in this pull request? This PR reverts `9c4eb99c52` for the reasons below: 1. An alternative was not considered properly, https://github.com/apache/spark/pull/24734#issuecomment-500101639 https://github.com/apache/spark/pull/24734#issuecomment-500102340 https://github.com/apache/spark/pull/24734#issuecomment-499202982 - I opened a PR https://github.com/apache/spark/pull/24826 2. `9c4eb99c52` fixed timely flushing which behaviour is somewhat hacky and the timing isn't also guaranteed (in case each batch takes longer to process). 3. For pipelining for smaller batches, looks it's better to allow to configure buffer size rather than having another factor to flush ## How was this patch tested? N/A Closes #24827 from HyukjinKwon/revert-flush. Authored-by: HyukjinKwon <gurwls223@apache.org> Signed-off-by: Dongjoon Hyun <dhyun@apple.com>		2019-06-09 08:28:31 -07:00
..
__init__.py
test_appsubmit.py
test_broadcast.py	[SPARK-26201] Fix python broadcast with encryption	2018-11-30 12:48:56 -06:00
test_conf.py
test_context.py	[SPARK-26349][PYSPARK] Forbid insecure py4j gateways	2019-01-08 11:26:36 -08:00
test_daemon.py
test_join.py
test_profiler.py
test_rdd.py	[SPARK-23961][SPARK-27548][PYTHON] Fix error when toLocalIterator goes out of scope and properly raise errors from worker	2019-05-07 14:47:39 -07:00
test_readwrite.py
test_serializers.py	Revert "[SPARK-27870][SQL][PYSPARK] Flush batch timely for pandas UDF (for improving pandas UDFs pipeline)"	2019-06-09 08:28:31 -07:00
test_shuffle.py
test_taskcontext.py	[SPARK-25921][FOLLOW UP][PYSPARK] Fix barrier task run without BarrierTaskContext while python worker reuse	2019-01-11 14:28:37 +08:00
test_util.py
test_worker.py	[SPARK-26743][PYTHON] Adds a test to check the actual resource limit set via 'spark.executor.pyspark.memory'	2019-01-28 10:02:27 +08:00