spark-instrumented-optimizer

History

Gengliang Wang f5107614d6 [SPARK-28089][SQL] File source v2: support reading output of file streaming Sink ## What changes were proposed in this pull request? File source V1 supports reading output of FileStreamSink as batch. https://github.com/apache/spark/pull/11897 We should support this in file source V2 as well. When reading with paths, we first check if there is metadata log of FileStreamSink. If yes, we use `MetadataLogFileIndex` for listing files; Otherwise, we use `InMemoryFileIndex`. ## How was this patch tested? Unit test Closes #24900 from gengliangwang/FileStreamV2. Authored-by: Gengliang Wang <gengliang.wang@databricks.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>		2019-06-20 12:57:13 +08:00
..
benchmarks	[SPARK-27701][SQL] Extend NestedColumnAliasing to general nested field cases including GetArrayStructField	2019-06-11 20:12:53 -07:00
src	[SPARK-28089][SQL] File source v2: support reading output of file streaming Sink	2019-06-20 12:57:13 +08:00
v1.2.1/src	[SPARK-27105][SQL] Optimize away exponential complexity in ORC predicate conversion	2019-06-19 10:44:58 +08:00
v2.3.5/src	[SPARK-27105][SQL] Optimize away exponential complexity in ORC predicate conversion	2019-06-19 10:44:58 +08:00
pom.xml	[SPARK-27521][SQL] Move data source v2 to catalyst module	2019-06-05 09:55:55 -07:00