spark-instrumented-optimizer/sql/core
Gengliang Wang f5107614d6 [SPARK-28089][SQL] File source v2: support reading output of file streaming Sink
## What changes were proposed in this pull request?

File source V1 supports reading output of FileStreamSink as batch. https://github.com/apache/spark/pull/11897
We should support this in file source V2 as well. When reading with paths, we first check if there is metadata log of FileStreamSink. If yes, we use `MetadataLogFileIndex` for listing files; Otherwise, we use `InMemoryFileIndex`.

## How was this patch tested?

Unit test

Closes #24900 from gengliangwang/FileStreamV2.

Authored-by: Gengliang Wang <gengliang.wang@databricks.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
2019-06-20 12:57:13 +08:00
..
benchmarks [SPARK-27701][SQL] Extend NestedColumnAliasing to general nested field cases including GetArrayStructField 2019-06-11 20:12:53 -07:00
src [SPARK-28089][SQL] File source v2: support reading output of file streaming Sink 2019-06-20 12:57:13 +08:00
v1.2.1/src [SPARK-27105][SQL] Optimize away exponential complexity in ORC predicate conversion 2019-06-19 10:44:58 +08:00
v2.3.5/src [SPARK-27105][SQL] Optimize away exponential complexity in ORC predicate conversion 2019-06-19 10:44:58 +08:00
pom.xml [SPARK-27521][SQL] Move data source v2 to catalyst module 2019-06-05 09:55:55 -07:00