spark-instrumented-optimizer


Cheng Lian cbaf595447 [SPARK-8014] [SQL] Avoid premature metadata discovery when writing a HadoopFsRelation with a save mode other than Append	2015-06-02 13:32:34 -07:00

The current code references the schema of the DataFrame to be written before checking save mode. This triggers expensive metadata discovery prematurely. For save mode other than `Append`, this metadata discovery is useless since we either ignore the result (for `Ignore` and `ErrorIfExists`) or delete existing files (for `Overwrite`) later.

This PR fixes this issue by deferring metadata discovery after save mode checking.

Author: Cheng Lian <lian@databricks.com>

Closes #6583 from liancheng/spark-8014 and squashes the following commits:

1aafabd [Cheng Lian] Updates comments
088abaa [Cheng Lian] Avoids schema merging and partition discovery when data schema and partition schema are defined
8fbd93f [Cheng Lian] Fixes SPARK-8014

(cherry picked from commit `686a45f0b9`)
Signed-off-by: Yin Huai <yhuai@databricks.com>
compatibility/src/test/scala/org/apache/spark/sql/hive/execution	[SQL] [TEST] udf_java_method failed due to jdk version	2015-05-21 12:32:10 -07:00
src	[SPARK-8014] [SQL] Avoid premature metadata discovery when writing a HadoopFsRelation with a save mode other than Append	2015-06-02 13:32:34 -07:00
v0.13.1/src/main/scala/org/apache/spark/sql/hive	[SPARK-6505] [SQL] Remove the reflection call in HiveFunctionWrapper	2015-04-27 14:08:05 +08:00
pom.xml	Preparing development version 1.4.0-SNAPSHOT	2015-06-02 08:41:15 -07:00