spark-instrumented-optimizer

History

Wenchen Fan 50a991264c [SPARK-19359][SQL] renaming partition should not leave useless directories ## What changes were proposed in this pull request? Hive metastore is not case-preserving and keep partition columns with lower case names. If Spark SQL creates a table with upper-case partition column names using `HiveExternalCatalog`, when we rename partition, it first calls the HiveClient to renamePartition, which will create a new lower case partition path, then Spark SQL renames the lower case path to upper-case. However, when we rename a nested path, different file systems have different behaviors. e.g. in jenkins, renaming `a=1/b=2` to `A=2/B=2` will success, but leave an empty directory `a=1`. in mac os, the renaming doesn't work as expected and result to `a=1/B=2`. This PR renames the partition directory recursively from the first partition column in `HiveExternalCatalog`, to be most compatible with different file systems. ## How was this patch tested? new regression test Author: Wenchen Fan <wenchen@databricks.com> Closes #16837 from cloud-fan/partition.		2017-02-09 00:39:22 -05:00
..
compatibility/src/test/scala/org/apache/spark/sql/hive/execution	[SPARK-18936][SQL] Infrastructure for session local timezone support.	2017-01-26 11:51:05 +01:00
src	[SPARK-19359][SQL] renaming partition should not leave useless directories	2017-02-09 00:39:22 -05:00
pom.xml	[SPARK-17807][CORE] split test-tags into test-JAR	2016-12-21 16:37:20 -08:00