spark-instrumented-optimizer


src/main	[SPARK-22833][EXAMPLE] Improvement SparkHive Scala Examples	2017-12-26 09:37:39 -08:00
pom.xml	[SPARK-22142][BUILD][STREAMING] Move Flume support behind a profile, take 2	2017-10-06 15:08:28 +01:00

Wenchen Fan 9348e68420 [SPARK-22833][EXAMPLE] Improvement SparkHive Scala Examples

## What changes were proposed in this pull request?
Some improvements:
1. Point out that the example uses both Spark SQL native syntax and HQL syntax.
2. Avoid giving the table the same name as the temp view, so as not to confuse users.
3. Create the external Hive table over a directory that already has data, which is a more common use case.
4. Remove the usage of `spark.sql.parquet.writeLegacyFormat`. This config was introduced by https://github.com/apache/spark/pull/8566 and has nothing to do with Hive.
5. Remove the `repartition` and `coalesce` examples. These two are not Hive-specific and belong in a different example file. Note that they can't accurately control the number of output files, because `spark.sql.files.maxRecordsPerFile` also affects it.
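
For illustration only, points 3 and 5 might look roughly like the sketch below. It is not the code from this patch: the object name, the temp directories, the table name `hive_bigints`, and the cap of 4 records are all assumptions made for the example, and it presumes a local Spark build with the `spark-hive` module on the classpath.

```scala
import java.nio.file.Files

import org.apache.spark.sql.SparkSession

// Hypothetical sketch; names and paths are illustrative, not taken from the patch.
object ExternalHiveTableSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("ExternalHiveTableSketch")
      .master("local[*]")
      .enableHiveSupport() // needs spark-hive on the classpath
      .getOrCreate()

    // Point 3: write Parquet data first, then create the external table on top of it.
    val dataDir = Files.createTempDirectory("parquet_data").toUri.toString
    spark.range(10).write.mode("overwrite").parquet(dataDir)

    spark.sql("DROP TABLE IF EXISTS hive_bigints")
    spark.sql(
      s"CREATE EXTERNAL TABLE hive_bigints(id BIGINT) STORED AS PARQUET LOCATION '$dataDir'")

    // The external table reads the files that were already in the directory.
    spark.sql("SELECT * FROM hive_bigints").show()

    // Point 5: coalesce(1) requests a single partition, but the per-file record cap
    // can still split that partition's output into several files.
    spark.conf.set("spark.sql.files.maxRecordsPerFile", 4L)
    val cappedDir = Files.createTempDirectory("capped_output").toUri.toString
    spark.range(10).coalesce(1).write.mode("overwrite").parquet(cappedDir)

    spark.stop()
  }
}
```

Dropping the external table afterwards removes only the metastore entry, so the Parquet files in the data directory stay in place.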

## How was this patch tested?

N/A

Author: Wenchen Fan <wenchen@databricks.com>

Closes #20081 from cloud-fan/minor.

2017-12-26 09:37:39 -08:00
