spark-instrumented-optimizer

History

Cheng Lian 90f304b0c9 [SPARK-7567] [SQL] Migrating Parquet data source to FSBasedRelation This PR migrates Parquet data source to the newly introduced `FSBasedRelation`. `FSBasedParquetRelation` is created to replace `ParquetRelation2`. Major differences are: 1. Partition discovery code has been factored out to `FSBasedRelation` 1. `AppendingParquetOutputFormat` is not used now. Instead, an anonymous subclass of `ParquetOutputFormat` is used to handle appending and writing dynamic partitions 1. When scanning partitioned tables, `FSBasedParquetRelation.buildScan` only builds an `RDD[Row]` for a single selected partition 1. `FSBasedParquetRelation` doesn't rely on Catalyst expressions for filter push down, thus it doesn't extend `CatalystScan` anymore After migrating `JSONRelation` (which extends `CatalystScan`), we can remove `CatalystScan`. <!-- Reviewable:start --> [<img src="https://reviewable.io/review_button.png" height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/6090) <!-- Reviewable:end --> Author: Cheng Lian <lian@databricks.com> Closes #6090 from liancheng/parquet-migration and squashes the following commits: 6063f87 [Cheng Lian] Casts to OutputCommitter rather than FileOutputCommtter bfd1cf0 [Cheng Lian] Fixes compilation error introduced while rebasing f9ea56e [Cheng Lian] Adds ParquetRelation2 related classes to MiMa check whitelist 261d8c1 [Cheng Lian] Minor bug fix and more tests db65660 [Cheng Lian] Migrates Parquet data source to FSBasedRelation (cherry picked from commit `7ff16e8abe`) Signed-off-by: Michael Armbrust <michael@databricks.com>		2015-05-13 11:04:21 -07:00
..
project	[SPARK-6750] Upgrade ScalaStyle to 0.7.	2015-04-07 12:37:33 -07:00
build.properties	[SPARK-5415] bump sbt to version to 0.13.7	2015-01-28 02:13:06 -08:00
MimaBuild.scala	[SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT.	2015-03-20 18:43:57 +00:00
MimaExcludes.scala	[SPARK-7567] [SQL] Migrating Parquet data source to FSBasedRelation	2015-05-13 11:04:21 -07:00
plugins.sbt	[SPARK-6750] Upgrade ScalaStyle to 0.7.	2015-04-07 12:37:33 -07:00
SparkBuild.scala	[SPARK-7485] [BUILD] Remove pyspark files from assembly.	2015-05-12 01:39:28 -07:00