spark-instrumented-optimizer

Reynold Xin 578bfeeff5 [SPARK-7654][SQL] DataFrameReader and DataFrameWriter for input/output API	2015-05-15 22:00:31 -07:00

This patch introduces DataFrameWriter and DataFrameReader.

DataFrameReader interface, accessible through SQLContext.read, contains methods that create DataFrames. These methods used to reside in SQLContext. Example usage:

```scala
sqlContext.read.json("...")
sqlContext.read.parquet("...")
```

DataFrameWriter interface, accessible through DataFrame.write, implements a builder pattern to avoid the proliferation of options in writing DataFrame out. It currently implements:

- mode
- format (e.g. "parquet", "json")
- options (generic options passed down into data sources)
- partitionBy (partitioning columns)

Example usage:

```scala
df.write.mode("append").format("json").partitionBy("date").saveAsTable("myJsonTable")
```

TODO:

- [ ] Documentation update
- [ ] Move JDBC into reader / writer?
- [ ] Deprecate the old interfaces
- [ ] Move the generic load interface into reader.
- [ ] Update example code and documentation

Author: Reynold Xin <rxin@databricks.com>

Closes #6175 from rxin/reader-writer and squashes the following commits:

b146c95 [Reynold Xin] Deprecation of old APIs.
bd8abdf [Reynold Xin] Fixed merge conflict.
26abea2 [Reynold Xin] Added general load methods.
244fbec [Reynold Xin] Added equivalent to example.
4f15d92 [Reynold Xin] Added documentation for partitionBy.
7e91611 [Reynold Xin] [SPARK-7654][SQL] DataFrameReader and DataFrameWriter for input/output API.

compatibility/src/test/scala/org/apache/spark/sql/hive/execution	[SPARK-6908] [SQL] Use isolated Hive client	2015-05-07 19:36:24 -07:00
src	[SPARK-7654][SQL] DataFrameReader and DataFrameWriter for input/output API	2015-05-15 22:00:31 -07:00
v0.12.0/src/main/scala/org/apache/spark/sql/hive	[SPARK-6638] [SQL] Improve performance of StringType in SQL	2015-04-15 13:06:38 -07:00
v0.13.1/src/main/scala/org/apache/spark/sql/hive	[SPARK-6505] [SQL] Remove the reflection call in HiveFunctionWrapper	2015-04-27 14:08:05 +08:00
pom.xml	[SPARK-7168] [BUILD] Update plugin versions in Maven build and centralize versions	2015-04-28 07:48:34 -04:00