ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Patrick Wendell	f2796816be	Preparing Spark release v1.4.0-rc3	2015-05-28 23:40:22 -07:00
Patrick Wendell	119c93af9c	Preparing development version 1.4.0-SNAPSHOT	2015-05-28 22:57:31 -07:00
Patrick Wendell	2d97d7a0aa	Preparing Spark release v1.4.0-rc3	2015-05-28 22:57:26 -07:00
Reynold Xin	9b97e95e86	[SPARK-7927] whitespace fixes for SQL core. So we can enable a whitespace enforcement rule in the style checker to save code review time. Author: Reynold Xin <rxin@databricks.com> Closes #6477 from rxin/whitespace-sql-core and squashes the following commits: ce6e369 [Reynold Xin] Fixed tests. 6095fed [Reynold Xin] [SPARK-7927] whitespace fixes for SQL core. (cherry picked from commit `ff44c711ab`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-05-28 20:10:28 -07:00
Patrick Wendell	7c342bdd93	Preparing development version 1.4.0-SNAPSHOT	2015-05-27 22:36:30 -07:00
Patrick Wendell	4983dfc878	Preparing Spark release v1.4.0-rc3	2015-05-27 22:36:23 -07:00
Liang-Chi Hsieh	b4ecbce65c	[SPARK-7897][SQL] Use DecimalType to represent unsigned bigint in JDBCRDD JIRA: https://issues.apache.org/jira/browse/SPARK-7897 Author: Liang-Chi Hsieh <viirya@gmail.com> Closes #6438 from viirya/jdbc_unsigned_bigint and squashes the following commits: ccb3c3f [Liang-Chi Hsieh] Use DecimalType to represent unsigned bigint. (cherry picked from commit `a1e092eae5`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-05-27 18:51:42 -07:00
Cheng Lian	89fe93fc3b	[SPARK-7684] [SQL] Refactoring MetastoreDataSourcesSuite to workaround SPARK-7684 As stated in SPARK-7684, currently `TestHive.reset` has some execution order specific bug, which makes running specific test suites locally pretty frustrating. This PR refactors `MetastoreDataSourcesSuite` (which relies on `TestHive.reset` heavily) using various `withXxx` utility methods in `SQLTestUtils` to ask each test case to cleanup their own mess so that we can avoid calling `TestHive.reset`. Author: Cheng Lian <lian@databricks.com> Author: Yin Huai <yhuai@databricks.com> Closes #6353 from liancheng/workaround-spark-7684 and squashes the following commits: 26939aa [Yin Huai] Move the initialization of jsonFilePath to beforeAll. a423d48 [Cheng Lian] Fixes Scala style issue dfe45d0 [Cheng Lian] Refactors MetastoreDataSourcesSuite to workaround SPARK-7684 92a116d [Cheng Lian] Fixes minor styling issues (cherry picked from commit `b97ddff000`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-27 13:09:42 -07:00
Reynold Xin	0468d57a6f	Removed Guava dependency from JavaTypeInference's type signature. This should also close #6243. Author: Reynold Xin <rxin@databricks.com> Closes #6431 from rxin/JavaTypeInference-guava and squashes the following commits: e58df3c [Reynold Xin] Removed Gauva dependency from JavaTypeInference's type signature. (cherry picked from commit `6fec1a9409`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-05-27 11:54:42 -07:00
Cheng Lian	a25ce91f96	[SPARK-7847] [SQL] Fixes dynamic partition directory escaping Please refer to [SPARK-7847] [1] for details. [1]: https://issues.apache.org/jira/browse/SPARK-7847 Author: Cheng Lian <lian@databricks.com> Closes #6389 from liancheng/spark-7847 and squashes the following commits: 935c652 [Cheng Lian] Adds test case for writing various data types as dynamic partition value f4fc398 [Cheng Lian] Converts partition columns to Scala type when writing dynamic partitions d0aeca0 [Cheng Lian] Fixes dynamic partition directory escaping (cherry picked from commit `15459db4f6`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-27 10:09:20 -07:00
Liang-Chi Hsieh	01c3ef536d	[SPARK-7697][SQL] Use LongType for unsigned int in JDBCRDD JIRA: https://issues.apache.org/jira/browse/SPARK-7697 The reported problem case is mysql. But for h2 db, there is no unsigned int. So it is not able to add corresponding test. Author: Liang-Chi Hsieh <viirya@gmail.com> Closes #6229 from viirya/unsignedint_as_long and squashes the following commits: dc4b5d8 [Liang-Chi Hsieh] Merge remote-tracking branch 'upstream/master' into unsignedint_as_long 608695b [Liang-Chi Hsieh] Use LongType for unsigned int in JDBCRDD. (cherry picked from commit `4f98d7a7f1`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-05-27 00:27:44 -07:00
Cheng Lian	d0bd68ff8a	[SPARK-7868] [SQL] Ignores _temporary directories in HadoopFsRelation So that potential partial/corrupted data files left by failed tasks/jobs won't affect normal data scan. Author: Cheng Lian <lian@databricks.com> Closes #6411 from liancheng/spark-7868 and squashes the following commits: 273ea36 [Cheng Lian] Ignores _temporary directories (cherry picked from commit `b463e6d618`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-26 20:49:05 -07:00
Josh Rosen	faadbd4d99	[SPARK-7858] [SQL] Use output schema, not relation schema, for data source input conversion In `DataSourceStrategy.createPhysicalRDD`, we use the relation schema as the target schema for converting incoming rows into Catalyst rows. However, we should be using the output schema instead, since our scan might return a subset of the relation's columns. This patch incorporates #6414 by liancheng, which fixes an issue in `SimpleTestRelation` that prevented this bug from being caught by our old tests: > In `SimpleTextRelation`, we specified `needsConversion` to `true`, indicating that values produced by this testing relation should be of Scala types, and need to be converted to Catalyst types when necessary. However, we also used `Cast` to convert strings to expected data types. And `Cast` always produces values of Catalyst types, thus no conversion is done at all. This PR makes `SimpleTextRelation` produce Scala values so that data conversion code paths can be properly tested. Closes #5986. Author: Josh Rosen <joshrosen@databricks.com> Author: Cheng Lian <lian@databricks.com> Author: Cheng Lian <liancheng@users.noreply.github.com> Closes #6400 from JoshRosen/SPARK-7858 and squashes the following commits: e71c866 [Josh Rosen] Re-fix bug so that the tests pass again 56b13e5 [Josh Rosen] Add regression test to hadoopFsRelationSuites 2169a0f [Josh Rosen] Remove use of SpecificMutableRow and BufferedIterator 6cd7366 [Josh Rosen] Fix SPARK-7858 by using output types for conversion. 5a00e66 [Josh Rosen] Add assertions in order to reproduce SPARK-7858 8ba195c [Cheng Lian] Merge 9968fba9979287aaa1f141ba18bfb9d4c116a3b3 into `61664732b2` 9968fba [Cheng Lian] Tests the data type conversion code paths (cherry picked from commit `0c33c7b4a6`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-26 20:24:50 -07:00
Cheng Lian	7edb17bf07	[SPARK-7842] [SQL] Makes task committing/aborting in InsertIntoHadoopFsRelation more robust When committing/aborting a write task issued in `InsertIntoHadoopFsRelation`, if an exception is thrown from `OutputWriter.close()`, the committing/aborting process will be interrupted, and leaves messy stuff behind (e.g., the `_temporary` directory created by `FileOutputCommitter`). This PR makes these two process more robust by catching potential exceptions and falling back to normal task committment/abort. Author: Cheng Lian <lian@databricks.com> Closes #6378 from liancheng/spark-7838 and squashes the following commits: f18253a [Cheng Lian] Makes task committing/aborting in InsertIntoHadoopFsRelation more robust (cherry picked from commit `8af1bf10b7`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-05-26 00:29:06 +08:00
Yin Huai	b06389caec	[SPARK-7805] [SQL] Move SQLTestUtils.scala and ParquetTest.scala to src/test https://issues.apache.org/jira/browse/SPARK-7805 Because `sql/hive`'s tests depend on the test jar of `sql/core`, we do not need to store `SQLTestUtils` and `ParquetTest` in `src/main`. We should only add stuff that will be needed by `sql/console` or Python tests (for Python, we need it in `src/main`, right? davies). Author: Yin Huai <yhuai@databricks.com> Closes #6334 from yhuai/SPARK-7805 and squashes the following commits: af6d0c9 [Yin Huai] mima b86746a [Yin Huai] Move SQLTestUtils.scala and ParquetTest.scala to src/test. (cherry picked from commit `ed21476bc0`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-24 09:51:49 -07:00
Patrick Wendell	947d700ec8	Preparing development version 1.4.0-SNAPSHOT	2015-05-23 20:13:05 -07:00
Patrick Wendell	03fb26a3e5	Preparing Spark release v1.4.0-rc2	2015-05-23 20:13:00 -07:00
Patrick Wendell	f2f74b9b1a	Preparing development version 1.4.1-SNAPSHOT	2015-05-23 14:59:37 -07:00
Patrick Wendell	0da7396990	Preparing Spark release v1.4.0-rc2-test	2015-05-23 14:59:31 -07:00
Patrick Wendell	8da8caab17	Preparing development version 1.4.1-SNAPSHOT	2015-05-23 14:46:27 -07:00
Patrick Wendell	8f50218f38	Preparing Spark release 1.4.0-rc2-test	2015-05-23 14:46:23 -07:00
Yin Huai	8d6d8a538c	[SPARK-7654] [SQL] Move insertInto into reader/writer interface. This one continues the work of https://github.com/apache/spark/pull/6216. Author: Yin Huai <yhuai@databricks.com> Author: Reynold Xin <rxin@databricks.com> Closes #6366 from yhuai/insert and squashes the following commits: 3d717fb [Yin Huai] Use insertInto to handle the casue when table exists and Append is used for saveAsTable. 56d2540 [Yin Huai] Add PreWriteCheck to HiveContext's analyzer. c636e35 [Yin Huai] Remove unnecessary empty lines. cf83837 [Yin Huai] Move insertInto to write. Also, remove the partition columns from InsertIntoHadoopFsRelation. 0841a54 [Reynold Xin] Removed experimental tag for deprecated methods. 33ed8ef [Reynold Xin] [SPARK-7654][SQL] Move insertInto into reader/writer interface. (cherry picked from commit `2b7e63585d`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-23 09:48:30 -07:00
Davies Liu	d1515381cb	[SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related updates 1. ntile should take an integer as parameter. 2. Added Python API (based on #6364) 3. Update documentation of various DataFrame Python functions. Author: Davies Liu <davies@databricks.com> Author: Reynold Xin <rxin@databricks.com> Closes #6374 from rxin/window-final and squashes the following commits: 69004c7 [Reynold Xin] Style fix. 288cea9 [Reynold Xin] Update documentaiton. 7cb8985 [Reynold Xin] Merge pull request #6364 from davies/window 66092b4 [Davies Liu] update docs ed73cb4 [Reynold Xin] [SPARK-7322][SQL] Improve DataFrame window function documentation. ef55132 [Davies Liu] Merge branch 'master' of github.com:apache/spark into window4 8936ade [Davies Liu] fix maxint in python 3 2649358 [Davies Liu] update docs 778e2c0 [Davies Liu] SPARK-7836 and SPARK-7822: Python API of window functions (cherry picked from commit `efe3bfdf49`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-23 08:30:18 -07:00
Michael Armbrust	427dc04c1e	[SPARK-6743] [SQL] Fix empty projections of cached data Author: Michael Armbrust <michael@databricks.com> Closes #6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into wrongColumn aad7eab [Michael Armbrust] rxins comments f1e8df1 [Michael Armbrust] [SPARK-6743][SQL] Fix empty projections of cached data (cherry picked from commit `3b68cb0430`) Signed-off-by: Michael Armbrust <michael@databricks.com>	2015-05-22 09:45:31 -07:00
Cheng Hao	bfaf6a094a	[SPARK-7322][SQL] Window functions in DataFrame This closes #6104. Author: Cheng Hao <hao.cheng@intel.com> Author: Reynold Xin <rxin@databricks.com> Closes #6343 from rxin/window-df and squashes the following commits: 026d587 [Reynold Xin] Address code review feedback. dc448fe [Reynold Xin] Fixed Hive tests. 9794d9d [Reynold Xin] Moved Java test package. 9331605 [Reynold Xin] Refactored API. 3313e2a [Reynold Xin] Merge pull request #6104 from chenghao-intel/df_window d625a64 [Cheng Hao] Update the dataframe window API as suggsted c141fb1 [Cheng Hao] hide all of properties of the WindowFunctionDefinition 3b1865f [Cheng Hao] scaladoc typos f3fd2d0 [Cheng Hao] polish the unit test 6847825 [Cheng Hao] Add additional analystcs functions 57e3bc0 [Cheng Hao] typos 24a08ec [Cheng Hao] scaladoc 28222ed [Cheng Hao] fix bug of range/row Frame 1d91865 [Cheng Hao] style issue 53f89f2 [Cheng Hao] remove the over from the functions.scala 964c013 [Cheng Hao] add more unit tests and window functions 64e18a7 [Cheng Hao] Add Window Function support for DataFrame (cherry picked from commit `f6f2eeb179`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-05-22 01:00:37 -07:00
Yin Huai	11a0640db9	[SPARK-7737] [SQL] Use leaf dirs having data files to discover partitions. https://issues.apache.org/jira/browse/SPARK-7737 cc liancheng Author: Yin Huai <yhuai@databricks.com> Closes #6329 from yhuai/spark-7737 and squashes the following commits: 7e0dfc7 [Yin Huai] Use leaf dirs having data files to discover partitions. (cherry picked from commit `347b50106b`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-05-22 07:12:12 +08:00
Andrew Or	ba04b52360	[SPARK-7718] [SQL] Speed up partitioning by avoiding closure cleaning According to yhuai we spent 6-7 seconds cleaning closures in a partitioning job that takes 12 seconds. Since we provide these closures in Spark we know for sure they are serializable, so we can bypass the cleaning. Author: Andrew Or <andrew@databricks.com> Closes #6256 from andrewor14/sql-partition-speed-up and squashes the following commits: a82b451 [Andrew Or] Fix style 10f7e3e [Andrew Or] Avoid getting call sites and cleaning closures 17e2943 [Andrew Or] Merge branch 'master' of github.com:apache/spark into sql-partition-speed-up 523f042 [Andrew Or] Skip unnecessary Utils.getCallSites too f7fe143 [Andrew Or] Avoid unnecessary closure cleaning (cherry picked from commit `5287eec5a6`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-21 14:33:24 -07:00
Tathagata Das	e597692acd	[SPARK-7478] [SQL] Added SQLContext.getOrCreate Having a SQLContext singleton would make it easier for applications to use a lazily instantiated single shared instance of SQLContext when needed. It would avoid problems like 1. In REPL/notebook environment, rerunning the line {{val sqlContext = new SQLContext}} multiple times created different contexts while overriding the reference to previous context, leading to issues like registered temp tables going missing. 2. In Streaming, creating SQLContext directly leads to serialization/deserialization issues when attempting to recover from DStream checkpoints. See [SPARK-6770]. Also to get around this problem I had to suggest creating a singleton instance - https://github.com/apache/spark/blob/master/examples/src/main/scala/org/apache/spark/examples/streaming/SqlNetworkWordCount.scala This can be solved by {{SQLContext.getOrCreate}} which get or creates a new singleton instance of SQLContext using either a given SparkContext or a given SparkConf. rxin marmbrus Author: Tathagata Das <tathagata.das1565@gmail.com> Closes #6006 from tdas/SPARK-7478 and squashes the following commits: 25f4da9 [Tathagata Das] Addressed comments. 79fe069 [Tathagata Das] Added comments. c66ca76 [Tathagata Das] Merge remote-tracking branch 'apache-github/master' into SPARK-7478 48adb14 [Tathagata Das] Removed HiveContext.getOrCreate bf8cf50 [Tathagata Das] Fix more bug dec5594 [Tathagata Das] Fixed bug b4e9721 [Tathagata Das] Remove unnecessary import 4ef513b [Tathagata Das] Merge remote-tracking branch 'apache-github/master' into SPARK-7478 d3ea8e4 [Tathagata Das] Added HiveContext 83bc950 [Tathagata Das] Updated tests f82ae81 [Tathagata Das] Fixed test bc72868 [Tathagata Das] Added SQLContext.getOrCreate (cherry picked from commit `3d0cccc858`) Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>	2015-05-21 14:08:31 -07:00
Yin Huai	96c82515b8	[SPARK-7763] [SPARK-7616] [SQL] Persists partition columns into metastore Author: Yin Huai <yhuai@databricks.com> Author: Cheng Lian <lian@databricks.com> Closes #6285 from liancheng/spark-7763 and squashes the following commits: bb2829d [Yin Huai] Fix hashCode. d677f7d [Cheng Lian] Fixes Scala style issue 44b283f [Cheng Lian] Adds test case for SPARK-7616 6733276 [Yin Huai] Fix a bug that potentially causes https://issues.apache.org/jira/browse/SPARK-7616. 6cabf3c [Yin Huai] Update unit test. 7e02910 [Yin Huai] Use metastore partition columns and do not hijack maybePartitionSpec. e9a03ec [Cheng Lian] Persists partition columns into metastore (cherry picked from commit `30f3f556f7`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-21 13:51:49 -07:00
Cheng Lian	70d9839cf3	[SPARK-7749] [SQL] Fixes partition discovery for non-partitioned tables When no partition columns can be found, we should have an empty `PartitionSpec`, rather than a `PartitionSpec` with empty partition columns. This PR together with #6285 should fix SPARK-7749. Author: Cheng Lian <lian@databricks.com> Author: Yin Huai <yhuai@databricks.com> Closes #6287 from liancheng/spark-7749 and squashes the following commits: a799ff3 [Cheng Lian] Adds test cases for SPARK-7749 c4949be [Cheng Lian] Minor refactoring, and tolerant _TEMPORARY directory name 5aa87ea [Yin Huai] Make parsePartitions more robust. fc56656 [Cheng Lian] Returns empty PartitionSpec if no partition columns can be inferred 19ae41e [Cheng Lian] Don't list base directory as leaf directory (cherry picked from commit `8730fbb47b`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-21 10:56:26 -07:00
Davies Liu	3aa6185101	[SPARK-7565] [SQL] fix MapType in JsonRDD The key of Map in JsonRDD should be converted into UTF8String (also failed records), Thanks to yhuai viirya Closes #6084 Author: Davies Liu <davies@databricks.com> Closes #6299 from davies/string_in_json and squashes the following commits: 0dbf559 [Davies Liu] improve test, fix corrupt record 6836a80 [Davies Liu] move unit tests into Scala b97af11 [Davies Liu] fix MapType in JsonRDD (cherry picked from commit `a25c1ab8f0`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-21 09:58:57 -07:00
Liang-Chi Hsieh	e70be6987b	[SPARK-7746][SQL] Add FetchSize parameter for JDBC driver JIRA: https://issues.apache.org/jira/browse/SPARK-7746 Looks like an easy to add parameter but can show significant performance improvement if the JDBC driver accepts it. Author: Liang-Chi Hsieh <viirya@gmail.com> Closes #6283 from viirya/jdbc_fetchsize and squashes the following commits: de47f94 [Liang-Chi Hsieh] Don't keep fetchSize as single parameter. b7bff2f [Liang-Chi Hsieh] Add FetchSize parameter for JDBC driver. (cherry picked from commit `d0eb9ffe97`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-05-20 22:24:04 -07:00
Cheng Hao	4fd674336c	[SPARK-7320] [SQL] Add Cube / Rollup for dataframe This is a follow up for #6257, which broke the maven test. Add cube & rollup for DataFrame For example: ```scala testData.rollup($"a" + $"b", $"b").agg(sum($"a" - $"b")) testData.cube($"a" + $"b", $"b").agg(sum($"a" - $"b")) ``` Author: Cheng Hao <hao.cheng@intel.com> Closes #6304 from chenghao-intel/rollup and squashes the following commits: 04bb1de [Cheng Hao] move the table register/unregister into beforeAll/afterAll a6069f1 [Cheng Hao] cancel the implicit keyword ced4b8f [Cheng Hao] remove the unnecessary code changes 9959dfa [Cheng Hao] update the code as comments e1d88aa [Cheng Hao] update the code as suggested 03bc3d9 [Cheng Hao] Remove the CubedData & RollupedData 5fd62d0 [Cheng Hao] hiden the CubedData & RollupedData 5ffb196 [Cheng Hao] Add Cube / Rollup for dataframe (cherry picked from commit `42c592adb3`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-20 19:58:33 -07:00
Patrick Wendell	9b37e32c55	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 17:29:00 -07:00
Patrick Wendell	1e458e3553	Preparing Spark release rc-test	2015-05-20 17:28:55 -07:00
pwendell	8d66849862	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 17:26:15 -07:00
pwendell	ae29aeaf8e	Preparing Spark release rc-test	2015-05-20 17:26:10 -07:00
jenkins	534c787b9f	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 16:49:59 -07:00
jenkins	5f4d87f608	Preparing Spark release rc-test	2015-05-20 16:49:54 -07:00
Patrick Wendell	205ed15f29	Preparing development version 1.4.0-SNAPSHOT	2015-05-20 16:30:01 -07:00
Patrick Wendell	09a1c6231e	Preparing Spark release rc-test	2015-05-20 16:29:52 -07:00
Patrick Wendell	f84bdbce8c	Revert "[SPARK-7320] [SQL] Add Cube / Rollup for dataframe" This reverts commit `10698e1131`.	2015-05-20 13:39:22 -07:00
Yin Huai	55bd1bb52e	[SPARK-7713] [SQL] Use shared broadcast hadoop conf for partitioned table scan. https://issues.apache.org/jira/browse/SPARK-7713 I tested the performance with the following code: ```scala import sqlContext._ import sqlContext.implicits._ (1 to 5000).foreach { i => val df = (1 to 1000).map(j => (j, s"str$j")).toDF("a", "b").save(s"/tmp/partitioned/i=$i") } sqlContext.sql(""" CREATE TEMPORARY TABLE partitionedParquet USING org.apache.spark.sql.parquet OPTIONS ( path '/tmp/partitioned' )""") table("partitionedParquet").explain(true) ``` In our master `explain` takes 40s in my laptop. With this PR, `explain` takes 14s. Author: Yin Huai <yhuai@databricks.com> Closes #6252 from yhuai/broadcastHadoopConf and squashes the following commits: 6fa73df [Yin Huai] Address comments of Josh and Andrew. 807fbf9 [Yin Huai] Make the new buildScan and SqlNewHadoopRDD private sql. e393555 [Yin Huai] Cheng's comments. 2eb53bb [Yin Huai] Use a shared broadcast Hadoop Configuration for partitioned HadoopFsRelations. (cherry picked from commit `b631bf73b9`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-05-20 11:23:49 -07:00
Cheng Hao	10698e1131	[SPARK-7320] [SQL] Add Cube / Rollup for dataframe Add `cube` & `rollup` for DataFrame For example: ```scala testData.rollup($"a" + $"b", $"b").agg(sum($"a" - $"b")) testData.cube($"a" + $"b", $"b").agg(sum($"a" - $"b")) ``` Author: Cheng Hao <hao.cheng@intel.com> Closes #6257 from chenghao-intel/rollup and squashes the following commits: 7302319 [Cheng Hao] cancel the implicit keyword a66e38f [Cheng Hao] remove the unnecessary code changes a2869d4 [Cheng Hao] update the code as comments c441777 [Cheng Hao] update the code as suggested 84c9564 [Cheng Hao] Remove the CubedData & RollupedData 279584c [Cheng Hao] hiden the CubedData & RollupedData ef357e1 [Cheng Hao] Add Cube / Rollup for dataframe (cherry picked from commit `09265ad7c8`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-05-20 19:12:12 +08:00
scwf	86893390cf	[SPARK-7656] [SQL] use CatalystConf in FunctionRegistry follow up for #5806 Author: scwf <wangfei1@huawei.com> Closes #6164 from scwf/FunctionRegistry and squashes the following commits: 15e6697 [scwf] use catalogconf in FunctionRegistry (cherry picked from commit `60336e3bc0`) Signed-off-by: Michael Armbrust <michael@databricks.com>	2015-05-19 17:36:33 -07:00
Patrick Wendell	ac3197e1b9	Preparing development version 1.4.1-SNAPSHOT	2015-05-19 09:35:12 +00:00
Patrick Wendell	777a08166f	Preparing Spark release v1.4.0-rc1	2015-05-19 09:35:12 +00:00
Patrick Wendell	586ede6b32	Revert "Preparing Spark release v1.4.0-rc1" This reverts commit `79fb01a3be`.	2015-05-19 02:27:14 -07:00
Patrick Wendell	e7309ec729	Revert "Preparing development version 1.4.1-SNAPSHOT" This reverts commit `a1d896b85b`.	2015-05-19 02:27:07 -07:00
Patrick Wendell	a1d896b85b	Preparing development version 1.4.1-SNAPSHOT	2015-05-19 07:13:24 +00:00

1 2 3 4 5 ...

808 commits