ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Patrick Wendell	f8aab7a7bc	Preparing Spark release v1.4.1-rc3	2015-07-06 19:39:37 -07:00
Liang-Chi Hsieh	4d813833df	[SPARK-8463][SQL] Use DriverRegistry to load jdbc driver at writing path JIRA: https://issues.apache.org/jira/browse/SPARK-8463 Currently, at the reading path, `DriverRegistry` is used to load needed jdbc driver at executors. However, at the writing path, we also need `DriverRegistry` to load jdbc driver. Author: Liang-Chi Hsieh <viirya@gmail.com> Closes #6900 from viirya/jdbc_write_driver and squashes the following commits: 16cd04b [Liang-Chi Hsieh] Use DriverRegistry to load jdbc driver at writing path. (cherry picked from commit `d4d6d31db5`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-07-06 17:17:15 -07:00
Patrick Wendell	e990561ce0	Preparing development version 1.4.2-SNAPSHOT	2015-07-02 23:18:53 -07:00
Patrick Wendell	07b95c7adf	Preparing Spark release v1.4.1-rc2	2015-07-02 23:18:48 -07:00
Burak Yavuz	ff76b33b67	[SPARK-8803] handle special characters in elements in crosstab cc rxin Having back ticks or null as elements causes problems. Since elements become column names, we have to drop them from the element as back ticks are special characters. Having null throws exceptions, we could replace them with empty strings. Handling back ticks should be improved for 1.5 Author: Burak Yavuz <brkyvz@gmail.com> Closes #7201 from brkyvz/weird-ct-elements and squashes the following commits: e06b840 [Burak Yavuz] fix scalastyle 93a0d3f [Burak Yavuz] added tests for NaN and Infinity 9dba6ce [Burak Yavuz] address cr1 db71dbd [Burak Yavuz] handle special characters in elements in crosstab (cherry picked from commit `9b23e92c72`) Signed-off-by: Reynold Xin <rxin@databricks.com> Conflicts: sql/core/src/main/scala/org/apache/spark/sql/execution/stat/StatFunctions.scala	2015-07-02 22:12:47 -07:00
Vinod K C	eb0dd45de4	[SPARK-8787] [SQL] Changed parameter order of @deprecated in package object sql Parameter order of deprecated annotation in package object sql is wrong >>deprecated("1.3.0", "use DataFrame") . This has to be changed to deprecated("use DataFrame", "1.3.0") Author: Vinod K C <vinod.kc@huawei.com> Closes #7183 from vinodkc/fix_deprecated_param_order and squashes the following commits: 1cbdbe8 [Vinod K C] Modified the message 700911c [Vinod K C] Changed order of parameters (cherry picked from commit `c572e25617`) Signed-off-by: Sean Owen <sowen@cloudera.com>	2015-07-02 13:42:58 +01:00
Kousuke Saruta	f5c9296a6f	[DOCS] Fix minor wrong lambda expression example. It's a really minor issue but there is an example with wrong lambda-expression usage in `SQLContext.scala` like as follows. ``` sqlContext.udf().register("myUDF", (Integer arg1, String arg2) -> arg2 + arg1), <- We have an extra `)` here. DataTypes.StringType); ``` Author: Kousuke Saruta <sarutak@oss.nttdata.co.jp> Closes #7187 from sarutak/fix-minor-wrong-lambda-expression and squashes the following commits: a13196d [Kousuke Saruta] Fixed minor wrong lambda expression example. (cherry picked from commit `41588365ad`) Signed-off-by: Kousuke Saruta <sarutak@oss.nttdata.co.jp>	2015-07-02 21:16:54 +09:00
Wenchen Fan	2f85d8ee0c	[SPARK-8621] [SQL] support empty string as column name improve the empty check in `parseAttributeName` so that we can allow empty string as column name. Close https://github.com/apache/spark/pull/7117 Author: Wenchen Fan <cloud0fan@outlook.com> Closes #7149 from cloud-fan/8621 and squashes the following commits: efa9e3e [Wenchen Fan] support empty string (cherry picked from commit `31b4a3d7f2`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-07-01 10:31:49 -07:00
Burak Yavuz	ffc793a6ca	[SPARK-8715] ArrayOutOfBoundsException fixed for DataFrameStatSuite.crosstab cc yhuai Author: Burak Yavuz <brkyvz@gmail.com> Closes #7100 from brkyvz/ct-flakiness-fix and squashes the following commits: abc299a [Burak Yavuz] change 'to' to until 7e96d7c [Burak Yavuz] ArrayOutOfBoundsException fixed for DataFrameStatSuite.crosstab (cherry picked from commit `ecacb1e88a`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-06-29 18:48:38 -07:00
Burak Yavuz	6b9f3831a8	[SPARK-8681] fixed wrong ordering of columns in crosstab I specifically randomized the test. What crosstab does is equivalent to a countByKey, therefore if this test fails again for any reason, we will know that we hit a corner case or something. cc rxin marmbrus Author: Burak Yavuz <brkyvz@gmail.com> Closes #7060 from brkyvz/crosstab-fixes and squashes the following commits: 0a65234 [Burak Yavuz] addressed comments v1 d96da7e [Burak Yavuz] fixed wrong ordering of columns in crosstab (cherry picked from commit `be7ef06762`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-29 13:15:12 -07:00
Kousuke Saruta	da51cf58fc	[SQL][DOCS] Remove wrong example from DataFrame.scala In DataFrame.scala, there are examples like as follows. ``` * // The following are equivalent: * peopleDf.filter($"age" > 15) * peopleDf.where($"age" > 15) * peopleDf($"age" > 15) ``` But, I think the last example doesn't work. Author: Kousuke Saruta <sarutak@oss.nttdata.co.jp> Closes #6977 from sarutak/fix-dataframe-example and squashes the following commits: 46efbd7 [Kousuke Saruta] Removed wrong example (cherry picked from commit `94e040d059`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-29 12:16:44 -07:00
Cheng Lian	0605e08434	[SPARK-8604] [SQL] HadoopFsRelation subclasses should set their output format class `HadoopFsRelation` subclasses, especially `ParquetRelation2` should set its own output format class, so that the default output committer can be setup correctly when doing appending (where we ignore user defined output committers). Author: Cheng Lian <lian@databricks.com> Closes #6998 from liancheng/spark-8604 and squashes the following commits: 9be51d1 [Cheng Lian] Adds more comments 6db1368 [Cheng Lian] HadoopFsRelation subclasses should set their output format class (cherry picked from commit `c337844ed7`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-06-25 00:07:01 -07:00
Yin Huai	7e53ff2581	[SPARK-8578] [SQL] Should ignore user defined output committer when appending data (branch 1.4) This is https://github.com/apache/spark/pull/6964 for branch 1.4. Author: Yin Huai <yhuai@databricks.com> Closes #6966 from yhuai/SPARK-8578-branch-1.4 and squashes the following commits: 9c3947b [Yin Huai] Do not use a custom output commiter when appendiing data.	2015-06-24 09:51:18 -07:00
Patrick Wendell	eafbe13459	Preparing development version 1.4.2-SNAPSHOT	2015-06-23 19:48:44 -07:00
Patrick Wendell	60e08e5075	Preparing Spark release v1.4.1-rc1	2015-06-23 19:48:39 -07:00
lockwobr	27693e1757	[SQL] [DOCS] updated the documentation for explode the syntax was incorrect in the example in explode Author: lockwobr <lockwobr@gmail.com> Closes #6943 from lockwobr/master and squashes the following commits: 3d864d1 [lockwobr] updated the documentation for explode (cherry picked from commit `4f7fbefb8d`) Signed-off-by: Kousuke Saruta <sarutak@oss.nttdata.co.jp>	2015-06-24 02:51:36 +09:00
Patrick Wendell	1cfa7302ee	Preparing development version 1.4.2-SNAPSHOT	2015-06-22 22:21:31 -07:00
Patrick Wendell	d0a5560ce4	Preparing Spark release v1.4.1-rc1	2015-06-22 22:21:26 -07:00
Patrick Wendell	48d6830144	[BUILD] Preparing Spark release 1.4.1	2015-06-22 22:18:52 -07:00
Michael Armbrust	65981619b2	[SPARK-8420] [SQL] Fix comparision of timestamps/dates with strings (branch-1.4) This is branch 1.4 backport of https://github.com/apache/spark/pull/6888. Below is the original description. In earlier versions of Spark SQL we casted `TimestampType` and `DataType` to `StringType` when it was involved in a binary comparison with a `StringType`. This allowed comparing a timestamp with a partial date as a user would expect. - `time > "2014-06-10"` - `time > "2014"` In 1.4.0 we tried to cast the String instead into a Timestamp. However, since partial dates are not a valid complete timestamp this results in `null` which results in the tuple being filtered. This PR restores the earlier behavior. Note that we still special case equality so that these comparisons are not affected by not printing zeros for subsecond precision. Author: Michael Armbrust <michaeldatabricks.com> Closes #6888 from marmbrus/timeCompareString and squashes the following commits: bdef29c [Michael Armbrust] test partial date 1f09adf [Michael Armbrust] special handling of equality 1172c60 [Michael Armbrust] more test fixing 4dfc412 [Michael Armbrust] fix tests aaa9508 [Michael Armbrust] newline 04d908f [Michael Armbrust] [SPARK-8420][SQL] Fix comparision of timestamps/dates with strings Conflicts: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/HiveTypeCoercion.scala sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala Author: Michael Armbrust <michael@databricks.com> Closes #6914 from yhuai/timeCompareString-1.4 and squashes the following commits: 9882915 [Michael Armbrust] [SPARK-8420] [SQL] Fix comparision of timestamps/dates with strings	2015-06-22 10:45:33 -07:00
Cheng Lian	451c8722af	[SPARK-8406] [SQL] Backports SPARK-8406 and PR #6864 to branch-1.4 Author: Cheng Lian <lian@databricks.com> Closes #6932 from liancheng/spark-8406-for-1.4 and squashes the following commits: a0168fe [Cheng Lian] Backports SPARK-8406 and PR #6864 to branch-1.4	2015-06-22 10:04:29 -07:00
Yin Huai	2510365faa	[HOT-FIX] Fix compilation (caused by `0131142d98`) Author: Yin Huai <yhuai@databricks.com> Closes #6913 from yhuai/branch-1.4-hotfix and squashes the following commits: 7f91fa0 [Yin Huai] [HOT-FIX] Fix compilation (caused by `0131142d98`).	2015-06-19 17:29:51 -07:00
Nathan Howell	0131142d98	[SPARK-8093] [SQL] Remove empty structs inferred from JSON documents Author: Nathan Howell <nhowell@godaddy.com> Closes #6799 from NathanHowell/spark-8093 and squashes the following commits: 76ac3e8 [Nathan Howell] [SPARK-8093] [SQL] Remove empty structs inferred from JSON documents (cherry picked from commit `9814b971f0`) Signed-off-by: Yin Huai <yhuai@databricks.com> Conflicts: sql/core/src/test/scala/org/apache/spark/sql/json/TestJsonData.scala	2015-06-19 16:23:11 -07:00
Josh Rosen	152f4465d3	[SPARK-8446] [SQL] Add helper functions for testing SparkPlan physical operators This patch introduces `SparkPlanTest`, a base class for unit tests of SparkPlan physical operators. This is analogous to Spark SQL's existing `QueryTest`, which does something similar for end-to-end tests with actual queries. These helper methods provide nicer error output when tests fail and help developers to avoid writing lots of boilerplate in order to execute manually constructed physical plans. Author: Josh Rosen <joshrosen@databricks.com> Author: Josh Rosen <rosenville@gmail.com> Author: Michael Armbrust <michael@databricks.com> Closes #6885 from JoshRosen/spark-plan-test and squashes the following commits: f8ce275 [Josh Rosen] Fix some IntelliJ inspections and delete some dead code 84214be [Josh Rosen] Add an extra column which isn't part of the sort ae1896b [Josh Rosen] Provide implicits automatically a80f9b0 [Josh Rosen] Merge pull request #4 from marmbrus/pr/6885 d9ab1e4 [Michael Armbrust] Add simple resolver c60a44d [Josh Rosen] Manually bind references 996332a [Josh Rosen] Add types so that tests compile a46144a [Josh Rosen] WIP (cherry picked from commit `207a98ca59`) Signed-off-by: Michael Armbrust <michael@databricks.com>	2015-06-18 16:45:27 -07:00
Radek Ostrowski	4da0686508	[SQL] [DOC] improved a comment [SQL][DOC] I found it a bit confusing when I came across it for the first time in the docs Author: Radek Ostrowski <dest.hawaii@gmail.com> Author: radek <radek@radeks-MacBook-Pro-2.local> Closes #6332 from radek1st/master and squashes the following commits: dae3347 [Radek Ostrowski] fixed typo c76bb3a [radek] improved a comment (cherry picked from commit `4bd10fd509`) Signed-off-by: Sean Owen <sowen@cloudera.com>	2015-06-16 21:04:45 +01:00
Michael Armbrust	2805d145e3	[SPARK-8358] [SQL] Wait for child resolution when resolving generators Author: Michael Armbrust <michael@databricks.com> Closes #6811 from marmbrus/aliasExplodeStar and squashes the following commits: fbd2065 [Michael Armbrust] more style 806a373 [Michael Armbrust] fix style 7cbb530 [Michael Armbrust] [SPARK-8358][SQL] Wait for child resolution when resolving generatorsa (cherry picked from commit `9073a426e4`) Signed-off-by: Michael Armbrust <michael@databricks.com>	2015-06-14 11:21:55 -07:00
Michael Armbrust	1ca431e83f	[SPARK-8329][SQL] Allow _ in DataSource options Author: Michael Armbrust <michael@databricks.com> Closes #6786 from marmbrus/optionsParser and squashes the following commits: e7d18ef [Michael Armbrust] add dots 99a3452 [Michael Armbrust] [SPARK-8329][SQL] Allow _ in DataSource options (cherry picked from commit `4aed66f299`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-12 23:11:25 -07:00
navis.ryu	5c05b5c0d2	[SPARK-8285] [SQL] CombineSum should be calculated as unlimited decimal first case cs CombineSum(expr) => val calcType = expr.dataType expr.dataType match { case DecimalType.Fixed(_, _) => DecimalType.Unlimited case _ => expr.dataType } calcType is always expr.dataType. credits are all belong to IntelliJ Author: navis.ryu <navis@apache.org> Closes #6736 from navis/SPARK-8285 and squashes the following commits: 20382c1 [navis.ryu] [SPARK-8285] [SQL] CombineSum should be calculated as unlimited decimal first (cherry picked from commit `6a47114bc2`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-10 18:19:24 -07:00
Cheng Lian	69197c3e38	[SPARK-8121] [SQL] Fixes InsertIntoHadoopFsRelation job initialization for Hadoop 1.x (branch 1.4 backport based on https://github.com/apache/spark/pull/6669 )	2015-06-08 11:36:42 -07:00
Reynold Xin	b9c046f6d7	[SPARK-8004][SQL] Quote identifier in JDBC data source. This is a follow-up patch to #6577 to replace columnEnclosing to quoteIdentifier. I also did some minor cleanup to the JdbcDialect file. Author: Reynold Xin <rxin@databricks.com> Closes #6689 from rxin/jdbc-quote and squashes the following commits: bad365f [Reynold Xin] Fixed test compilation... e39e14e [Reynold Xin] Fixed compilation. db9a8e0 [Reynold Xin] [SPARK-8004][SQL] Quote identifier in JDBC data source. (cherry picked from commit `d6d601a07b`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-07 10:52:18 -07:00
Liang-Chi Hsieh	b4d54417e5	[SPARK-8141] [SQL] Precompute datatypes for partition columns and reuse it JIRA: https://issues.apache.org/jira/browse/SPARK-8141 Author: Liang-Chi Hsieh <viirya@gmail.com> Closes #6687 from viirya/reuse_partition_column_types and squashes the following commits: dab0688 [Liang-Chi Hsieh] Reuse partitionColumnTypes. (cherry picked from commit `26d07f1ece`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-06-07 15:35:43 +08:00
Liang-Chi Hsieh	b6fdc6cf11	[SPARK-8004][SQL] Enclose column names by JDBC Dialect JIRA: https://issues.apache.org/jira/browse/SPARK-8004 Author: Liang-Chi Hsieh <viirya@gmail.com> Closes #6577 from viirya/enclose_jdbc_columns and squashes the following commits: 614606a [Liang-Chi Hsieh] For comment. bc50182 [Liang-Chi Hsieh] Enclose column names by JDBC Dialect. (cherry picked from commit `901a552c5e`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-06 23:00:18 -07:00
Cheng Lian	d8a53fb806	[SPARK-8079] [SQL] Makes InsertIntoHadoopFsRelation job/task abortion more robust As described in SPARK-8079, when writing a DataFrame to a `HadoopFsRelation`, if `HadoopFsRelation.prepareForWriteJob` throws exception, an unexpected NPE will be thrown during job abortion. (This issue doesn't bring much damage since the job is failing anyway.) This PR makes the job/task abortion logic in `InsertIntoHadoopFsRelation` more robust to avoid such confusing exceptions. Author: Cheng Lian <lian@databricks.com> Closes #6612 from liancheng/spark-8079 and squashes the following commits: 87cd81e [Cheng Lian] Addresses @rxin's comment 1864c75 [Cheng Lian] Addresses review comments 9e6dbb3 [Cheng Lian] Makes InsertIntoHadoopFsRelation job/task abortion more robust (cherry picked from commit `16fc49617e`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-06-06 17:23:46 +08:00
Shivaram Venkataraman	3e3151e755	[SPARK-8085] [SPARKR] Support user-specified schema in read.df cc davies sun-rui Author: Shivaram Venkataraman <shivaram@cs.berkeley.edu> Closes #6620 from shivaram/sparkr-read-schema and squashes the following commits: 16a6726 [Shivaram Venkataraman] Fix loadDF to pass schema Also add a unit test a229877 [Shivaram Venkataraman] Use wrapper function to DataFrameReader ee70ba8 [Shivaram Venkataraman] Support user-specified schema in read.df (cherry picked from commit `12f5eaeee1`) Signed-off-by: Shivaram Venkataraman <shivaram@cs.berkeley.edu>	2015-06-05 10:19:15 -07:00
Mike Dusenberry	81ff7a9012	[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column reference. Added a `DataFrame.drop` function that accepts a `Column` reference rather than a `String`, and added associated unit tests. Basically iterates through the `DataFrame` to find a column with an expression that is equivalent to that of the `Column` argument supplied to the function. Author: Mike Dusenberry <dusenberrymw@gmail.com> Closes #6585 from dusenberrymw/SPARK-7969_Drop_method_on_Dataframes_should_handle_Column and squashes the following commits: 514727a [Mike Dusenberry] Updating the @since tag of the drop(Column) function doc to reflect version 1.4.1 instead of 1.4.0. 2f1bb4e [Mike Dusenberry] Adding an additional assert statement to the 'drop column after join' unit test in order to make sure the correct column was indeed left over. 6bf7c0e [Mike Dusenberry] Minor code formatting change. e583888 [Mike Dusenberry] Adding more Python doctests for the df.drop with column reference function to test joined datasets that have columns with the same name. 5f74401 [Mike Dusenberry] Updating DataFrame.drop with column reference function to use logicalPlan.output to prevent ambiguities resulting from columns with the same name. Also added associated unit tests for joined datasets with duplicate column names. 4b8bbe8 [Mike Dusenberry] Adding Python support for Dataframe.drop with a Column reference. 986129c [Mike Dusenberry] Added a DataFrame.drop function that accepts a Column reference rather than a String, and added associated unit tests. Basically iterates through the DataFrame to find a column with an expression that is equivalent to one supplied to the function. (cherry picked from commit `df7da07a86`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-04 11:30:25 -07:00
Andrew Or	bfe74b34a6	[SPARK-7558] Demarcate tests in unit-tests.log (1.4) This includes the following commits: original: `9eb222c` hotfix1: `8c99793` hotfix2: `a4f2412` scalastyle check: `609c492` --- Original patch #6441 Branch-1.3 patch #6602 Author: Andrew Or <andrew@databricks.com> Closes #6598 from andrewor14/demarcate-tests-1.4 and squashes the following commits: 4c3c566 [Andrew Or] Merge branch 'branch-1.4' of github.com:apache/spark into demarcate-tests-1.4 e217b78 [Andrew Or] [SPARK-7558] Guard against direct uses of FunSuite / FunSuiteLike 46d4361 [Andrew Or] Various whitespace changes (minor) 3d9bf04 [Andrew Or] Make all test suites extend SparkFunSuite instead of FunSuite eaa520e [Andrew Or] Fix tests? b4d93de [Andrew Or] Fix tests 634a777 [Andrew Or] Fix log message a932e8d [Andrew Or] Fix manual things that cannot be covered through automation 8bc355d [Andrew Or] Add core tests as dependencies in all modules 75d361f [Andrew Or] Introduce base abstract class for all test suites	2015-06-03 20:46:44 -07:00
Reynold Xin	1f90a06bda	[SPARK-8074] Parquet should throw AnalysisException during setup for data type/name related failures. Author: Reynold Xin <rxin@databricks.com> Closes #6608 from rxin/parquet-analysis and squashes the following commits: b5dc8e2 [Reynold Xin] Code review feedback. 5617cf6 [Reynold Xin] [SPARK-8074] Parquet should throw AnalysisException during setup for data type/name related failures. (cherry picked from commit `939e4f3d8d`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-03 13:58:15 -07:00
animesh	0a1dad6cd4	[SPARK-7980] [SQL] Support SQLContext.range(end) 1. range() overloaded in SQLContext.scala 2. range() modified in python sql context.py 3. Tests added accordingly in DataFrameSuite.scala and python sql tests.py Author: animesh <animesh@apache.spark> Closes #6609 from animeshbaranawal/SPARK-7980 and squashes the following commits: 935899c [animesh] SPARK-7980:python+scala changes (cherry picked from commit `d053a31be9`) Signed-off-by: Reynold Xin <rxin@databricks.com>	2015-06-03 11:28:38 -07:00
Patrick Wendell	ab713af564	Preparing development version 1.4.0-SNAPSHOT	2015-06-02 18:06:41 -07:00
Patrick Wendell	22596c534a	Preparing Spark release v1.4.0-rc4	2015-06-02 18:06:35 -07:00
Patrick Wendell	e3c35b217c	Preparing development version 1.4.0-SNAPSHOT	2015-06-02 17:01:15 -07:00
Patrick Wendell	a14fad11ef	Preparing Spark release v1.4.0-rc4	2015-06-02 17:01:10 -07:00
Patrick Wendell	92ccc5ba39	Preparing development version 1.4.0-SNAPSHOT	2015-06-02 14:02:19 -07:00
Patrick Wendell	d630f4d697	Preparing Spark release v1.4.0-rc4	2015-06-02 14:02:14 -07:00
Cheng Lian	cbaf595447	[SPARK-8014] [SQL] Avoid premature metadata discovery when writing a HadoopFsRelation with a save mode other than Append The current code references the schema of the DataFrame to be written before checking save mode. This triggers expensive metadata discovery prematurely. For save mode other than `Append`, this metadata discovery is useless since we either ignore the result (for `Ignore` and `ErrorIfExists`) or delete existing files (for `Overwrite`) later. This PR fixes this issue by deferring metadata discovery after save mode checking. Author: Cheng Lian <lian@databricks.com> Closes #6583 from liancheng/spark-8014 and squashes the following commits: 1aafabd [Cheng Lian] Updates comments 088abaa [Cheng Lian] Avoids schema merging and partition discovery when data schema and partition schema are defined 8fbd93f [Cheng Lian] Fixes SPARK-8014 (cherry picked from commit `686a45f0b9`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-06-02 13:32:34 -07:00
Cheng Lian	f71a09de6e	[SPARK-8037] [SQL] Ignores files whose name starts with dot in HadoopFsRelation Author: Cheng Lian <lian@databricks.com> Closes #6581 from liancheng/spark-8037 and squashes the following commits: d08e97b [Cheng Lian] Ignores files whose name starts with dot in HadoopFsRelation (cherry picked from commit `1bb5d716c0`) Signed-off-by: Cheng Lian <lian@databricks.com>	2015-06-03 01:09:19 +08:00
Patrick Wendell	92a677891c	Preparing development version 1.4.0-SNAPSHOT	2015-06-02 08:41:15 -07:00
Patrick Wendell	48c506724a	Preparing Spark release v1.4.0-rc4	2015-06-02 08:41:10 -07:00
Yin Huai	87941ff8c4	[SPARK-8023][SQL] Add "deterministic" attribute to Expression to avoid collapsing nondeterministic projects. This closes #6570. Author: Yin Huai <yhuai@databricks.com> Author: Reynold Xin <rxin@databricks.com> Closes #6573 from rxin/deterministic and squashes the following commits: 356cd22 [Reynold Xin] Added unit test for the optimizer. da3fde1 [Reynold Xin] Merge pull request #6570 from yhuai/SPARK-8023 da56200 [Yin Huai] Comments. e38f264 [Yin Huai] Comment. f9d6a73 [Yin Huai] Add a deterministic method to Expression. (cherry picked from commit `0f80990bfa`) Signed-off-by: Reynold Xin <rxin@databricks.com> Conflicts: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/random.scala	2015-06-02 00:21:27 -07:00
Yin Huai	4940630f56	[SPARK-8020] [SQL] Spark SQL conf in spark-defaults.conf make metadataHive get constructed too early https://issues.apache.org/jira/browse/SPARK-8020 Author: Yin Huai <yhuai@databricks.com> Closes #6571 from yhuai/SPARK-8020-1 and squashes the following commits: 0398f5b [Yin Huai] First populate the SQLConf and then construct executionHive and metadataHive. (cherry picked from commit `7b7f7b6c6f`) Signed-off-by: Yin Huai <yhuai@databricks.com>	2015-06-02 00:17:09 -07:00

1 2 3 4 5 ...

874 commits