spark-instrumented-optimizer

History

Wenchen Fan 962e9bcf94 [SPARK-12756][SQL] use hash expression in Exchange This PR makes bucketing and exchange share one common hash algorithm, so that we can guarantee the data distribution is same between shuffle and bucketed data source, which enables us to only shuffle one side when join a bucketed table and a normal one. This PR also fixes the tests that are broken by the new hash behaviour in shuffle. Author: Wenchen Fan <wenchen@databricks.com> Closes #10703 from cloud-fan/use-hash-expr-in-shuffle.		2016-01-13 22:43:28 -08:00
..
__init__.py	[SPARK-12600][SQL] Remove deprecated methods in Spark SQL	2016-01-04 18:02:38 -08:00
column.py	[SPARK-12791][SQL] Simplify CaseWhen by breaking "branches" into "conditions" and "values"	2016-01-13 12:44:35 -08:00
context.py	[SPARK-12600][SQL] Remove deprecated methods in Spark SQL	2016-01-04 18:02:38 -08:00
dataframe.py	[SPARK-12756][SQL] use hash expression in Exchange	2016-01-13 22:43:28 -08:00
functions.py	[SPARK-12642][SQL] improve the hash expression to be decoupled from unsafe row	2016-01-13 12:29:02 -08:00
group.py	[SPARK-12756][SQL] use hash expression in Exchange	2016-01-13 22:43:28 -08:00
readwriter.py	[SPARK-12600][SQL] Remove deprecated methods in Spark SQL	2016-01-04 18:02:38 -08:00
tests.py	[SPARK-12600][SQL] Remove deprecated methods in Spark SQL	2016-01-04 18:02:38 -08:00
types.py	[SPARK-11158][SQL] Modified _verify_type() to be more informative on Errors by presenting the Object	2015-10-18 11:39:19 -07:00
utils.py	[SPARK-11804] [PYSPARK] Exception raise when using Jdbc predicates opt…	2015-11-18 08:18:54 -08:00
window.py	[SPARK-10373] [PYSPARK] move @since into pyspark from sql	2015-09-08 20:56:22 -07:00