spark-instrumented-optimizer

History

Wenchen Fan da7bbb9435 [SPARK-8104] [SQL] auto alias expressions in analyzer Currently we auto alias expression in parser. However, during parser phase we don't have enough information to do the right alias. For example, Generator that has more than 1 kind of element need MultiAlias, ExtractValue don't need Alias if it's in middle of a ExtractValue chain. Author: Wenchen Fan <cloud0fan@outlook.com> Closes #6647 from cloud-fan/alias and squashes the following commits: 552eba4 [Wenchen Fan] fix python 5b5786d [Wenchen Fan] fix agg 73a90cb [Wenchen Fan] fix case-preserve of ExtractValue 4cfd23c [Wenchen Fan] fix order by d18f401 [Wenchen Fan] refine 9f07359 [Wenchen Fan] address comments 39c1aef [Wenchen Fan] small fix 33640ec [Wenchen Fan] auto alias expressions in analyzer		2015-06-22 12:13:00 -07:00
..
ml	[SPARK-8468] [ML] Take the negative of some metrics in RegressionEvaluator to get correct cross validation	2015-06-20 13:01:59 -07:00
mllib	[SPARK-8511] [PYSPARK] Modify a test to remove a saved model in `regression.py`	2015-06-22 11:53:11 -07:00
sql	[SPARK-8104] [SQL] auto alias expressions in analyzer	2015-06-22 12:13:00 -07:00
streaming	[SPARK-8444] [STREAMING] Adding Python streaming example for queueStream	2015-06-19 00:07:53 -07:00
__init__.py	[SPARK-4172] [PySpark] Progress API in Python	2015-02-17 13:36:43 -08:00
accumulators.py	[SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict	2015-05-29 14:13:44 -07:00
broadcast.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
cloudpickle.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
conf.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
context.py	[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling sum on an empty RDD	2015-06-17 13:59:39 -07:00
daemon.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
files.py	[SPARK-3309] [PySpark] Put all public API in __all__	2014-09-03 11:49:45 -07:00
heapq3.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
java_gateway.py	[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression	2015-04-21 00:08:18 -07:00
join.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
profiler.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
rdd.py	[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling sum on an empty RDD	2015-06-17 13:59:39 -07:00
rddsampler.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
resultiterable.py	[SPARK-3074] [PySpark] support groupByKey() with single huge key	2015-04-09 17:07:23 -07:00
serializers.py	[SPARK-8339] [PYSPARK] integer division for python 3	2015-06-19 00:12:20 -07:00
shell.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
shuffle.py	[SPARK-8202] [PYSPARK] fix infinite loop during external sort in PySpark	2015-06-18 13:45:58 -07:00
statcounter.py	[SPARK-4897] [PySpark] Python 3 support	2015-04-16 16:20:57 -07:00
status.py	[SPARK-4172] [PySpark] Progress API in Python	2015-02-17 13:36:43 -08:00
storagelevel.py	[SPARK-3417] Use new-style classes in PySpark	2014-09-08 15:45:36 -07:00
tests.py	[SPARK-8202] [PYSPARK] fix infinite loop during external sort in PySpark	2015-06-18 13:45:58 -07:00
traceback_utils.py	[SPARK-1087] Move python traceback utilities into new traceback_utils.py file.	2014-09-15 19:28:17 -07:00
worker.py	[SPARK-6216] [PYSPARK] check python version of worker with driver	2015-05-18 12:55:13 -07:00