spark-instrumented-optimizer

History

Davies Liu 5d8f7b9e87 [SPARK-6677] [SQL] [PySpark] fix cached classes It's possible to have two DataType object with same id (memory address) at different time, we should check the cached classes to verify that it's generated by given datatype. This PR also change `__FIELDS__` and `__DATATYPE__` to lower case to match Python code style. Author: Davies Liu <davies@databricks.com> Closes #5445 from davies/fix_type_cache and squashes the following commits: 63b3238 [Davies Liu] typo 47bdede [Davies Liu] fix cached classes		2015-04-11 22:33:23 -07:00
..
ml	[SPARK-6781] [SQL] use sqlContext in python shell	2015-04-08 13:31:45 -07:00
mllib	[SPARK-6577] [MLlib] [PySpark] SparseMatrix should be supported in PySpark	2015-04-09 23:10:13 -07:00
sql	[SPARK-6677] [SQL] [PySpark] fix cached classes	2015-04-11 22:33:23 -07:00
streaming	[SPARK-6211][Streaming] Add Python Kafka API unit test	2015-04-09 23:14:24 -07:00
__init__.py	[SPARK-4172] [PySpark] Progress API in Python	2015-02-17 13:36:43 -08:00
accumulators.py	[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible	2015-01-28 13:48:06 -08:00
broadcast.py	[SPARK-4548] []SPARK-4517] improve performance of python broadcast	2014-11-24 17:17:03 -08:00
cloudpickle.py	[SPARK-3679] [PySpark] pickle the exact globals of functions	2014-09-24 13:00:05 -07:00
conf.py	[SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs	2014-10-07 18:09:27 -07:00
context.py	[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()	2015-03-09 16:24:06 -07:00
daemon.py	[SPARK-6294] fix hang when call take() in JVM on PythonRDD	2015-03-12 01:34:38 -07:00
files.py	[SPARK-3309] [PySpark] Put all public API in __all__	2014-09-03 11:49:45 -07:00
heapq3.py	[SPARK-3073] [PySpark] use external sort in sortBy() and sortByKey()	2014-08-26 16:57:40 -07:00
java_gateway.py	[SPARK-6506] [pyspark] Do not try to retrieve SPARK_HOME when not needed...	2015-04-08 10:14:52 -07:00
join.py	[SPARK-3074] [PySpark] support groupByKey() with single huge key	2015-04-09 17:07:23 -07:00
profiler.py	[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible	2015-01-28 13:48:06 -08:00
rdd.py	[SPARK-6216] [PySpark] check the python version in worker	2015-04-10 14:04:53 -07:00
rddsampler.py	[SPARK-4477] [PySpark] remove numpy from RDDSampler	2014-11-20 16:40:25 -08:00
resultiterable.py	[SPARK-3074] [PySpark] support groupByKey() with single huge key	2015-04-09 17:07:23 -07:00
serializers.py	[SPARK-3074] [PySpark] support groupByKey() with single huge key	2015-04-09 17:07:23 -07:00
shell.py	[SPARK-6781] [SQL] use sqlContext in python shell	2015-04-08 13:31:45 -07:00
shuffle.py	[SPARK-3074] [PySpark] support groupByKey() with single huge key	2015-04-09 17:07:23 -07:00
statcounter.py	StatCounter on NumPy arrays [PYSPARK][SPARK-2012]	2014-08-01 22:33:25 -07:00
status.py	[SPARK-4172] [PySpark] Progress API in Python	2015-02-17 13:36:43 -08:00
storagelevel.py	[SPARK-3417] Use new-style classes in PySpark	2014-09-08 15:45:36 -07:00
tests.py	[SPARK-6216] [PySpark] check the python version in worker	2015-04-10 14:04:53 -07:00
traceback_utils.py	[SPARK-1087] Move python traceback utilities into new traceback_utils.py file.	2014-09-15 19:28:17 -07:00
worker.py	[SPARK-6216] [PySpark] check the python version in worker	2015-04-10 14:04:53 -07:00