spark-instrumented-optimizer

History

Davies Liu a95ad99e31 [SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row Fix the issue when applySchema() to an RDD of Row. Also add type mapping for BinaryType. Author: Davies Liu <davies.liu@gmail.com> Closes #2448 from davies/row and squashes the following commits: dd220cf [Davies Liu] fix test 3f3f188 [Davies Liu] add more test f559746 [Davies Liu] add tests, fix serialization 9688fd2 [Davies Liu] support applySchema to RDD of Row		2014-09-19 15:33:42 -07:00
..
mllib	[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib	2014-09-19 15:01:11 -07:00
__init__.py	[SPARK-3309] [PySpark] Put all public API in __all__	2014-09-03 11:49:45 -07:00
accumulators.py	[SPARK-3309] [PySpark] Put all public API in __all__	2014-09-03 11:49:45 -07:00
broadcast.py	[SPARK-3430] [PySpark] [Doc] generate PySpark API docs using Sphinx	2014-09-16 12:51:58 -07:00
cloudpickle.py	[SPARK-3094] [PySpark] compatitable with PyPy	2014-09-12 18:42:50 -07:00
conf.py	[SPARK-3309] [PySpark] Put all public API in __all__	2014-09-03 11:49:45 -07:00
context.py	[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib	2014-09-19 15:01:11 -07:00
daemon.py	[SPARK-3030] [PySpark] Reuse Python worker	2014-09-13 16:22:04 -07:00
files.py	[SPARK-3309] [PySpark] Put all public API in __all__	2014-09-03 11:49:45 -07:00
heapq3.py	[SPARK-3073] [PySpark] use external sort in sortBy() and sortByKey()	2014-08-26 16:57:40 -07:00
java_gateway.py	[SPARK-3167] Handle special driver configs in Windows	2014-08-26 22:52:16 -07:00
join.py	[SPARK-2470] PEP8 fixes to PySpark	2014-07-21 22:30:53 -07:00
rdd.py	[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib	2014-09-19 15:01:11 -07:00
rddsampler.py	[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically	2014-08-06 12:58:24 -07:00
resultiterable.py	[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically	2014-08-06 12:58:24 -07:00
serializers.py	[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib	2014-09-19 15:01:11 -07:00
shell.py	[SPARK-3273][SPARK-3301]We should read the version information from the same place	2014-09-06 15:08:43 -07:00
shuffle.py	[SPARK-3463] [PySpark] aggregate and show spilled bytes in Python	2014-09-13 22:31:21 -07:00
sql.py	[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row	2014-09-19 15:33:42 -07:00
statcounter.py	StatCounter on NumPy arrays [PYSPARK][SPARK-2012]	2014-08-01 22:33:25 -07:00
storagelevel.py	[SPARK-3417] Use new-style classes in PySpark	2014-09-08 15:45:36 -07:00
tests.py	[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row	2014-09-19 15:33:42 -07:00
traceback_utils.py	[SPARK-1087] Move python traceback utilities into new traceback_utils.py file.	2014-09-15 19:28:17 -07:00
worker.py	[SPARK-3554] [PySpark] use broadcast automatically for large closure	2014-09-18 18:11:48 -07:00