..
mllib
[SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
2015-01-21 10:32:10 -08:00
streaming
[SPARK-3325][Streaming] Add a parameter to the method print in class DStream
2015-01-02 15:09:41 -08:00
__init__.py
[SPARK-4348] [PySpark] [MLlib] rename random.py to rand.py
2014-11-13 10:24:54 -08:00
accumulators.py
[SPARK-3478] [PySpark] Profile the Python tasks
2014-09-30 18:24:57 -07:00
broadcast.py
[SPARK-4548] []SPARK-4517] improve performance of python broadcast
2014-11-24 17:17:03 -08:00
cloudpickle.py
[SPARK-3679] [PySpark] pickle the exact globals of functions
2014-09-24 13:00:05 -07:00
conf.py
[SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs
2014-10-07 18:09:27 -07:00
context.py
[SPARK-5063] More helpful error messages for several invalid operations
2015-01-23 17:53:15 -08:00
daemon.py
[SPARK-4088] [PySpark] Python worker should exit after socket is closed by JVM
2014-10-25 01:20:39 -07:00
files.py
[SPARK-3309] [PySpark] Put all public API in __all__
2014-09-03 11:49:45 -07:00
heapq3.py
[SPARK-3073] [PySpark] use external sort in sortBy() and sortByKey()
2014-08-26 16:57:40 -07:00
java_gateway.py
[SPARK-5097][SQL] DataFrame
2015-01-27 16:08:24 -08:00
join.py
[SPARK-546] Add full outer join to RDD and DStream.
2014-09-24 20:39:09 -07:00
rdd.py
[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
2015-01-28 12:47:12 -08:00
rddsampler.py
[SPARK-4477] [PySpark] remove numpy from RDDSampler
2014-11-20 16:40:25 -08:00
resultiterable.py
[SPARK-2627] [PySpark] have the build enforce PEP 8 automatically
2014-08-06 12:58:24 -07:00
serializers.py
[SPARK-5224] [PySpark] improve performance of parallelize list/ndarray
2015-01-15 11:40:41 -08:00
shell.py
[SPARK-3273][SPARK-3301]We should read the version information from the same place
2014-09-06 15:08:43 -07:00
shuffle.py
[SPARK-4384] [PySpark] improve sort spilling
2014-11-19 15:45:37 -08:00
sql.py
[SPARK-5097][SQL] DataFrame
2015-01-27 16:08:24 -08:00
statcounter.py
StatCounter on NumPy arrays [PYSPARK][SPARK-2012]
2014-08-01 22:33:25 -07:00
storagelevel.py
[SPARK-3417] Use new-style classes in PySpark
2014-09-08 15:45:36 -07:00
tests.py
[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly
2015-01-28 11:08:44 -08:00
traceback_utils.py
[SPARK-1087] Move python traceback utilities into new traceback_utils.py file.
2014-09-15 19:28:17 -07:00
worker.py
[SPARK-4548] []SPARK-4517] improve performance of python broadcast
2014-11-24 17:17:03 -08:00