spark-instrumented-optimizer/python/pyspark
Xiangrui Meng 3188553f73 [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark
Make loading/saving labeled data easier for pyspark users.

Also changed the type check in `SparseVector` to allow NumPy integer types.

Author: Xiangrui Meng <meng@databricks.com>

Closes #672 from mengxr/pyspark-mllib-util and squashes the following commits:

2943fa7 [Xiangrui Meng] format docs
d61668d [Xiangrui Meng] add loadLibSVMFile and saveAsLibSVMFile to pyspark
2014-05-07 16:01:11 -07:00
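For context on what `loadLibSVMFile`/`saveAsLibSVMFile` handle, each LibSVM record is a line of the form `<label> <index>:<value> ...` with 1-based feature indices. Below is a minimal pure-Python sketch of that record format, independent of Spark; `parse_libsvm_line` and `to_libsvm_line` are illustrative names, not the PySpark API (in PySpark the actual entry points added by this commit live on `MLUtils` in `pyspark.mllib.util`).

```python
def parse_libsvm_line(line):
    """Parse one LibSVM-format line: '<label> <index>:<value> ...'.

    Indices in the file are 1-based; they are converted to 0-based here,
    matching the usual sparse-vector convention."""
    items = line.split()
    label = float(items[0])
    indices = []
    values = []
    for item in items[1:]:
        idx, val = item.split(":")
        indices.append(int(idx) - 1)
        values.append(float(val))
    return label, indices, values


def to_libsvm_line(label, indices, values):
    """Serialize a labeled sparse point back to LibSVM format
    (indices are written 1-based, values in '%g' notation)."""
    feats = " ".join("%d:%g" % (i + 1, v) for i, v in zip(indices, values))
    return "%s %s" % (format(label, "g"), feats)
```

A round trip such as `parse_libsvm_line(to_libsvm_line(1.0, [0, 2], [2.5, 4.0]))` recovers the original `(label, indices, values)` triple, which is the invariant the load/save pair is expected to preserve.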
mllib [SPARK-1743][MLLIB] add loadLibSVMFile and saveAsLibSVMFile to pyspark 2014-05-07 16:01:11 -07:00
__init__.py SPARK-1004. PySpark on YARN 2014-04-29 23:24:34 -07:00
accumulators.py Add custom serializer support to PySpark. 2013-11-10 16:45:38 -08:00
broadcast.py Fix some Python docs and make sure to unset SPARK_TESTING in Python 2013-12-29 20:15:07 -05:00
cloudpickle.py Rename top-level 'pyspark' directory to 'python' 2013-01-01 15:05:00 -08:00
conf.py SPARK-1114: Allow PySpark to use existing JVM and Gateway 2014-02-20 21:20:39 -08:00
context.py SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptions 2014-05-07 09:48:31 -07:00
daemon.py SPARK-1579: Clean up PythonRDD and avoid swallowing IOExceptions 2014-05-07 09:48:31 -07:00
files.py Initial work to rename package to org.apache.spark 2013-09-01 14:13:13 -07:00
java_gateway.py [SPARK-1549] Add Python support to spark-submit 2014-05-06 15:12:35 -07:00
join.py Spark 1271: Co-Group and Group-By should pass Iterable[X] 2014-04-08 18:15:59 -07:00
rdd.py [SPARK-1674] fix interrupted system call error in pyspark's RDD.pipe 2014-04-29 18:06:45 -07:00
rddsampler.py SPARK-1438 RDD.sample() make seed param optional 2014-04-24 17:27:16 -07:00
resultiterable.py Spark 1271: Co-Group and Group-By should pass Iterable[X] 2014-04-08 18:15:59 -07:00
serializers.py SPARK-1421. Make MLlib work on Python 2.6 2014-04-05 20:52:05 -07:00
shell.py Fixed broken pyspark shell. 2014-04-18 10:10:13 -07:00
sql.py [SPARK-1460] Returning SchemaRDD instead of normal RDD on Set operations... 2014-05-07 09:41:31 -07:00
statcounter.py Spark 1246 add min max to stat counter 2014-03-18 00:45:47 -07:00
storagelevel.py SPARK-1305: Support persisting RDD's directly to Tachyon 2014-04-04 20:38:20 -07:00
tests.py [SPARK-1549] Add Python support to spark-submit 2014-05-06 15:12:35 -07:00
worker.py SPARK-1115: Catch depickling errors 2014-02-26 14:51:21 -08:00