spark-instrumented-optimizer/python/pyspark/mllib
nate.crosswhite 7450a992b3 [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
This implements the functionality for SPARK-4749 and provides units tests in Scala and PySpark

Author: nate.crosswhite <nate.crosswhite@stresearch.com>
Author: nxwhite-str <nxwhite-str@users.noreply.github.com>
Author: Xiangrui Meng <meng@databricks.com>

Closes #3610 from nxwhite-str/master and squashes the following commits:

a2ebbd3 [nxwhite-str] Merge pull request #1 from mengxr/SPARK-4749-kmeans-seed
7668124 [Xiangrui Meng] minor updates
f8d5928 [nate.crosswhite] Addressing PR issues
277d367 [nate.crosswhite] Merge remote-tracking branch 'upstream/master'
9156a57 [nate.crosswhite] Merge remote-tracking branch 'upstream/master'
5d087b4 [nate.crosswhite] Adding KMeans train with seed and Scala unit test
616d111 [nate.crosswhite] Merge remote-tracking branch 'upstream/master'
35c1884 [nate.crosswhite] Add kmeans initial seed to pyspark API
2015-01-21 10:32:10 -08:00
..
__init__.py [SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc 2014-12-17 14:12:46 -08:00
classification.py [SPARK-4822] Use sphinx tags for Python doc annotations 2014-12-17 17:31:24 -08:00
clustering.py [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed 2015-01-21 10:32:10 -08:00
common.py [SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib 2015-01-13 12:50:31 -08:00
feature.py [SPARK-4822] Use sphinx tags for Python doc annotations 2014-12-17 17:31:24 -08:00
linalg.py [SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing 2015-01-14 11:03:11 -08:00
rand.py [SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P... 2015-01-08 15:03:43 -08:00
recommendation.py [SPARK-4531] [MLlib] cache serialized java object 2014-11-21 15:02:31 -08:00
regression.py [SPARK-4531] [MLlib] cache serialized java object 2014-11-21 15:02:31 -08:00
stat.py [SPARK-4822] Use sphinx tags for Python doc annotations 2014-12-17 17:31:24 -08:00
tests.py [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed 2015-01-21 10:32:10 -08:00
tree.py [SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + DecisionTree API fix 2014-12-04 09:57:50 +08:00
util.py [SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib API 2014-11-10 22:26:16 -08:00