spark-instrumented-optimizer/python/pyspark/ml
Xiangrui Meng 68f9957149 [SPARK-9918] [MLLIB] remove runs from k-means and rename epsilon to tol
This requires some discussion. I'm not sure whether `runs` is a useful parameter. It certainly complicates the implementation. We might want to optimize the k-means implementation with block matrix operations. In this case, having `runs` may not be worth the trade-off. Also it increases the communication cost in a single job, which might cause other issues.

This PR also renames `epsilon` to `tol` to have consistent naming among algorithms. The Python constructor is updated to include all parameters.

jkbradley yu-iskw

Author: Xiangrui Meng <meng@databricks.com>

Closes #8148 from mengxr/SPARK-9918 and squashes the following commits:

149b9e5 [Xiangrui Meng] fix constructor in Python and rename epsilon to tol
3cc15b3 [Xiangrui Meng] fix test and change initStep to initSteps in python
a0a0274 [Xiangrui Meng] remove runs from k-means in the pipeline API
2015-08-12 23:04:59 -07:00
..
param [SPARK-8706] [PYSPARK] [PROJECT INFRA] Add pylint checks to PySpark 2015-07-15 08:25:53 -07:00
__init__.py [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 2015-05-21 22:57:33 -07:00
classification.py [SPARK-9789] [ML] Added logreg threshold param back 2015-08-12 14:27:13 -07:00
clustering.py [SPARK-9918] [MLLIB] remove runs from k-means and rename epsilon to tol 2015-08-12 23:04:59 -07:00
evaluation.py [SPARK-9766] [ML] [PySpark] check and add miss docs for PySpark ML 2015-08-12 13:24:18 -07:00
feature.py [SPARK-9766] [ML] [PySpark] check and add miss docs for PySpark ML 2015-08-12 13:24:18 -07:00
pipeline.py [SPARK-8679] [PYSPARK] [MLLIB] Default values in Pipeline API should be immutable 2015-06-30 10:27:29 -07:00
recommendation.py [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel 2015-05-28 22:38:38 -07:00
regression.py [SPARK-8711] [ML] Add additional methods to PySpark ML tree models 2015-07-07 08:58:08 -07:00
tests.py [SPARK-8456] [ML] Ngram featurizer python 2015-06-29 18:40:30 -07:00
tuning.py [SPARK-7432] [MLLIB] fix flaky CrossValidator doctest 2015-06-02 08:51:00 -07:00
util.py [SPARK-7380] [MLLIB] pipeline stages should be copyable in Python 2015-05-18 12:02:18 -07:00
wrapper.py [SPARK-8679] [PYSPARK] [MLLIB] Default values in Pipeline API should be immutable 2015-06-30 10:27:29 -07:00