spark-instrumented-optimizer/python/pyspark/ml
Xiangrui Meng 11a5f32116 [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4
Some changes to the pipeilne APIs:

1. Estimator/Transformer/ doesn’t need to extend Params since PipelineStage already does.
1. Move Evaluator to ml.evaluation.
1. Mention larger metric values are better.
1. PipelineModel doc. “compiled” -> “fitted”
1. Hide object PolynomialExpansion.
1. Hide object VectorAssembler.
1. Word2Vec.minCount (and other) -> group param
1. ParamValidators -> DeveloperApi
1. Hide MetadataUtils/SchemaUtils.

jkbradley

Author: Xiangrui Meng <meng@databricks.com>

Closes #6322 from mengxr/SPARK-7535.0 and squashes the following commits:

9e9c7da [Xiangrui Meng] move JavaEvaluator to ml.evaluation as well
e179480 [Xiangrui Meng] move Evaluation to ml.evaluation in PySpark
08ef61f [Xiangrui Meng] update pipieline APIs

(cherry picked from commit 8f11c6116b)
Signed-off-by: Xiangrui Meng <meng@databricks.com>
2015-05-21 22:57:43 -07:00
..
param [SPARK-7762] [MLLIB] set default value for outputCol 2015-05-20 17:26:44 -07:00
__init__.py [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 2015-05-21 22:57:43 -07:00
classification.py [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 is quite funny but not very random 2015-05-20 15:16:27 -07:00
evaluation.py [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 2015-05-21 22:57:43 -07:00
feature.py [SPARK-7794] [MLLIB] update RegexTokenizer default settings 2015-05-21 17:59:13 -07:00
pipeline.py [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 2015-05-21 22:57:43 -07:00
recommendation.py [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 is quite funny but not very random 2015-05-20 15:16:27 -07:00
regression.py [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 is quite funny but not very random 2015-05-20 15:16:27 -07:00
tests.py [SPARK-7511] [MLLIB] pyspark ml seed param should be random by default or 42 is quite funny but not very random 2015-05-20 15:16:27 -07:00
tuning.py [SPARK-7380] [MLLIB] pipeline stages should be copyable in Python 2015-05-18 12:02:26 -07:00
util.py [SPARK-7380] [MLLIB] pipeline stages should be copyable in Python 2015-05-18 12:02:26 -07:00
wrapper.py [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 2015-05-21 22:57:43 -07:00