spark-instrumented-optimizer/python/pyspark/ml
Marco Gaido e836c27ce0 [SPARK-23217][ML][PYTHON] Add distanceMeasure param to ClusteringEvaluator Python API
## What changes were proposed in this pull request?

The PR adds the `distanceMeasure` param to ClusteringEvaluator in the Python API. This allows the user to specify `cosine` as distance measure in addition to the default `squaredEuclidean`.

## How was this patch tested?

added UT

Author: Marco Gaido <marcogaido91@gmail.com>

Closes #20627 from mgaido91/SPARK-23217_python.
2018-02-21 12:39:36 -06:00
..
linalg [SPARK-20214][ML] Make sure converted csc matrix has sorted indices 2017-04-05 17:46:44 -07:00
param Revert "[SPARK-22797][PYSPARK] Bucketizer support multi-column" 2018-01-26 23:48:02 +02:00
__init__.py [SPARK-21633][ML][PYTHON] UnaryTransformer in Python 2017-08-04 01:01:32 -07:00
base.py [SPARK-22922][ML][PYSPARK] Pyspark portion of the fit-multiple API 2017-12-29 16:31:25 -08:00
classification.py [SPARK-18608][ML][FOLLOWUP] Fix double caching for PySpark OneVsRest. 2017-09-14 14:09:44 +08:00
clustering.py [SPARK-23344][PYTHON][ML] Add distanceMeasure param to KMeans 2018-02-10 10:46:45 -06:00
common.py [SPARK-17679] [PYSPARK] remove unnecessary Py4J ListConverter patch 2016-10-03 14:12:03 -07:00
evaluation.py [SPARK-23217][ML][PYTHON] Add distanceMeasure param to ClusteringEvaluator Python API 2018-02-21 12:39:36 -06:00
feature.py [SPARK-23381][CORE] Murmur3 hash generates a different value from other implementations 2018-02-16 17:17:55 -08:00
fpm.py [SPARK-23163][DOC][PYTHON] Sync ML Python API with Scala 2018-01-25 01:48:11 -08:00
image.py [SPARK-23256][ML][PYTHON] Add columnSchema method to PySpark image reader 2018-02-04 17:53:31 +09:00
pipeline.py [SPARK-17025][ML][PYTHON] Persistence for Pipelines with Python-only Stages 2017-08-11 23:57:08 -07:00
recommendation.py [SPARK-20679][ML] Support recommending for a subset of users/items in ALSModel 2017-10-09 10:42:33 +02:00
regression.py [SPARK-22810][ML][PYSPARK] Expose Python API for LinearRegression with huber loss. 2017-12-20 17:51:42 -08:00
stat.py [SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.Correlation 2017-04-07 11:00:10 +02:00
tests.py [SPARK-23217][ML][PYTHON] Add distanceMeasure param to ClusteringEvaluator Python API 2018-02-21 12:39:36 -06:00
tuning.py [SPARK-22922][ML][PYSPARK] Pyspark portion of the fit-multiple API 2017-12-29 16:31:25 -08:00
util.py [SPARK-22313][PYTHON] Mark/print deprecation warnings as DeprecationWarning for deprecated APIs 2017-10-24 12:44:47 +09:00
wrapper.py [SPARK-10931][ML][PYSPARK] PySpark Models Copy Param Values from Estimator 2017-08-22 17:40:50 -07:00