spark-instrumented-optimizer/python/pyspark/mllib
Sean Owen e07baf1412 [SPARK-17001][ML] Enable standardScaler to standardize sparse vectors when withMean=True
## What changes were proposed in this pull request?

Allow centering / mean scaling of sparse vectors in StandardScaler, if requested. This is for compatibility with `VectorAssembler` in common usages.

## How was this patch tested?

Jenkins tests, including new caes to reflect the new behavior.

Author: Sean Owen <sowen@cloudera.com>

Closes #14663 from srowen/SPARK-17001.
2016-08-27 08:48:56 +01:00
..
linalg [SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit for ML 2016-07-13 12:33:39 -07:00
stat [SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit for ML 2016-07-13 12:33:39 -07:00
__init__.py [SPARK-14817][ML][MLLIB][DOC] Made DataFrame-based API primary in MLlib guide 2016-07-15 13:38:23 -07:00
classification.py [SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit for ML 2016-07-13 12:33:39 -07:00
clustering.py [SPARK-16961][CORE] Fixed off-by-one error that biased randomizeInPlace 2016-08-19 10:11:59 +01:00
common.py [SPARK-16348][ML][MLLIB][PYTHON] Use full classpaths for pyspark ML JVM calls 2016-07-05 17:00:24 -07:00
evaluation.py [SPARK-15823][PYSPARK][ML] Add @property for 'accuracy' in MulticlassMetrics 2016-06-10 10:09:19 +01:00
feature.py [SPARK-17001][ML] Enable standardScaler to standardize sparse vectors when withMean=True 2016-08-27 08:48:56 +01:00
fpm.py [SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit for ML 2016-07-13 12:33:39 -07:00
random.py [SPARK-15464][ML][MLLIB][SQL][TESTS] Replace SQLContext and SparkContext with SparkSession using builder pattern in python test code 2016-05-23 18:14:48 -07:00
recommendation.py [SPARK-16348][ML][MLLIB][PYTHON] Use full classpaths for pyspark ML JVM calls 2016-07-05 17:00:24 -07:00
regression.py [MINOR] Fix Typos 'a -> an' 2016-05-26 22:39:14 -07:00
tests.py [SPARK-16961][CORE] Fixed off-by-one error that biased randomizeInPlace 2016-08-19 10:11:59 +01:00
tree.py [SPARK-14812][ML][MLLIB][PYTHON] Experimental, DeveloperApi annotation audit for ML 2016-07-13 12:33:39 -07:00
util.py [SPARK-16242][MLLIB][PYSPARK] Conversion between old/new matrix columns in a DataFrame (Python) 2016-06-28 06:28:22 -07:00