spark-instrumented-optimizer/python/pyspark/ml
Brooke Wenig 340c8b8387 [MINOR][DOC] Updated PySpark Binarizer docstring to match Scala's.
## What changes were proposed in this pull request?

PySpark's Binarizer docstring had two issues:
1) The values did not need to be in the range [0, 1].
2) It can be used for binary classification prediction.

This change corrects both of these issues by making it consistent with Scala's docstring for Binarizer.

## How was this patch tested?

Not applicable because I only changed the docstring. But if I need to do any testing, let me know and I'll do it.

Please review http://spark.apache.org/contributing.html before opening a pull request.

Closes #23934 from brookewenig/binarizer-docs-fix.

Authored-by: Brooke Wenig <brookewenig@gmail.com>
Signed-off-by: Sean Owen <sean.owen@databricks.com>
2019-03-06 19:42:41 -06:00
..
linalg [SPARK-26638][PYSPARK][ML] Pyspark vector classes always return error for unary negation 2019-01-17 14:24:21 -06:00
param [SPARK-24333][ML][PYTHON] Add fit with validation set to spark.ml GBT: Python API 2018-12-07 13:53:35 -08:00
tests [SPARK-27007][PYTHON] add rawPrediction to OneVsRest in PySpark 2019-03-02 09:09:28 -06:00
__init__.py [SPARK-24477][SPARK-24454][ML][PYTHON] Imports submodule in ml/__init__.py and add ImageSchema into __all__ 2018-06-08 09:32:11 -07:00
base.py [SPARK-22922][ML][PYSPARK] Pyspark portion of the fit-multiple API 2017-12-29 16:31:25 -08:00
classification.py [SPARK-27007][PYTHON] add rawPrediction to OneVsRest in PySpark 2019-03-02 09:09:28 -06:00
clustering.py [SPARK-26754][PYTHON] Add hasTrainingSummary to replace duplicate code in PySpark 2019-02-01 17:29:58 -06:00
common.py [SPARK-17679] [PYSPARK] remove unnecessary Py4J ListConverter patch 2016-10-03 14:12:03 -07:00
evaluation.py [SPARK-24103][ML][MLLIB] ML Evaluators should use weight column - added weight column for binary classification evaluator 2019-02-25 17:16:51 -06:00
feature.py [MINOR][DOC] Updated PySpark Binarizer docstring to match Scala's. 2019-03-06 19:42:41 -06:00
fpm.py [SPARK-26640][CORE][ML][SQL][STREAMING][PYSPARK] Code cleanup from lgtm.com analysis 2019-01-17 19:40:39 -06:00
image.py [SPARK-26559][ML][PYSPARK] ML image can't work with numpy versions prior to 1.9 2019-01-07 18:36:52 +08:00
pipeline.py [SPARK-17025][ML][PYTHON] Persistence for Pipelines with Python-only Stages 2017-08-11 23:57:08 -07:00
recommendation.py [MINOR] Update all DOI links to preferred resolver 2018-11-25 17:43:55 -06:00
regression.py [SPARK-19591][ML][PYSPARK][FOLLOWUP] Add sample weights to decision trees 2019-02-27 21:11:30 -06:00
stat.py [MINOR] Fix typos and misspellings 2018-11-05 17:34:23 -06:00
tuning.py [SPARK-25090][ML] Enforce implicit type coercion in ParamGridBuilder 2018-08-13 09:11:37 +08:00
util.py [SPARK-26754][PYTHON] Add hasTrainingSummary to replace duplicate code in PySpark 2019-02-01 17:29:58 -06:00
wrapper.py [SPARK-22798][PYTHON][ML] Add multiple column support to PySpark StringIndexer 2019-02-20 08:52:46 -06:00