spark-instrumented-optimizer/python/pyspark/ml
Yong Gang Cao ef1047fca7 [SPARK-12153][SPARK-7617][MLLIB] add support of arbitrary length sentence and other tuning for Word2Vec
add support of arbitrary length sentence by using the nature representation of sentences in the input.

add new similarity functions and add normalization option for distances in synonym finding
add new accessor for internal structure(the vocabulary and wordindex) for convenience

need instructions about how to set value for the Since annotation for newly added public functions. 1.5.3?

jira link: https://issues.apache.org/jira/browse/SPARK-12153

Author: Yong Gang Cao <ygcao@amazon.com>
Author: Yong-Gang Cao <ygcao@users.noreply.github.com>

Closes #10152 from ygcao/improvementForSentenceBoundary.
2016-02-22 09:47:36 +00:00
..
param [SPARK-13047][PYSPARK][ML] Pyspark Params.hasParam should not throw an error 2016-02-11 16:42:44 -08:00
__init__.py [SPARK-7535] [.0] [MLLIB] Audit the pipeline APIs for 1.4 2015-05-21 22:57:33 -07:00
classification.py [SPARK-10509][PYSPARK] Reduce excessive param boiler plate code 2016-01-26 15:53:48 -08:00
clustering.py [SPARK-13302][PYSPARK][TESTS] Move the temp file creation and cleanup outside of the doctests 2016-02-20 09:07:19 +00:00
evaluation.py [SPARK-10509][PYSPARK] Reduce excessive param boiler plate code 2016-01-26 15:53:48 -08:00
feature.py [SPARK-12153][SPARK-7617][MLLIB] add support of arbitrary length sentence and other tuning for Word2Vec 2016-02-22 09:47:36 +00:00
pipeline.py [SPARK-10509][PYSPARK] Reduce excessive param boiler plate code 2016-01-26 15:53:48 -08:00
recommendation.py [SPARK-13302][PYSPARK][TESTS] Move the temp file creation and cleanup outside of the doctests 2016-02-20 09:07:19 +00:00
regression.py [SPARK-13302][PYSPARK][TESTS] Move the temp file creation and cleanup outside of the doctests 2016-02-20 09:07:19 +00:00
tests.py [SPARK-13047][PYSPARK][ML] Pyspark Params.hasParam should not throw an error 2016-02-11 16:42:44 -08:00
tuning.py [SPARK-10509][PYSPARK] Reduce excessive param boiler plate code 2016-01-26 15:53:48 -08:00
util.py [SPARK-13032][ML][PYSPARK] PySpark support model export/import and take LinearRegression as example 2016-01-29 09:22:24 -08:00
wrapper.py [SPARK-13153][PYSPARK] ML persistence failed when handle no default value parameter 2016-02-11 18:38:49 -08:00