spark-instrumented-optimizer/python/pyspark/mllib
Yu ISHIKAWA 4692769655 [SPARK-6259] [MLLIB] Python API for LDA
I implemented the Python API for LDA, but I did not implement a method for `LDAModel.describeTopics()` because it is a little hard to implement right now. Adding documentation and example code would fit better in a separate issue.

TODO: `LDAModel.describeTopics()` must also be implemented in Python, but that would fit better in a separate issue. Implementing it is a little hard because the return value of `describeTopics` in Scala consists of Tuple classes.
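
To illustrate the difficulty noted in the TODO: in Scala, `describeTopics` returns one `(term indices, term weights)` tuple per topic, and tuples of arrays do not map directly onto the types the Python bridge handles. One possible approach, shown here only as a sketch and not as the code from this commit, is to flatten each tuple into plain nested lists on one side and rebuild the pairs on the other. The helper names `flatten_topics` and `restore_topics` and the sample data are hypothetical.

```python
# Hypothetical illustration only; none of this is part of the commit.
# It mimics the shape of Scala's describeTopics result: one
# (term indices, term weights) tuple per topic.


def flatten_topics(topics):
    """Turn (indices, weights) tuples into plain nested lists."""
    return [[list(indices), list(weights)] for indices, weights in topics]


def restore_topics(flat):
    """Rebuild (indices, weights) pairs from the nested-list form."""
    return [(list(indices), list(weights)) for indices, weights in flat]


# Made-up sample standing in for a result with two topics, three terms each.
sample = [
    ((0, 2, 1), (0.5, 0.3, 0.2)),
    ((1, 0, 2), (0.6, 0.25, 0.15)),
]

flat = flatten_topics(sample)
restored = restore_topics(flat)
```

The nested-list form contains only lists, integers, and floats, which is the kind of structure that is straightforward to pass between the JVM and Python; whether the eventual implementation takes this route is an assumption.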

Author: Yu ISHIKAWA <yuu.ishikawa@gmail.com>

Closes #6791 from yu-iskw/SPARK-6259 and squashes the following commits:

6855f59 [Yu ISHIKAWA] LDA inherits object
28bd165 [Yu ISHIKAWA] Change the place of testing code
d7a332a [Yu ISHIKAWA] Remove the doc comment about the optimizer's default value
083e226 [Yu ISHIKAWA] Add the comment about the supported values and the default value of `optimizer`
9f8bed8 [Yu ISHIKAWA] Simplify casting
faa9764 [Yu ISHIKAWA] Add some comments for the LDA parameters
98f645a [Yu ISHIKAWA] Remove the interface for `describeTopics`. Because it is not implemented.
57ac03d [Yu ISHIKAWA] Remove the unnecessary import in Python unit testing
73412c3 [Yu ISHIKAWA] Fix the typo
2278829 [Yu ISHIKAWA] Fix the indentation
39514ec [Yu ISHIKAWA] Modify how to cast the input data
8117e18 [Yu ISHIKAWA] Fix the validation problems by `lint-scala`
77fd1b7 [Yu ISHIKAWA] Not use LabeledPoint
68f0653 [Yu ISHIKAWA] Support some parameters for `ALS.train()` in Python
25ef2ac [Yu ISHIKAWA] Resolve conflicts with rebasing
2015-07-14 23:27:42 -07:00
..
stat [SPARK-7639] [PYSPARK] [MLLIB] Python API for KernelDensity 2015-06-06 14:52:14 -07:00
__init__.py [SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust 2015-06-02 23:24:47 -07:00
classification.py [SPARK-4127] [MLLIB] [PYSPARK] Python bindings for StreamingLinearRegressionWithSGD 2015-06-30 10:25:59 -07:00
clustering.py [SPARK-6259] [MLLIB] Python API for LDA 2015-07-14 23:27:42 -07:00
common.py [SPARK-7922] [MLLIB] use DataFrames for user/item factors in ALSModel 2015-05-28 22:38:38 -07:00
evaluation.py [SPARK-8068] [MLLIB] Add confusionMatrix method at class MulticlassMetrics in pyspark/mllib 2015-07-08 16:21:28 -07:00
feature.py [SPARK-7104] [MLLIB] Support model save/load in Python's Word2Vec 2015-07-02 15:55:16 -07:00
fpm.py Revert "[SPARK-7212] [MLLIB] Add sequence learning flag" 2015-07-06 16:11:22 -07:00
linalg.py [SPARK-6266] [MLLIB] PySpark SparseVector missing doc for size, indices, values 2015-07-09 08:16:26 -07:00
random.py [SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict 2015-05-29 14:13:44 -07:00
recommendation.py [SPARK-8511] [PYSPARK] Modify a test to remove a saved model in regression.py 2015-06-22 11:53:11 -07:00
regression.py [SPARK-4127] [MLLIB] [PYSPARK] Python bindings for StreamingLinearRegressionWithSGD 2015-06-30 10:25:59 -07:00
tests.py [SPARK-7785] [MLLIB] [PYSPARK] Add __str__ and __repr__ to Matrices 2015-07-08 13:19:27 -07:00
tree.py [SPARK-7781] [MLLIB] gradient boosted trees.train regressor missing max bins 2015-06-22 22:40:19 -07:00
util.py [SPARK-6263] [MLLIB] Python MLlib API missing items: Utils 2015-07-01 11:14:07 -07:00