spark-instrumented-optimizer/python/pyspark/ml
Nick Pentreath 98057583dd [SPARK-20679][ML] Support recommending for a subset of users/items in ALSModel
This PR adds methods `recommendForUserSubset` and `recommendForItemSubset` to `ALSModel`. These allow recommending for a specified set of user / item ids rather than for every user / item (as in the `recommendForAllX` methods).

The subset methods take a `DataFrame` as input, containing ids in the column specified by the param `userCol` or `itemCol`. The model will generate recommendations for each _unique_ id in this input dataframe.

## How was this patch tested?
New unit tests in `ALSSuite` and Python doctests in `ALS`. Ran updated examples locally.

Author: Nick Pentreath <nickp@za.ibm.com>

Closes #18748 from MLnick/als-recommend-df.
2017-10-09 10:42:33 +02:00
..
linalg [SPARK-20214][ML] Make sure converted csc matrix has sorted indices 2017-04-05 17:46:44 -07:00
param [SPARK-21027][ML][PYTHON] Added tunable parallelism to one vs. rest in both Scala mllib and Pyspark 2017-09-12 10:02:27 -07:00
__init__.py [SPARK-21633][ML][PYTHON] UnaryTransformer in Python 2017-08-04 01:01:32 -07:00
base.py [SPARK-21633][ML][PYTHON] UnaryTransformer in Python 2017-08-04 01:01:32 -07:00
classification.py [SPARK-18608][ML][FOLLOWUP] Fix double caching for PySpark OneVsRest. 2017-09-14 14:09:44 +08:00
clustering.py [SPARK-10931][ML][PYSPARK] PySpark Models Copy Param Values from Estimator 2017-08-22 17:40:50 -07:00
common.py [SPARK-17679] [PYSPARK] remove unnecessary Py4J ListConverter patch 2016-10-03 14:12:03 -07:00
evaluation.py [SPARK-21981][PYTHON][ML] Added Python interface for ClusteringEvaluator 2017-09-22 13:12:33 +08:00
feature.py [SPARK-19866][ML][PYSPARK] Add local version of Word2Vec findSynonyms for spark.ml: Python API 2017-09-08 12:09:00 -07:00
fpm.py [SPARK-20768][PYSPARK][ML] Expose numPartitions (expert) param of PySpark FPGrowth. 2017-05-25 21:40:39 +08:00
pipeline.py [SPARK-17025][ML][PYTHON] Persistence for Pipelines with Python-only Stages 2017-08-11 23:57:08 -07:00
recommendation.py [SPARK-20679][ML] Support recommending for a subset of users/items in ALSModel 2017-10-09 10:42:33 +02:00
regression.py [SPARK-21523][ML] update breeze to 0.13.2 for an emergency bugfix in strong wolfe line search 2017-08-09 14:44:10 +08:00
stat.py [SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.Correlation 2017-04-07 11:00:10 +02:00
tests.py [SPARK-21854] Added LogisticRegressionTrainingSummary for MultinomialLogisticRegression in Python API 2017-09-14 13:53:28 +08:00
tuning.py [SPARK-21221][ML] CrossValidator and TrainValidationSplit Persist Nested Estimators such as OneVsRest 2017-07-17 10:07:32 -07:00
util.py [SPARK-21542][ML][PYTHON] Python persistence helper functions 2017-08-07 17:03:20 -07:00
wrapper.py [SPARK-10931][ML][PYSPARK] PySpark Models Copy Param Values from Estimator 2017-08-22 17:40:50 -07:00