spark-instrumented-optimizer/python/pyspark/mllib
Joseph K. Bradley 1db7179fae [SPARK-9805] [MLLIB] [PYTHON] [STREAMING] Added _eventually for ml streaming pyspark tests
Recently, PySpark ML streaming tests have been flaky, most likely because batches are not processed in time. Proposal: Replace the use of _ssc_wait (which always waits for a fixed amount of time) with a method that waits up to a fixed amount of time but can terminate early based on a termination condition method. With this, we can extend the waiting period (to make tests less flaky) but also stop early when possible (making tests faster on average, which I verified locally).

CC: mengxr tdas freeman-lab

Author: Joseph K. Bradley <joseph@databricks.com>

Closes #8087 from jkbradley/streaming-ml-tests.
2015-08-15 18:48:20 -07:00
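
The commit message above describes a wait that has an upper bound but can stop early once a termination condition holds. A minimal sketch of that idea follows, written as a standalone function rather than as the test helper the commit adds. The name `eventually`, the parameters `timeout` and `poll_interval`, and their defaults are assumptions for illustration; the actual `_eventually` in tests.py may differ.

```python
import time


def eventually(condition, timeout=30.0, poll_interval=0.01):
    """Poll condition() until it returns True or `timeout` seconds elapse.

    Hypothetical helper: returns as soon as the condition holds, so a
    generous timeout does not slow down runs where batches finish quickly.
    """
    deadline = time.monotonic() + timeout
    last_value = None
    while True:
        last_value = condition()
        if last_value is True:
            return  # terminate early: the condition is satisfied
        if time.monotonic() >= deadline:
            break  # waited the full period without success
        time.sleep(poll_interval)
    raise AssertionError(
        "condition not met within %.1f s; last value: %r" % (timeout, last_value)
    )


# Usage: stand-in for "have enough streaming batches been processed?"
processed = []


def enough_batches():
    processed.append(len(processed))  # simulate one more batch arriving
    return len(processed) >= 3


eventually(enough_batches, timeout=5.0)
```

Compared with a fixed sleep, the timeout here only bounds the worst case, which is why it can be raised to reduce flakiness without making passing runs slower.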
linalg [SPARK-6486] [MLLIB] [PYTHON] Add BlockMatrix to PySpark. 2015-08-05 07:40:50 -07:00
stat [SPARK-8996] [MLLIB] [PYSPARK] Python API for Kolmogorov-Smirnov Test 2015-07-20 09:00:01 -07:00
__init__.py [SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust 2015-06-02 23:24:47 -07:00
classification.py [SPARK-4127] [MLLIB] [PYSPARK] Python bindings for StreamingLinearRegressionWithSGD 2015-06-30 10:25:59 -07:00
clustering.py [SPARK-7105] [PYSPARK] [MLLIB] Support model save/load in GMM 2015-07-28 15:00:25 -07:00
common.py [SPARK-6485] [MLLIB] [PYTHON] Add CoordinateMatrix/RowMatrix/IndexedRowMatrix to PySpark. 2015-08-04 16:30:03 -07:00
evaluation.py [SPARK-9005] [MLLIB] Fix RegressionMetrics computation of explainedVariance 2015-07-15 13:32:25 -07:00
feature.py [SPARK-7104] [MLLIB] Support model save/load in Python's Word2Vec 2015-07-02 15:55:16 -07:00
fpm.py Revert "[SPARK-7212] [MLLIB] Add sequence learning flag" 2015-07-06 16:11:22 -07:00
random.py [SPARK-7899] [PYSPARK] Fix Python 3 pyspark/sql/types module conflict 2015-05-29 14:13:44 -07:00
recommendation.py [SPARK-8511] [PYSPARK] Modify a test to remove a saved model in regression.py 2015-06-22 11:53:11 -07:00
regression.py [MINOR] [DOC] fix mllib pydoc warnings 2015-08-13 10:16:40 -07:00
tests.py [SPARK-9805] [MLLIB] [PYTHON] [STREAMING] Added _eventually for ml streaming pyspark tests 2015-08-15 18:48:20 -07:00
tree.py [SPARK-7781] [MLLIB] gradient boosted trees.train regressor missing max bins 2015-06-22 22:40:19 -07:00
util.py [MINOR] [DOC] fix mllib pydoc warnings 2015-08-13 10:16:40 -07:00