spark-instrumented-optimizer

History

zhengruifeng ed2fe8d806 [SPARK-33111][ML] aft transform optimization ### What changes were proposed in this pull request? 1, when `predictionCol` and `quantilesCol` are both set, we only need one prediction for each row: prediction is just the variable `lambda` in `predictQuantiles`; 2, in the computation of variable `quantiles` in `predictQuantiles`, a pre-computed vector `val baseQuantiles = $(quantileProbabilities).map(q => math.exp(math.log(-math.log1p(-q)) * scale))` can be reused for each row; ### Why are the changes needed? avoid redundant computation in transform, like what we did in `ProbabilisticClassificationModel`, `GaussianMixtureModel`, etc ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? existing testsuite Closes #30000 from zhengruifeng/aft_predict_transform_opt. Authored-by: zhengruifeng <ruifengz@foxmail.com> Signed-off-by: Sean Owen <srowen@gmail.com>		2020-10-12 09:01:03 -05:00
..
benchmarks	[SPARK-29297][TESTS] Compare `core`/`mllib` module benchmarks in JDK8/11	2019-09-29 21:43:58 -07:00
src	[SPARK-33111][ML] aft transform optimization	2020-10-12 09:01:03 -05:00
pom.xml	[SPARK-30950][BUILD] Setting version to 3.1.0-SNAPSHOT	2020-02-25 19:44:31 -08:00