spark-instrumented-optimizer

History

zhengruifeng e7bc6f38b9 [SPARK-31494][ML] flatten the result dataframe of ANOVATest ### What changes were proposed in this pull request? add a new method `def test(dataset: DataFrame, featuresCol: String, labelCol: String, flatten: Boolean): DataFrame` ### Why are the changes needed? Similar to new `test` method in `ChiSquareTest`, it will: 1, support df operation on the returned df; 2, make driver no longer a bottleneck with large numFeatures ### Does this PR introduce any user-facing change? Yes, new method added ### How was this patch tested? existing testsuites Closes #28270 from zhengruifeng/flatten_anova. Authored-by: zhengruifeng <ruifengz@foxmail.com> Signed-off-by: zhengruifeng <ruifengz@foxmail.com>		2020-04-21 12:43:14 +08:00
..
benchmarks	[SPARK-29297][TESTS] Compare `core`/`mllib` module benchmarks in JDK8/11	2019-09-29 21:43:58 -07:00
src	[SPARK-31494][ML] flatten the result dataframe of ANOVATest	2020-04-21 12:43:14 +08:00
pom.xml	[SPARK-30950][BUILD] Setting version to 3.1.0-SNAPSHOT	2020-02-25 19:44:31 -08:00