spark-instrumented-optimizer

History

zhengruifeng 44563a0412 [SPARK-33518][ML] Improve performance of ML ALS recommendForAll by GEMV ### What changes were proposed in this pull request? There were a lot of works on improving ALS's recommendForAll For now, I found that it maybe futhermore optimized by 1, using GEMV and sharing a pre-allocated buffer per task; 2, using guava.ordering instead of BoundedPriorityQueue; ### Why are the changes needed? In my test, using `f2jBLAS.sgemv`, it is about 2.3X faster than existing impl. \|Impl\| Master \| GEMM \| GEMV \| GEMV + array aggregator \| GEMV + guava ordering + array aggregator \| GEMV + guava ordering\| \|------\|----------\|------------\|----------\|------------\|------------\|------------\| \|Duration\|341229\|363741\|191201\|189790\|148417\|147222\| ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? existing testsuites Closes #30468 from zhengruifeng/als_rec_opt. Authored-by: zhengruifeng <ruifengz@foxmail.com> Signed-off-by: Sean Owen <srowen@gmail.com>		2020-12-19 08:43:48 -06:00
..
benchmarks	[SPARK-29297][TESTS] Compare `core`/`mllib` module benchmarks in JDK8/11	2019-09-29 21:43:58 -07:00
src	[SPARK-33518][ML] Improve performance of ML ALS recommendForAll by GEMV	2020-12-19 08:43:48 -06:00
pom.xml	[SPARK-33662][BUILD] Setting version to 3.2.0-SNAPSHOT	2020-12-04 14:10:42 -08:00