ebf01ec3c1
### What changes were proposed in this pull request?

https://github.com/apache/spark/pull/32015 added a much easier way to run benchmarks in the same GitHub Actions build. This PR updates the benchmark results using that workflow.

**NOTE**: from my observations, GitHub Actions appears to use four types of CPU:

- Intel(R) Xeon(R) Platinum 8171M CPU 2.60GHz
- Intel(R) Xeon(R) CPU E5-2673 v4 2.30GHz
- Intel(R) Xeon(R) CPU E5-2673 v3 2.40GHz
- Intel(R) Xeon(R) Platinum 8272CL CPU 2.60GHz

From my quick research, they seem to perform roughly the same:

![Screen Shot 2021-04-03 at 9 31 23 PM](https://user-images.githubusercontent.com/6477701/113478478-f4b57b80-94c3-11eb-9047-f81ca8c59672.png)

I couldn't find enough information about Intel(R) Xeon(R) Platinum 8272CL CPU 2.60GHz, but its performance seems roughly similar given the numbers. So this shouldn't be a big deal, especially given that this approach is much easier, encourages contributors to run benchmarks more often, and guarantees the same number of cores and the same memory with the same software.

### Why are the changes needed?

To have a baseline for the benchmarks.

### Does this PR introduce _any_ user-facing change?

No, dev-only.

### How was this patch tested?

The results were generated from:

- [Run benchmarks: * (JDK 11)](https://github.com/HyukjinKwon/spark/actions/runs/713575465)
- [Run benchmarks: * (JDK 8)](https://github.com/HyukjinKwon/spark/actions/runs/713154337)

Closes #32044 from HyukjinKwon/SPARK-34950.

Authored-by: HyukjinKwon <gurwls223@apache.org>
Signed-off-by: Max Gekk <max.gekk@gmail.com>
61 lines
4.6 KiB
Plaintext
================================================================================================
Parquet writer benchmark
================================================================================================

OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
Parquet writer benchmark:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           2636           2676          56          6.0         167.6       1.0X
Output Single Double Column                        2743           2859         164          5.7         174.4       1.0X
Output Int and String Column                       8811           8822          15          1.8         560.2       0.3X
Output Partitions                                  5489           5519          43          2.9         349.0       0.5X
Output Buckets                                     7426           7436          14          2.1         472.1       0.4X

================================================================================================
ORC writer benchmark
================================================================================================

OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
ORC writer benchmark:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           1931           1944          19          8.1         122.8       1.0X
Output Single Double Column                        2241           2304          90          7.0         142.5       0.9X
Output Int and String Column                       7255           7318          89          2.2         461.3       0.3X
Output Partitions                                  4733           4832         140          3.3         300.9       0.4X
Output Buckets                                     6496           6547          73          2.4         413.0       0.3X

================================================================================================
JSON writer benchmark
================================================================================================

OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
JSON writer benchmark:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           2521           2552          44          6.2         160.3       1.0X
Output Single Double Column                        4164           4181          24          3.8         264.7       0.6X
Output Int and String Column                       6972           7002          43          2.3         443.3       0.4X
Output Partitions                                  5246           5345         140          3.0         333.5       0.5X
Output Buckets                                     6860           6935         107          2.3         436.1       0.4X

================================================================================================
CSV writer benchmark
================================================================================================

OpenJDK 64-Bit Server VM 11.0.10+9-LTS on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
CSV writer benchmark:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           3650           3672          31          4.3         232.1       1.0X
Output Single Double Column                        4719           4726          10          3.3         300.0       0.8X
Output Int and String Column                       7842           7863          30          2.0         498.6       0.5X
Output Partitions                                  6895           6911          23          2.3         438.4       0.5X
Output Buckets                                     8880           8928          67          1.8         564.6       0.4X