spark-instrumented-optimizer/sql/core/benchmarks/BuiltInDataSourceWriteBenchmark-results.txt
HyukjinKwon ebf01ec3c1 [SPARK-34950][TESTS] Update benchmark results to the ones created by GitHub Actions machines
### What changes were proposed in this pull request?

https://github.com/apache/spark/pull/32015 added a way to run benchmarks much more easily in the same GitHub Actions build. This PR updates the benchmark results using that workflow.

**NOTE:** From my observations, GitHub Actions machines appear to use four types of CPU:

- Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
- Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
- Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
- Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz

Based on some quick research, they seem to perform roughly similarly:

![Screen Shot 2021-04-03 at 9 31 23 PM](https://user-images.githubusercontent.com/6477701/113478478-f4b57b80-94c3-11eb-9047-f81ca8c59672.png)

I couldn't find enough information about the Intel(R) Xeon(R) Platinum 8272CL CPU @ 2.60GHz, but its performance seems roughly similar given the numbers.

So the CPU differences shouldn't be a big deal, especially given that this approach is much easier, encourages contributors to run benchmarks more often, and guarantees the same number of cores and the same amount of memory with the same software.

### Why are the changes needed?

To have a baseline for the benchmarks that matches the GitHub Actions environment they are now run in.

### Does this PR introduce _any_ user-facing change?

No, dev-only.

### How was this patch tested?

It was generated from:

- [Run benchmarks: * (JDK 11)](https://github.com/HyukjinKwon/spark/actions/runs/713575465)
- [Run benchmarks: * (JDK 8)](https://github.com/HyukjinKwon/spark/actions/runs/713154337)
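
As a quick sanity check on the generated files, the derived columns (Rate, Per Row, Relative) can be recomputed from the Best Time column. The sketch below does this for two rows of the Parquet section of `BuiltInDataSourceWriteBenchmark-results.txt`. It assumes each case writes 15 * 1024 * 1024 rows, a value inferred from the reported Best Time and Per Row figures rather than stated in this PR, and `derived_columns` is a hypothetical helper name, not part of Spark's benchmark code.

```python
# Assumption: every case writes this many rows. The value is inferred from
# Best Time(ms) / Per Row(ns) in the results file, not stated in this PR.
NUM_ROWS = 15 * 1024 * 1024


def derived_columns(best_ms, baseline_best_ms, num_rows=NUM_ROWS):
    """Recompute Rate(M/s), Per Row(ns) and Relative from Best Time(ms).

    baseline_best_ms is the Best Time of the first case in the same section,
    which the results file reports as 1.0X.
    """
    rate_m_per_s = num_rows / (best_ms / 1000.0) / 1e6  # million rows per second
    per_row_ns = best_ms * 1e6 / num_rows               # nanoseconds per row
    relative = baseline_best_ms / best_ms               # speed relative to baseline
    return rate_m_per_s, per_row_ns, relative


# Best Time(ms) values copied from the Parquet section of the results file.
baseline = 2067  # Output Single Int Column
for name, best_ms in [("Output Single Int Column", 2067),
                      ("Output Int and String Column", 7387)]:
    rate, per_row, rel = derived_columns(best_ms, baseline)
    print(f"{name}: {rate:.1f} M/s, {per_row:.1f} ns/row, {rel:.1f}X")
```

Because Best Time is reported in whole milliseconds, recomputed values can differ from the file in the last printed digit.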

Closes #32044 from HyukjinKwon/SPARK-34950.

Authored-by: HyukjinKwon <gurwls223@apache.org>
Signed-off-by: Max Gekk <max.gekk@gmail.com>
2021-04-03 23:02:56 +03:00


================================================================================================
Parquet writer benchmark
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Parquet writer benchmark:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           2067           2087          29          7.6         131.4       1.0X
Output Single Double Column                        1977           2013          52          8.0         125.7       1.0X
Output Int and String Column                       7387           7410          33          2.1         469.7       0.3X
Output Partitions                                  3720           3831         156          4.2         236.5       0.6X
Output Buckets                                     4987           5028          57          3.2         317.1       0.4X
================================================================================================
ORC writer benchmark
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
ORC writer benchmark:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           1264           1284          29         12.4          80.3       1.0X
Output Single Double Column                        1708           1711           4          9.2         108.6       0.7X
Output Int and String Column                       5988           6019          43          2.6         380.7       0.2X
Output Partitions                                  3192           3269         108          4.9         203.0       0.4X
Output Buckets                                     4439           4472          47          3.5         282.2       0.3X
================================================================================================
JSON writer benchmark
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
JSON writer benchmark:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           2187           2194          10          7.2         139.0       1.0X
Output Single Double Column                        2808           2833          35          5.6         178.5       0.8X
Output Int and String Column                       6190           6224          49          2.5         393.5       0.4X
Output Partitions                                  3646           3705          83          4.3         231.8       0.6X
Output Buckets                                     4891           5325         614          3.2         311.0       0.4X
================================================================================================
CSV writer benchmark
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_282-b08 on Linux 5.4.0-1043-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
CSV writer benchmark:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Output Single Int Column                           3162           3168           9          5.0         201.0       1.0X
Output Single Double Column                        3647           3660          19          4.3         231.9       0.9X
Output Int and String Column                       7266           7271           6          2.2         462.0       0.4X
Output Partitions                                  4857           4930         103          3.2         308.8       0.7X
Output Buckets                                     6809           6897         124          2.3         432.9       0.5X