spark-instrumented-optimizer/sql/core/benchmarks/DataSourceReadBenchmark-results.txt

270 lines
20 KiB
Plaintext
Raw Normal View History

================================================================================================
SQL Single Numeric Column Scan
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
SQL Single TINYINT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 21508 / 22112 0.7 1367.5 1.0X
SQL Json 8705 / 8825 1.8 553.4 2.5X
SQL Parquet Vectorized 157 / 186 100.0 10.0 136.7X
SQL Parquet MR 1789 / 1794 8.8 113.8 12.0X
SQL ORC Vectorized 156 / 166 100.9 9.9 138.0X
SQL ORC Vectorized with copy 218 / 225 72.1 13.9 98.6X
SQL ORC MR 1448 / 1492 10.9 92.0 14.9X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parquet Reader Single TINYINT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
ParquetReader Vectorized 202 / 211 77.7 12.9 1.0X
ParquetReader Vectorized -> Row 118 / 120 133.5 7.5 1.7X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
SQL Single SMALLINT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 23282 / 23312 0.7 1480.2 1.0X
SQL Json 9187 / 9189 1.7 584.1 2.5X
SQL Parquet Vectorized 204 / 218 77.0 13.0 114.0X
SQL Parquet MR 1941 / 1953 8.1 123.4 12.0X
SQL ORC Vectorized 217 / 225 72.6 13.8 107.5X
SQL ORC Vectorized with copy 279 / 289 56.3 17.8 83.4X
SQL ORC MR 1541 / 1549 10.2 98.0 15.1X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parquet Reader Single SMALLINT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
ParquetReader Vectorized 288 / 297 54.6 18.3 1.0X
ParquetReader Vectorized -> Row 255 / 257 61.7 16.2 1.1X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
SQL Single INT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 24990 / 25012 0.6 1588.8 1.0X
SQL Json 9837 / 9865 1.6 625.4 2.5X
SQL Parquet Vectorized 170 / 180 92.3 10.8 146.6X
SQL Parquet MR 2319 / 2328 6.8 147.4 10.8X
SQL ORC Vectorized 293 / 301 53.7 18.6 85.3X
SQL ORC Vectorized with copy 297 / 309 52.9 18.9 84.0X
SQL ORC MR 1667 / 1674 9.4 106.0 15.0X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parquet Reader Single INT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
ParquetReader Vectorized 257 / 274 61.3 16.3 1.0X
ParquetReader Vectorized -> Row 259 / 264 60.8 16.4 1.0X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
SQL Single BIGINT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 32537 / 32554 0.5 2068.7 1.0X
SQL Json 12610 / 12668 1.2 801.7 2.6X
SQL Parquet Vectorized 258 / 276 61.0 16.4 126.2X
SQL Parquet MR 2422 / 2435 6.5 154.0 13.4X
SQL ORC Vectorized 378 / 385 41.6 24.0 86.2X
SQL ORC Vectorized with copy 381 / 389 41.3 24.2 85.4X
SQL ORC MR 1797 / 1819 8.8 114.3 18.1X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parquet Reader Single BIGINT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
ParquetReader Vectorized 352 / 368 44.7 22.4 1.0X
ParquetReader Vectorized -> Row 351 / 359 44.8 22.3 1.0X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
SQL Single FLOAT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 27179 / 27184 0.6 1728.0 1.0X
SQL Json 12578 / 12585 1.3 799.7 2.2X
SQL Parquet Vectorized 161 / 171 97.5 10.3 168.5X
SQL Parquet MR 2361 / 2395 6.7 150.1 11.5X
SQL ORC Vectorized 473 / 480 33.3 30.0 57.5X
SQL ORC Vectorized with copy 478 / 483 32.9 30.4 56.8X
SQL ORC MR 1858 / 1859 8.5 118.2 14.6X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parquet Reader Single FLOAT Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
ParquetReader Vectorized 251 / 255 62.7 15.9 1.0X
ParquetReader Vectorized -> Row 255 / 259 61.8 16.2 1.0X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
SQL Single DOUBLE Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 34797 / 34830 0.5 2212.3 1.0X
SQL Json 17806 / 17828 0.9 1132.1 2.0X
SQL Parquet Vectorized 260 / 269 60.6 16.5 134.0X
SQL Parquet MR 2512 / 2534 6.3 159.7 13.9X
SQL ORC Vectorized 582 / 593 27.0 37.0 59.8X
SQL ORC Vectorized with copy 576 / 584 27.3 36.6 60.4X
SQL ORC MR 2309 / 2313 6.8 146.8 15.1X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Parquet Reader Single DOUBLE Column Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
ParquetReader Vectorized 350 / 363 44.9 22.3 1.0X
ParquetReader Vectorized -> Row 350 / 366 44.9 22.3 1.0X
================================================================================================
Int and String Scan
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Int and String Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 22486 / 22590 0.5 2144.5 1.0X
SQL Json 14124 / 14195 0.7 1347.0 1.6X
SQL Parquet Vectorized 2342 / 2347 4.5 223.4 9.6X
SQL Parquet MR 4660 / 4664 2.2 444.4 4.8X
SQL ORC Vectorized 2378 / 2379 4.4 226.8 9.5X
SQL ORC Vectorized with copy 2548 / 2571 4.1 243.0 8.8X
SQL ORC MR 4206 / 4211 2.5 401.1 5.3X
================================================================================================
Repeated String Scan
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Repeated String: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 12150 / 12178 0.9 1158.7 1.0X
SQL Json 7012 / 7014 1.5 668.7 1.7X
SQL Parquet Vectorized 792 / 796 13.2 75.5 15.3X
SQL Parquet MR 1961 / 1975 5.3 187.0 6.2X
SQL ORC Vectorized 482 / 485 21.8 46.0 25.2X
SQL ORC Vectorized with copy 710 / 715 14.8 67.7 17.1X
SQL ORC MR 2081 / 2083 5.0 198.5 5.8X
================================================================================================
Partitioned Table Scan
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Partitioned Table: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
Data column - CSV 31789 / 31791 0.5 2021.1 1.0X
Data column - Json 12873 / 12918 1.2 818.4 2.5X
Data column - Parquet Vectorized 267 / 280 58.9 17.0 119.1X
Data column - Parquet MR 3387 / 3402 4.6 215.3 9.4X
Data column - ORC Vectorized 391 / 453 40.2 24.9 81.2X
Data column - ORC Vectorized with copy 392 / 398 40.2 24.9 81.2X
Data column - ORC MR 2508 / 2512 6.3 159.4 12.7X
Partition column - CSV 6965 / 6977 2.3 442.8 4.6X
Partition column - Json 5563 / 5576 2.8 353.7 5.7X
Partition column - Parquet Vectorized 65 / 78 241.1 4.1 487.2X
Partition column - Parquet MR 1811 / 1811 8.7 115.1 17.6X
Partition column - ORC Vectorized 66 / 73 239.0 4.2 483.0X
Partition column - ORC Vectorized with copy 65 / 70 241.1 4.1 487.3X
Partition column - ORC MR 1775 / 1778 8.9 112.8 17.9X
Both columns - CSV 30032 / 30113 0.5 1909.4 1.1X
Both columns - Json 13941 / 13959 1.1 886.3 2.3X
Both columns - Parquet Vectorized 312 / 330 50.3 19.9 101.7X
Both columns - Parquet MR 3858 / 3862 4.1 245.3 8.2X
Both columns - ORC Vectorized 431 / 437 36.5 27.4 73.8X
Both column - ORC Vectorized with copy 523 / 529 30.1 33.3 60.7X
Both columns - ORC MR 2712 / 2805 5.8 172.4 11.7X
================================================================================================
String with Nulls Scan
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
String with Nulls Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 13525 / 13823 0.8 1289.9 1.0X
SQL Json 9913 / 9921 1.1 945.3 1.4X
SQL Parquet Vectorized 1517 / 1517 6.9 144.7 8.9X
SQL Parquet MR 3996 / 4008 2.6 381.1 3.4X
ParquetReader Vectorized 1120 / 1128 9.4 106.8 12.1X
SQL ORC Vectorized 1203 / 1224 8.7 114.7 11.2X
SQL ORC Vectorized with copy 1639 / 1646 6.4 156.3 8.3X
SQL ORC MR 3720 / 3780 2.8 354.7 3.6X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
String with Nulls Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 15860 / 15877 0.7 1512.5 1.0X
SQL Json 7676 / 7688 1.4 732.0 2.1X
SQL Parquet Vectorized 1072 / 1084 9.8 102.2 14.8X
SQL Parquet MR 2890 / 2897 3.6 275.6 5.5X
ParquetReader Vectorized 1052 / 1053 10.0 100.4 15.1X
SQL ORC Vectorized 1248 / 1248 8.4 119.0 12.7X
SQL ORC Vectorized with copy 1627 / 1637 6.4 155.2 9.7X
SQL ORC MR 3365 / 3369 3.1 320.9 4.7X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
String with Nulls Scan: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 13401 / 13561 0.8 1278.1 1.0X
SQL Json 5253 / 5303 2.0 500.9 2.6X
SQL Parquet Vectorized 233 / 242 45.0 22.2 57.6X
SQL Parquet MR 1791 / 1796 5.9 170.8 7.5X
ParquetReader Vectorized 236 / 238 44.4 22.5 56.7X
SQL ORC Vectorized 453 / 473 23.2 43.2 29.6X
SQL ORC Vectorized with copy 573 / 577 18.3 54.7 23.4X
SQL ORC MR 1846 / 1850 5.7 176.0 7.3X
================================================================================================
Single Column Scan From Wide Columns
================================================================================================
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Single Column Scan from 10 columns: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 3147 / 3148 0.3 3001.1 1.0X
SQL Json 2666 / 2693 0.4 2542.9 1.2X
SQL Parquet Vectorized 54 / 58 19.5 51.3 58.5X
SQL Parquet MR 220 / 353 4.8 209.9 14.3X
SQL ORC Vectorized 63 / 77 16.8 59.7 50.3X
SQL ORC Vectorized with copy 63 / 66 16.7 59.8 50.2X
SQL ORC MR 317 / 321 3.3 302.2 9.9X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Single Column Scan from 50 columns: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 7902 / 7921 0.1 7536.2 1.0X
SQL Json 9467 / 9491 0.1 9028.6 0.8X
SQL Parquet Vectorized 73 / 79 14.3 69.8 108.0X
SQL Parquet MR 239 / 247 4.4 228.0 33.1X
SQL ORC Vectorized 78 / 84 13.4 74.6 101.0X
SQL ORC Vectorized with copy 78 / 88 13.4 74.4 101.3X
SQL ORC MR 910 / 918 1.2 867.6 8.7X
OpenJDK 64-Bit Server VM 1.8.0_181-b13 on Linux 3.10.0-862.3.2.el7.x86_64
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Single Column Scan from 100 columns: Best/Avg Time(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------
SQL CSV 13539 / 13543 0.1 12912.0 1.0X
SQL Json 17420 / 17446 0.1 16613.1 0.8X
SQL Parquet Vectorized 103 / 120 10.2 98.1 131.6X
SQL Parquet MR 250 / 258 4.2 238.9 54.1X
SQL ORC Vectorized 99 / 104 10.6 94.6 136.5X
SQL ORC Vectorized with copy 100 / 106 10.5 95.6 135.1X
SQL ORC MR 1653 / 1659 0.6 1576.3 8.2X