spark-instrumented-optimizer

History

Wenchen Fan eaac60a1e2 [SPARK-16060][SQL][FOLLOW-UP] add a wrapper solution for vectorized orc reader ## What changes were proposed in this pull request? This is mostly from https://github.com/apache/spark/pull/13775 The wrapper solution is pretty good for string/binary type, as the ORC column vector doesn't keep bytes in a continuous memory region, and has a significant overhead when copying the data to Spark columnar batch. For other cases, the wrapper solution is almost same with the current solution. I think we can treat the wrapper solution as a baseline and keep improving the writing to Spark solution. ## How was this patch tested? existing tests. Author: Wenchen Fan <wenchen@databricks.com> Closes #20205 from cloud-fan/orc.		2018-01-10 15:16:27 +08:00
..
compatibility/src/test/scala/org/apache/spark/sql/hive/execution	[SPARK-22726][TEST] Basic tests for Binary Comparison and ImplicitTypeCasts	2017-12-11 15:55:23 -08:00
src	[SPARK-16060][SQL][FOLLOW-UP] add a wrapper solution for vectorized orc reader	2018-01-10 15:16:27 +08:00
pom.xml	[SPARK-21936][SQL] backward compatibility test framework for HiveExternalCatalog	2017-09-07 23:21:49 -07:00