spark-instrumented-optimizer/sql/hive
Wenchen Fan eaac60a1e2 [SPARK-16060][SQL][FOLLOW-UP] add a wrapper solution for vectorized orc reader
## What changes were proposed in this pull request?

This is mostly from https://github.com/apache/spark/pull/13775

The wrapper solution is pretty good for string/binary type, as the ORC column vector doesn't keep bytes in a continuous memory region, and has a significant overhead when copying the data to Spark columnar batch. For other cases, the wrapper solution is almost same with the current solution.

I think we can treat the wrapper solution as a baseline and keep improving the writing to Spark solution.

## How was this patch tested?

existing tests.

Author: Wenchen Fan <wenchen@databricks.com>

Closes #20205 from cloud-fan/orc.
2018-01-10 15:16:27 +08:00
..
compatibility/src/test/scala/org/apache/spark/sql/hive/execution [SPARK-22726][TEST] Basic tests for Binary Comparison and ImplicitTypeCasts 2017-12-11 15:55:23 -08:00
src [SPARK-16060][SQL][FOLLOW-UP] add a wrapper solution for vectorized orc reader 2018-01-10 15:16:27 +08:00
pom.xml [SPARK-21936][SQL] backward compatibility test framework for HiveExternalCatalog 2017-09-07 23:21:49 -07:00