spark-instrumented-optimizer/external
Gabbi Merz 29e154b2f1 [SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types
## What changes were proposed in this pull request?

This PR aims to fix an issue on a union avro type with more than one non-null value (for instance `["string", "null", "int"]`) whose the deserialization to a DataFrame would throw a `java.lang.ArrayIndexOutOfBoundsException`. The issue was that the `fieldWriter` relied on the index from the avro schema before nulls were filtered out.

## How was this patch tested?

A test for the case of multiple non-null values was added and the tests were run using sbt by running `testOnly org.apache.spark.sql.avro.AvroSuite`

Closes #24722 from gcmerz/master.

Authored-by: Gabbi Merz <gmerz@palantir.com>
Signed-off-by: Dongjoon Hyun <dhyun@apple.com>
2019-05-27 20:09:23 -07:00
..
avro [SPARK-27858][SQL] Fix for avro deserialization on union types with multiple non-null types 2019-05-27 20:09:23 -07:00
docker [SPARK-27794][R][DOCS] Use https URL for CRAN repo 2019-05-22 14:28:21 -07:00
docker-integration-tests [SPARK-27596][SQL] The JDBC 'query' option doesn't work for Oracle database 2019-05-05 21:52:23 -07:00
kafka-0-10 [SPARK-27294][SS] Add multi-cluster Kafka delegation token 2019-05-07 11:40:43 -07:00
kafka-0-10-assembly [SPARK-25956] Make Scala 2.12 as default Scala version in Spark 3.0 2018-11-14 16:22:23 -08:00
kafka-0-10-sql [SPARK-27687][SS] Rename Kafka consumer cache capacity conf and document caching 2019-05-15 10:42:09 -07:00
kafka-0-10-token-provider [SPARK-27680][CORE][SQL][GRAPHX] Remove usage of Traversable 2019-05-14 09:14:56 -05:00
kinesis-asl [SPARK-27610][YARN] Shade netty native libraries 2019-05-07 10:47:36 -07:00
kinesis-asl-assembly [SPARK-25956] Make Scala 2.12 as default Scala version in Spark 3.0 2018-11-14 16:22:23 -08:00
spark-ganglia-lgpl [SPARK-25956] Make Scala 2.12 as default Scala version in Spark 3.0 2018-11-14 16:22:23 -08:00