spark-instrumented-optimizer

History

Yuming Wang 77c996403d [SPARK-25368][SQL] Incorrect predicate pushdown returns wrong result ## What changes were proposed in this pull request? How to reproduce: ```scala val df1 = spark.createDataFrame(Seq( (1, 1) )).toDF("a", "b").withColumn("c", lit(null).cast("int")) val df2 = df1.union(df1).withColumn("d", spark_partition_id).filter($"c".isNotNull) df2.show +---+---+----+---+ \| a\| b\| c\| d\| +---+---+----+---+ \| 1\| 1\|null\| 0\| \| 1\| 1\|null\| 1\| +---+---+----+---+ ``` `filter($"c".isNotNull)` was transformed to `(null <=> c#10)` before https://github.com/apache/spark/pull/19201, but it is transformed to `(c#10 = null)` since https://github.com/apache/spark/pull/20155. This pr revert it to `(null <=> c#10)` to fix this issue. ## How was this patch tested? unit tests Closes #22368 from wangyum/SPARK-25368. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: gatorsmile <gatorsmile@gmail.com>		2018-09-09 09:07:31 -07:00
..
benchmarks	[SPARK-25306][SQL] Avoid skewed filter trees to speed up `createFilter` in ORC	2018-09-05 10:24:13 +08:00
src	[SPARK-25368][SQL] Incorrect predicate pushdown returns wrong result	2018-09-09 09:07:31 -07:00
pom.xml	[SPARK-25019][BUILD] Fix orc dependency to use the same exclusion rules	2018-08-06 12:00:39 -07:00