spark-instrumented-optimizer

History

Takeshi Yamamuro 7f7b4dd519 [SPARK-31990][SS] Use toSet.toSeq in Dataset.dropDuplicates ### What changes were proposed in this pull request? This PR partially revert SPARK-31292 in order to provide a hot-fix for a bug in `Dataset.dropDuplicates`; we must preserve the input order of `colNames` for `groupCols` because the Streaming's state store depends on the `groupCols` order. ### Why are the changes needed? Bug fix. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Added tests in `DataFrameSuite`. Closes #28830 from maropu/SPARK-31990. Authored-by: Takeshi Yamamuro <yamamuro@apache.org> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>		2020-06-15 07:48:48 -07:00
..
benchmarks	[SPARK-31992][SQL] Benchmark the EXCEPTION rebase mode	2020-06-15 07:25:56 +00:00
src	[SPARK-31990][SS] Use toSet.toSeq in Dataset.dropDuplicates	2020-06-15 07:48:48 -07:00
v1.2/src	[SPARK-31818][SQL] Fix pushing down filters with `java.time.Instant` values in ORC	2020-05-25 18:36:02 -07:00
v2.3/src	[SPARK-31818][SQL] Fix pushing down filters with `java.time.Instant` values in ORC	2020-05-25 18:36:02 -07:00
pom.xml	[SPARK-31765][WEBUI][TEST-MAVEN] Upgrade HtmlUnit >= 2.37.0	2020-06-11 18:27:53 -05:00