spark-instrumented-optimizer/common
Reynold Xin d3c90b74ed [SPARK-18663][SQL] Simplify CountMinSketch aggregate implementation
## What changes were proposed in this pull request?
SPARK-18429 introduced count-min sketch aggregate function for SQL, but the implementation and testing is more complicated than needed. This simplifies the test cases and removes support for data types that don't have clear equality semantics:

1. Removed support for floating point and decimal types.

2. Removed the heavy randomized tests. The underlying CountMinSketch implementation already had pretty good test coverage through randomized tests, and the SPARK-18429 implementation is just to add an aggregate function wrapper around CountMinSketch. There is no need for randomized tests at three different levels of the implementations.

## How was this patch tested?
A lot of the change is to simplify test cases.

Author: Reynold Xin <rxin@databricks.com>

Closes #16093 from rxin/SPARK-18663.
2016-12-01 21:38:52 -08:00
..
network-common [SPARK-18420][BUILD] Fix the errors caused by lint check in Java 2016-11-16 11:59:00 +00:00
network-shuffle [SPARK-17543] Missing log4j config file for tests in common/network-… 2016-09-16 10:18:45 +01:00
network-yarn [SPARK-17611][YARN][TEST] Make shuffle service test really test auth. 2016-09-20 14:17:49 -07:00
sketch [SPARK-18663][SQL] Simplify CountMinSketch aggregate implementation 2016-12-01 21:38:52 -08:00
tags [SPARK-17800] Introduce InterfaceStability annotation 2016-10-07 10:24:42 -07:00
unsafe [SPARK-18124] Observed delay based Event Time Watermarks 2016-11-14 16:46:26 -08:00