spark-instrumented-optimizer/sql/catalyst
Herman van Hovell 16fd2a2f42 [SPARK-9741] [SQL] Approximate Count Distinct using the new UDAF interface.
This PR implements a HyperLogLog based Approximate Count Distinct function using the new UDAF interface.

The implementation is inspired by the ClearSpring HyperLogLog implementation and should produce the same results.

There is still some documentation and testing left to do.

cc yhuai

Author: Herman van Hovell <hvanhovell@questtec.nl>

Closes #8362 from hvanhovell/SPARK-9741.
2015-09-30 10:12:52 -07:00
..
src [SPARK-9741] [SQL] Approximate Count Distinct using the new UDAF interface. 2015-09-30 10:12:52 -07:00
pom.xml Update version to 1.6.0-SNAPSHOT. 2015-09-15 00:54:20 -07:00