spark-instrumented-optimizer/R/pkg
Maxim Gekk 027ed2d11b [SPARK-23643][CORE][SQL][ML] Shrinking the buffer in hashSeed up to size of the seed parameter
## What changes were proposed in this pull request?

The hashSeed method allocates 64 bytes instead of 8. Other bytes are always zeros (thanks to default behavior of ByteBuffer). And they could be excluded from hash calculation because they don't differentiate inputs.

## How was this patch tested?

By running the existing tests - XORShiftRandomSuite

Closes #20793 from MaxGekk/hash-buff-size.

Lead-authored-by: Maxim Gekk <maxim.gekk@databricks.com>
Co-authored-by: Maxim Gekk <max.gekk@gmail.com>
Signed-off-by: Sean Owen <sean.owen@databricks.com>
2019-03-23 11:26:09 -05:00
..
inst [SPARK-26830][SQL][R] Vectorized R dapply() implementation 2019-02-27 14:29:58 +09:00
R [SPARK-27099][SQL] Add 'xxhash64' for hashing arbitrary columns to Long 2019-03-20 16:34:34 +08:00
src-native [SPARK-6811] Copy SparkR lib in make-distribution.sh 2015-05-23 00:04:01 -07:00
tests [SPARK-23643][CORE][SQL][ML] Shrinking the buffer in hashSeed up to size of the seed parameter 2019-03-23 11:26:09 -05:00
vignettes [SPARK-19827][R] spark.ml R API for PIC 2018-12-10 18:28:13 -06:00
.lintr [SPARK-22063][R] Fixes lint check failures in R by latest commit sha1 ID of lint-r 2017-10-01 18:42:45 +09:00
.Rbuildignore [SPARK-20877][SPARKR][FOLLOWUP] clean up after test move 2017-06-11 03:00:44 -07:00
DESCRIPTION [R] update package description 2019-02-21 19:00:36 +08:00
NAMESPACE [SPARK-27099][SQL] Add 'xxhash64' for hashing arbitrary columns to Long 2019-03-20 16:34:34 +08:00