spark-instrumented-optimizer/R/pkg/inst
Narine Kokhlikyan 7c6c692637 [SPARK-12922][SPARKR][WIP] Implement gapply() on DataFrame in SparkR
## What changes were proposed in this pull request?

gapply() applies an R function on groups grouped by one or more columns of a DataFrame, and returns a DataFrame. It is like GroupedDataSet.flatMapGroups() in the Dataset API.

Please, let me know what do you think and if you have any ideas to improve it.

Thank you!

## How was this patch tested?
Unit tests.
1. Primitive test with different column types
2. Add a boolean column
3. Compute average by a group

Author: Narine Kokhlikyan <narine.kokhlikyan@gmail.com>
Author: NarineK <narine.kokhlikyan@us.ibm.com>

Closes #12836 from NarineK/gapply2.
2016-06-15 21:42:05 -07:00
..
profile [SPARK-13812][SPARKR] Fix SparkR lint-r test errors. 2016-03-13 14:30:44 -07:00
test_support [SPARK-8607] SparkR -- jars not being added to application classpath correctly 2015-06-26 17:06:16 -07:00
tests/testthat [SPARK-12922][SPARKR][WIP] Implement gapply() on DataFrame in SparkR 2016-06-15 21:42:05 -07:00
worker [SPARK-12922][SPARKR][WIP] Implement gapply() on DataFrame in SparkR 2016-06-15 21:42:05 -07:00