spark-instrumented-optimizer

History

zero323 d7d9fa0b87 [SPARK-11086][SPARKR] Use dropFactors column-wise instead of nested loop when createDataFrame Use `dropFactors` column-wise instead of nested loop when `createDataFrame` from a `data.frame` At this moment SparkR createDataFrame is using nested loop to convert factors to character when called on a local data.frame. It works but is incredibly slow especially with data.table (~ 2 orders of magnitude compared to PySpark / Pandas version on a DateFrame of size 1M rows x 2 columns). A simple improvement is to apply `dropFactor `column-wise and then reshape output list. It should at least partially address [SPARK-8277](https://issues.apache.org/jira/browse/SPARK-8277). Author: zero323 <matthew.szymkiewicz@gmail.com> Closes #9099 from zero323/SPARK-11086.		2015-11-15 19:15:27 -08:00
..
jarTest.R	[SPARK-8607] SparkR -- jars not being added to application classpath correctly	2015-06-26 17:06:16 -07:00
packageInAJarTest.R	[SPARK-8313] R Spark packages support	2015-08-04 18:20:12 -07:00
test_binary_function.R	[SPARK-9053] [SPARKR] Fix spaces around parens, infix operators etc.	2015-07-31 09:33:38 -07:00
test_binaryFile.R	[SPARK-8808] [SPARKR] Fix assignments in SparkR.	2015-07-14 22:21:01 -07:00
test_broadcast.R	[SPARK-7230] [SPARKR] Make RDD private in SparkR.	2015-05-05 14:40:33 -07:00
test_client.R	Use vector-friendly comparison for packages argument.	2015-07-28 10:45:19 -07:00
test_context.R	[SPARK-11340][SPARKR] Support setting driver properties when starting Spark from R programmatically or from RStudio	2015-10-30 13:51:32 -07:00
test_includeJAR.R	[SPARK-8549] [SPARKR] Fix the line length of SparkR	2015-07-05 20:50:02 -07:00
test_includePackage.R	[SPARK-5654] Integrate SparkR	2015-04-08 22:45:40 -07:00
test_mllib.R	[ML][R] SparkR::glm summary result to compare with native R	2015-11-10 11:34:36 -08:00
test_parallelize_collect.R	[SPARK-7714] [SPARKR] SparkR tests should use more specific expectations than expect_true	2015-07-01 09:50:12 -07:00
test_rdd.R	[SPARK-9053] [SPARKR] Fix spaces around parens, infix operators etc.	2015-07-31 09:33:38 -07:00
test_Serde.R	[MINOR] [SPARKR] Fix some validation problems in SparkR	2015-08-26 18:14:32 -07:00
test_shuffle.R	[SPARK-8548] [SPARKR] Remove the trailing whitespaces from the SparkR files	2015-06-22 20:55:38 -07:00
test_sparkSQL.R	[SPARK-11086][SPARKR] Use dropFactors column-wise instead of nested loop when createDataFrame	2015-11-15 19:15:27 -08:00
test_take.R	[SPARK-7714] [SPARKR] SparkR tests should use more specific expectations than expect_true	2015-07-01 09:50:12 -07:00
test_textFile.R	[SPARK-8808] [SPARKR] Fix assignments in SparkR.	2015-07-14 22:21:01 -07:00
test_utils.R	[SPARK-8808] [SPARKR] Fix assignments in SparkR.	2015-07-14 22:21:01 -07:00