acb9715779
## What changes were proposed in this pull request?

When running a SparkR job in yarn-cluster mode, SparkR downloads the Spark package from the Apache website even though this is unnecessary:

```
./bin/spark-submit --master yarn-cluster ./examples/src/main/r/dataframe.R
```

The output is:

```
Attaching package: ‘SparkR’

The following objects are masked from ‘package:stats’:

    cov, filter, lag, na.omit, predict, sd, var, window

The following objects are masked from ‘package:base’:

    as.data.frame, colnames, colnames<-, drop, endsWith, intersect,
    rank, rbind, sample, startsWith, subset, summary, transform, union

Spark not found in SPARK_HOME:
Spark not found in the cache directory. Installation will start.
MirrorUrl not provided.
Looking for preferred site from apache website...
......
```

There is no `SPARK_HOME` in yarn-cluster mode, because the R process runs on a remote host of the YARN cluster rather than on the client host. The JVM comes up first and the R process then connects to it, so in such cases we should never have to download Spark: Spark is already running.

## How was this patch tested?

Offline test.

Author: Yanbo Liang <ybliang8@gmail.com>

Closes #15888 from yanboliang/spark-18444.
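A condensed sketch of the guard this change implies, not the actual patch: fall back to `install.spark()` only when a Spark distribution is genuinely needed on the local host, i.e. when `SPARK_HOME` is unset and the master is local; for a remote master such as yarn-cluster, skip the download entirely. The function name `checkSparkInstall` and its exact branching are hypothetical simplifications for illustration.

```r
library(SparkR)  # provides install.spark()

# Hypothetical, simplified sketch of the install decision:
# `sparkHome` and `master` stand in for the values the session
# setup would resolve from the environment and the submit args.
checkSparkInstall <- function(sparkHome, master) {
  if (!is.na(file.info(sparkHome)$isdir)) {
    # SPARK_HOME points at an existing directory: use it as-is.
    sparkHome
  } else if (grepl("^local(\\[.*\\])?$", master)) {
    # A local master runs Spark on this host, so download the
    # package (or reuse the cached copy) via install.spark().
    install.spark()
  } else {
    # Remote master (e.g. yarn-cluster): the cluster manager has
    # already started the JVM and R merely connects to it, so no
    # download is needed.
    NULL
  }
}
```

The key design point is the final branch: when the backend JVM already exists on the cluster, downloading a distribution just to satisfy `SPARK_HOME` would cost time and bandwidth for an installation that is never used.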
- jarTest.R
- packageInAJarTest.R
- test_binary_function.R
- test_binaryFile.R
- test_broadcast.R
- test_client.R
- test_context.R
- test_includePackage.R
- test_jvm_api.R
- test_mllib.R
- test_parallelize_collect.R
- test_rdd.R
- test_Serde.R
- test_shuffle.R
- test_sparkR.R
- test_sparkSQL.R
- test_take.R
- test_textFile.R
- test_utils.R
- test_Windows.R