spark-instrumented-optimizer

History

hyukjinkwon c9fe10d4ed [SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR ## What changes were proposed in this pull request? `write.df`/`read.df` API require path which is not actually always necessary in Spark. Currently, it only affects the datasources implementing `CreatableRelationProvider`. Currently, Spark currently does not have internal data sources implementing this but it'd affect other external datasources. In addition we'd be able to use this way in Spark's JDBC datasource after https://github.com/apache/spark/pull/12601 is merged. Before - `read.df` ```r > read.df(source = "json") Error in dispatchFunc("read.df(path = NULL, source = NULL, schema = NULL, ...)", : argument "x" is missing with no default ``` ```r > read.df(path = c(1, 2)) Error in dispatchFunc("read.df(path = NULL, source = NULL, schema = NULL, ...)", : argument "x" is missing with no default ``` ```r > read.df(c(1, 2)) Error in invokeJava(isStatic = TRUE, className, methodName, ...) : java.lang.ClassCastException: java.lang.Double cannot be cast to java.lang.String at org.apache.spark.sql.execution.datasources.DataSource.hasMetadata(DataSource.scala:300) at ... In if (is.na(object)) { : ... ``` - `write.df` ```r > write.df(df, source = "json") Error in (function (classes, fdef, mtable) : unable to find an inherited method for function ‘write.df’ for signature ‘"function", "missing"’ ``` ```r > write.df(df, source = c(1, 2)) Error in (function (classes, fdef, mtable) : unable to find an inherited method for function ‘write.df’ for signature ‘"SparkDataFrame", "missing"’ ``` ```r > write.df(df, mode = TRUE) Error in (function (classes, fdef, mtable) : unable to find an inherited method for function ‘write.df’ for signature ‘"SparkDataFrame", "missing"’ ``` After - `read.df` ```r > read.df(source = "json") Error in loadDF : analysis error - Unable to infer schema for JSON at . It must be specified manually; ``` ```r > read.df(path = c(1, 2)) Error in f(x, ...) : path should be charactor, null or omitted. ``` ```r > read.df(c(1, 2)) Error in f(x, ...) : path should be charactor, null or omitted. ``` - `write.df` ```r > write.df(df, source = "json") Error in save : illegal argument - 'path' is not specified ``` ```r > write.df(df, source = c(1, 2)) Error in .local(df, path, ...) : source should be charactor, null or omitted. It is 'parquet' by default. ``` ```r > write.df(df, mode = TRUE) Error in .local(df, path, ...) : mode should be charactor or omitted. It is 'error' by default. ``` ## How was this patch tested? Unit tests in `test_sparkSQL.R` Author: hyukjinkwon <gurwls223@gmail.com> Closes #15231 from HyukjinKwon/write-default-r.		2016-10-04 22:58:43 -07:00
..
inst	[SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR	2016-10-04 22:58:43 -07:00
R	[SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR	2016-10-04 22:58:43 -07:00
src-native	[SPARK-6811] Copy SparkR lib in make-distribution.sh	2015-05-23 00:04:01 -07:00
tests	[SPARK-12034][SPARKR] Eliminate warnings in SparkR test cases.	2015-12-07 10:38:17 -08:00
vignettes	[SPARKR][DOC] minor formatting and output cleanup for R vignettes	2016-10-04 09:22:26 -07:00
.lintr	[SPARK-12327][SPARKR] fix code for lintr warning for commented code	2016-01-03 20:53:35 +05:30
.Rbuildignore	[SPARK-16507][SPARKR] Add a CRAN checker, fix Rd aliases	2016-07-16 17:06:44 -07:00
DESCRIPTION	[SPARK-16581][SPARKR] Make JVM backend calling functions public	2016-08-29 12:55:32 -07:00
NAMESPACE	[SPARK-17577][SPARKR][CORE] SparkR support add files to Spark job and get by executors	2016-09-21 20:08:28 -07:00