spark-instrumented-optimizer

History

Prashant Sharma 40de176c93 [SPARK-16496][SQL] Add wholetext as option for reading text in SQL. ## What changes were proposed in this pull request? In multiple text analysis problems, it is not often desirable for the rows to be split by "\n". There exists a wholeText reader for RDD API, and this JIRA just adds the same support for Dataset API. ## How was this patch tested? Added relevant new tests for both scala and Java APIs Author: Prashant Sharma <prashsh1@in.ibm.com> Author: Prashant Sharma <prashant@apache.org> Closes #14151 from ScrapCodes/SPARK-16496/wholetext.		2017-12-14 11:19:34 -08:00
..
__init__.py	[SPARK-22369][PYTHON][DOCS] Exposes catalog API documentation in PySpark	2017-11-02 15:22:52 +01:00
catalog.py	[SPARK-22409] Introduce function type argument in pandas_udf	2017-11-17 16:43:08 +01:00
column.py	[SPARK-19165][PYTHON][SQL] PySpark APIs using columns as arguments should validate input types for column	2017-08-24 20:29:03 +09:00
conf.py	[SPARK-15464][ML][MLLIB][SQL][TESTS] Replace SQLContext and SparkContext with SparkSession using builder pattern in python test code	2016-05-23 18:14:48 -07:00
context.py	[SPARK-20586][SQL] Add deterministic to ScalaUDF	2017-07-25 17:19:44 -07:00
dataframe.py	[SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp values for Pandas to respect session timezone	2017-11-28 16:45:22 +08:00
functions.py	[SPARK-22541][SQL] Explicitly claim that Python udfs can't be conditionally executed with short-curcuit evaluation	2017-11-21 09:36:37 +01:00
group.py	[SPARK-22409] Introduce function type argument in pandas_udf	2017-11-17 16:43:08 +01:00
readwriter.py	[SPARK-16496][SQL] Add wholetext as option for reading text in SQL.	2017-12-14 11:19:34 -08:00
session.py	[SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp values for Pandas to respect session timezone	2017-11-28 16:45:22 +08:00
streaming.py	[SPARK-21756][SQL] Add JSON option to allow unquoted control characters	2017-08-25 10:18:03 -07:00
tests.py	[SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp values for Pandas to respect session timezone	2017-11-28 16:45:22 +08:00
types.py	[SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp values for Pandas to respect session timezone	2017-11-28 16:45:22 +08:00
udf.py	[SPARK-22409] Introduce function type argument in pandas_udf	2017-11-17 16:43:08 +01:00
utils.py	[MINOR][DOCS] Remove consecutive duplicated words/typo in Spark Repo	2017-01-04 15:07:29 +00:00
window.py	[SPARK-18690][PYTHON][SQL] Backward compatibility of unbounded frames	2016-12-02 17:39:28 -08:00