spark-instrumented-optimizer

History

William Hyun 2ab82fae57 [SPARK-31963][PYSPARK][SQL] Support both pandas 0.23 and 1.0 in serializers.py ### What changes were proposed in this pull request? This PR aims to support both pandas 0.23 and 1.0. ### Why are the changes needed? ``` $ pip install pandas==0.23.2 $ python -c "import pandas.CategoricalDtype" Traceback (most recent call last): File "<string>", line 1, in <module> ModuleNotFoundError: No module named 'pandas.CategoricalDtype' $ python -c "from pandas.api.types import CategoricalDtype" ``` ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Pass the Jenkins. ``` $ pip freeze \| grep pandas pandas==0.23.2 $ python/run-tests.py --python-executables python --modules pyspark-sql ... Tests passed in 359 seconds ``` Closes #28789 from williamhyun/williamhyun-patch-2. Authored-by: William Hyun <williamhyun3@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>		2020-06-10 14:42:45 -07:00
..
avro	[SPARK-27506][SQL][FOLLOWUP] Use option `avroSchema` to specify an evolved schema in `from_avro`	2019-12-30 18:14:21 +09:00
pandas	[SPARK-31963][PYSPARK][SQL] Support both pandas 0.23 and 1.0 in serializers.py	2020-06-10 14:42:45 -07:00
tests	[SPARK-31945][SQL][PYSPARK] Enable cache for the same Python function	2020-06-10 16:38:59 +09:00
__init__.py	[SPARK-31088][SQL] Add back HiveContext and createExternalTable	2020-03-26 23:51:15 -07:00
catalog.py	[SPARK-31088][SQL] Add back HiveContext and createExternalTable	2020-03-26 23:51:15 -07:00
column.py	[SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead	2020-04-27 14:49:22 +09:00
conf.py	[SPARK-23698][PYTHON] Resolve undefined names in Python 3	2018-08-22 10:06:59 -07:00
context.py	[SPARK-31088][SQL] Add back HiveContext and createExternalTable	2020-03-26 23:51:15 -07:00
dataframe.py	[SPARK-31895][PYTHON][SQL] Support DataFrame.explain(extended: str) case to be consistent with Scala side	2020-06-03 12:07:05 +09:00
functions.py	[SPARK-31306][DOCS] update rand() function documentation to indicate exclusive upper bound	2020-03-31 15:16:17 +09:00
group.py	[SPARK-30434][PYTHON][SQL] Move pandas related functionalities into 'pandas' sub-package	2020-01-09 10:22:50 +09:00
readwriter.py	[SPARK-31739][PYSPARK][DOCS][MINOR] Fix docstring syntax issues and misplaced space characters	2020-05-18 20:25:02 +09:00
session.py	[SPARK-30856][SQL][PYSPARK] Fix SQLContext.getOrCreate() when SparkContext is restarted	2020-02-20 12:21:24 +09:00
streaming.py	[SPARK-31739][PYSPARK][DOCS][MINOR] Fix docstring syntax issues and misplaced space characters	2020-05-18 20:25:02 +09:00
types.py	[SPARK-30941][PYSPARK] Add a note to asDict to document its behavior when there are duplicate fields	2020-03-09 11:06:45 -07:00
udf.py	[SPARK-30722][PYTHON][DOCS] Update documentation for Pandas UDF with Python type hints	2020-02-12 10:49:46 +09:00
utils.py	[SPARK-31849][PYTHON][SQL][FOLLOW-UP] More correct error message in Python UDF exception message	2020-06-09 10:24:34 +09:00
window.py	[SPARK-30188][SQL] Resolve the failed unit tests when enable AQE	2020-01-13 22:55:19 +08:00