spark-instrumented-optimizer

History

Xinrong Meng 50f7686de9 [SPARK-35599][PYTHON] Adjust `check_exact` parameter for older pd.testing ### What changes were proposed in this pull request? Adjust the `check_exact` parameter for non-numeric columns to ensure pandas-on-Spark tests passed with all pandas versions. ### Why are the changes needed? `pd.testing` utils are utilized in pandas-on-Spark tests. Due to https://github.com/pandas-dev/pandas/issues/35446, `check_exact=True` for non-numeric columns doesn't work for older pd.testing utils, e.g. `assert_series_equal`. We wanted to adjust that to ensure pandas-on-Spark tests pass for all pandas versions. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Existing unit tests. Closes #32772 from xinrong-databricks/test_util. Authored-by: Xinrong Meng <xinrong.meng@databricks.com> Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>		2021-06-07 11:12:49 +09:00
..
__init__.py	[SPARK-26032][PYTHON] Break large sql/tests.py files into smaller files	2018-11-14 14:51:11 +08:00
mllibutils.py	[SPARK-26033][SPARK-26034][PYTHON][FOLLOW-UP] Small cleanup and deduplication in ml/mllib tests	2018-12-03 14:03:10 -08:00
mlutils.py	[SPARK-33520][ML][PYSPARK] make CrossValidator/TrainValidateSplit/OneVsRest Reader/Writer support Python backend estimator/evaluator	2020-12-04 08:35:50 +08:00
pandasutils.py	[SPARK-35599][PYTHON] Adjust `check_exact` parameter for older pd.testing	2021-06-07 11:12:49 +09:00
sqlutils.py	[SPARK-32714][PYTHON] Initial pyspark-stubs port	2020-09-24 14:15:36 +09:00
streamingutils.py	[SPARK-33254][PYTHON][DOCS] Migration to NumPy documentation style in Core (pyspark., pyspark.resource., etc.)	2020-11-16 10:21:50 +09:00
utils.py	[SPARK-32194][PYTHON] Use proper exception classes instead of plain Exception	2021-05-26 11:54:40 +09:00