4d2b559d92
### What changes were proposed in this pull request? Consolidate PySpark testing utils by removing `python/pyspark/pandas/testing`, and then creating a file `pandasutils` under `python/pyspark/testing` for test utilities used in `pyspark/pandas`. ### Why are the changes needed? `python/pyspark/pandas/testing` hold test utilites for pandas-on-spark, and `python/pyspark/testing` contain test utilities for pyspark. Consolidating them makes code cleaner and easier to maintain. Updated import statements are as shown below: - from pyspark.testing.sqlutils import SQLTestUtils - from pyspark.testing.pandasutils import PandasOnSparkTestCase, TestUtils (PandasOnSparkTestCase is the original ReusedSQLTestCase in `python/pyspark/pandas/testing/utils.py`) Minor improvements include: - Usage of missing library's requirement_message - `except ImportError` rather than `except` - import pyspark.pandas alias as `ps` rather than `pp` ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Unit tests under python/pyspark/pandas/tests. Closes #32177 from xinrong-databricks/port.merge_utils. Authored-by: Xinrong Meng <xinrong.meng@databricks.com> Signed-off-by: Takuya UESHIN <ueshin@databricks.com> |
||
---|---|---|
.. | ||
indexes | ||
missing | ||
plot | ||
spark | ||
tests | ||
typedef | ||
usage_logging | ||
__init__.py | ||
accessors.py | ||
base.py | ||
categorical.py | ||
config.py | ||
datetimes.py | ||
exceptions.py | ||
extensions.py | ||
frame.py | ||
generic.py | ||
groupby.py | ||
indexing.py | ||
internal.py | ||
ml.py | ||
mlflow.py | ||
namespace.py | ||
numpy_compat.py | ||
series.py | ||
sql_processor.py | ||
strings.py | ||
utils.py | ||
version.py | ||
window.py |