f90eb6a5db
### What changes were proposed in this pull request? With SPARK-34806 we can now easily add an equivalent for `Dataset.observe(Observation, Column, Column*)` to PySpark's `DataFrame` API. ### Why are the changes needed? This further aligns the Python DataFrame API with Scala Dataset API. ### Does this PR introduce _any_ user-facing change? Yes, it adds the `Observation` class and the `DataFrame.observe` method. ### How was this patch tested? Adds test `test_observe` to `pyspark.sql.test.test_dataframe`. Closes #33484 from EnricoMi/branch-observation-python. Authored-by: Enrico Minack <github@enrico.minack.dev> Signed-off-by: Wenchen Fan <wenchen@databricks.com> |
||
---|---|---|
.. | ||
__init__.py | ||
modules.py | ||
shellutils.py | ||
toposort.py |