spark-instrumented-optimizer/python/docs/source/reference/ps_general_functions.rst
Hyukjin Kwon 3d158f9c91 [SPARK-35587][PYTHON][DOCS] Initial porting of Koalas documentation
### What changes were proposed in this pull request?

This PR proposes to port Koalas documentation to PySpark documentation as its initial step.
It ports almost as is except these differences:

- Renamed import from `databricks.koalas` to `pyspark.pandas`.
- Renamed `to_koalas` -> `to_pandas_on_spark`
- Renamed `(Series|DataFrame).koalas` -> `(Series|DataFrame).pandas_on_spark`
- Added a `ps_` prefix in the RST file names of Koalas documentation

Other then that,

- Excluded `python/docs/build/html` in linter
- Fixed GA dependency installataion

### Why are the changes needed?

To document pandas APIs on Spark.

### Does this PR introduce _any_ user-facing change?

Yes, it adds new documentations.

### How was this patch tested?

Manually built the docs and checked the output.

Closes #32726 from HyukjinKwon/SPARK-35587.

Authored-by: Hyukjin Kwon <gurwls223@apache.org>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
2021-06-04 11:11:09 +09:00

49 lines
688 B
ReStructuredText

.. _api.general_functions:
=================
General functions
=================
.. currentmodule:: pyspark.pandas
Working with options
--------------------
.. autosummary::
:toctree: api/
reset_option
get_option
set_option
option_context
Data manipulations and SQL
--------------------------
.. autosummary::
:toctree: api/
melt
merge
get_dummies
concat
sql
broadcast
Top-level missing data
----------------------
.. autosummary::
:toctree: api/
to_numeric
isna
isnull
notna
notnull
Top-level dealing with datetimelike
-----------------------------------
.. autosummary::
:toctree: api/
to_datetime
date_range