spark-instrumented-optimizer/python/pyspark/sql
Xiangrui Meng 0401cbaa8e [SPARK-7157][SQL] add sampleBy to DataFrame
Add `sampleBy` to DataFrame. rxin

Author: Xiangrui Meng <meng@databricks.com>

Closes #6769 from mengxr/SPARK-7157 and squashes the following commits:

991f26f [Xiangrui Meng] fix seed
4a14834 [Xiangrui Meng] move sampleBy to stat
832f7cc [Xiangrui Meng] add sampleBy to DataFrame
2015-06-23 17:46:29 -07:00
..
__init__.py [SPARK-8060] Improve DataFrame Python test coverage and documentation. 2015-06-03 00:23:34 -07:00
column.py [SPARK-8573] [SPARK-8568] [SQL] [PYSPARK] raise Exception if column is used in booelan expression 2015-06-23 15:51:16 -07:00
context.py [SPARK-8104] [SQL] auto alias expressions in analyzer 2015-06-22 12:13:00 -07:00
dataframe.py [SPARK-7157][SQL] add sampleBy to DataFrame 2015-06-23 17:46:29 -07:00
functions.py [SPARK-8207] [SQL] Add math function bin 2015-06-19 10:09:31 -07:00
group.py [SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related updates 2015-05-23 08:30:05 -07:00
readwriter.py [SPARK-8532] [SQL] In Python's DataFrameWriter, save/saveAsTable/json/parquet/jdbc always override mode 2015-06-22 13:51:23 -07:00
tests.py [SPARK-8573] [SPARK-8568] [SQL] [PYSPARK] raise Exception if column is used in booelan expression 2015-06-23 15:51:16 -07:00
types.py [SPARK-6411] [SQL] [PySpark] support date/datetime with timezone in Python 2015-06-11 01:00:41 -07:00
window.py [SPARK-8146] DataFrame Python API: Alias replace in df.na 2015-06-07 01:21:02 -07:00