spark-instrumented-optimizer/python/pyspark/sql
Xiangrui Meng df32669514 [SPARK-7157][SQL] add sampleBy to DataFrame
This was previously committed but then reverted due to test failures (see #6769).

Author: Xiangrui Meng <meng@databricks.com>

Closes #7755 from rxin/SPARK-7157 and squashes the following commits:

fbf9044 [Xiangrui Meng] fix python test
542bd37 [Xiangrui Meng] update test
604fe6d [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-7157
f051afd [Xiangrui Meng] use udf instead of building expression
f4e9425 [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-7157
8fb990b [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-7157
103beb3 [Xiangrui Meng] add Java-friendly sampleBy
991f26f [Xiangrui Meng] fix seed
4a14834 [Xiangrui Meng] move sampleBy to stat
832f7cc [Xiangrui Meng] add sampleBy to DataFrame
2015-07-30 17:16:03 -07:00
..
__init__.py [SPARK-8060] Improve DataFrame Python test coverage and documentation. 2015-06-03 00:23:34 -07:00
column.py [SPARK-8573] [SPARK-8568] [SQL] [PYSPARK] raise Exception if column is used in booelan expression 2015-06-23 15:51:16 -07:00
context.py [SPARK-9116] [SQL] [PYSPARK] support Python only UDT in __main__ 2015-07-29 22:30:49 -07:00
dataframe.py [SPARK-7157][SQL] add sampleBy to DataFrame 2015-07-30 17:16:03 -07:00
functions.py [SPARK-8186] [SPARK-8187] [SPARK-8194] [SPARK-8198] [SPARK-9133] [SPARK-9290] [SQL] functions: date_add, date_sub, add_months, months_between, time-interval calculation 2015-07-30 13:21:46 -07:00
group.py [SPARK-8770][SQL] Create BinaryOperator abstract class. 2015-07-01 21:14:13 -07:00
readwriter.py [SPARK-9100] [SQL] Adds DataFrame reader/writer shortcut methods for ORC 2015-07-21 15:08:44 +08:00
tests.py [SPARK-9116] [SQL] [PYSPARK] support Python only UDT in __main__ 2015-07-29 22:30:49 -07:00
types.py [SPARK-9408] [PYSPARK] [MLLIB] Refactor linalg.py to /linalg 2015-07-30 16:57:38 -07:00
utils.py [SPARK-9166][SQL][PYSPARK] Capture and hide IllegalArgumentException in Python API 2015-07-19 00:32:56 -07:00
window.py [SPARK-8146] DataFrame Python API: Alias replace in df.na 2015-06-07 01:21:02 -07:00