spark-instrumented-optimizer/CHANGES.txt
2015-06-22 22:18:52 -07:00

Spark Change Log
----------------
Release 1.4.1
[SPARK-8548] [SPARKR] Remove the trailing whitespaces from the SparkR files
Yu ISHIKAWA <yuu.ishikawa@gmail.com>
2015-06-22 20:55:38 -0700
Commit: 2501794, github.com/apache/spark/pull/6945
[SPARK-7859] [SQL] Collect_set() behavior differences which fails the unit test under jdk8
Cheng Hao <hao.cheng@intel.com>
2015-06-22 20:04:49 -0700
Commit: d73900a, github.com/apache/spark/pull/6402
[SPARK-8532] [SQL] In Python's DataFrameWriter, save/saveAsTable/json/parquet/jdbc always override mode
Yin Huai <yhuai@databricks.com>
2015-06-22 13:51:23 -0700
Commit: 994abba, github.com/apache/spark/pull/6937
[SPARK-8511] [PYSPARK] Modify a test to remove a saved model in `regression.py`
Yu ISHIKAWA <yuu.ishikawa@gmail.com>
2015-06-22 11:53:11 -0700
Commit: 507381d, github.com/apache/spark/pull/6926
[SPARK-8420] [SQL] Fix comparison of timestamps/dates with strings (branch-1.4)
Michael Armbrust <michaeldatabricks.com>, Michael Armbrust <michael@databricks.com>
2015-06-22 10:45:33 -0700
Commit: 6598161, github.com/apache/spark/pull/6888
[SPARK-8406] [SQL] Backports SPARK-8406 and PR #6864 to branch-1.4
Cheng Lian <lian@databricks.com>
2015-06-22 10:04:29 -0700
Commit: 451c872, github.com/apache/spark/pull/6932
[HOTFIX] Hotfix branch-1.4 building by removing avgMetrics in CrossValidatorSuite
Liang-Chi Hsieh <viirya@gmail.com>
2015-06-21 22:25:08 -0700
Commit: b836bac, github.com/apache/spark/pull/6929
[SPARK-7715] [MLLIB] [ML] [DOC] Updated MLlib programming guide for release 1.4
Joseph K. Bradley <joseph@databricks.com>
2015-06-21 16:25:25 -0700
Commit: 2a7ea31, github.com/apache/spark/pull/6897
[SPARK-8379] [SQL] avoid speculative tasks write to the same file
jeanlyn <jeanlyn92@gmail.com>
2015-06-21 00:13:40 -0700
Commit: f0e4040, github.com/apache/spark/pull/6833
[SPARK-8468] [ML] Take the negative of some metrics in RegressionEvaluator to get correct cross validation
Liang-Chi Hsieh <viirya@gmail.com>
2015-06-20 13:01:59 -0700
Commit: fe59a4a, github.com/apache/spark/pull/6905
[HOTFIX] [SPARK-8489] Correct JIRA number in previous commit
Andrew Or <andrew@databricks.com>
2015-06-19 17:39:26 -0700
Commit: 9b16508
[SPARK-8390] [STREAMING] [KAFKA] fix docs related to HasOffsetRanges
cody koeninger <cody@koeninger.org>
2015-06-19 17:16:56 -0700
Commit: a7b773a, github.com/apache/spark/pull/6863
[SPARK-8389] [STREAMING] [KAFKA] Example of getting offset ranges out o…
cody koeninger <cody@koeninger.org>
2015-06-19 14:51:19 +0200
Commit: 78d0cee, github.com/apache/spark/pull/6846
[SPARK-8498] [SQL] Add regression test for SPARK-8470
Andrew Or <andrew@databricks.com>
2015-06-19 17:34:09 -0700
Commit: 2248ad8, github.com/apache/spark/pull/6909
[HOT-FIX] Fix compilation (caused by 0131142d98b191f6cc112d383aa10582a3ac35bf)
Yin Huai <yhuai@databricks.com>
2015-06-19 17:29:51 -0700
Commit: 2510365, github.com/apache/spark/pull/6913
[SPARK-8093] [SQL] Remove empty structs inferred from JSON documents
Nathan Howell <nhowell@godaddy.com>
2015-06-19 16:19:28 -0700
Commit: 0131142, github.com/apache/spark/pull/6799
[SPARK-8452] [SPARKR] expose jobGroup API in SparkR
Hossein <hossein@databricks.com>
2015-06-19 15:47:22 -0700
Commit: 1a6b510, github.com/apache/spark/pull/6889
[SPARK-8368] [SPARK-8058] [SQL] HiveContext may override the context class loader of the current thread (branch 1.4)
Yin Huai <yhuai@databricks.com>
2015-06-19 11:15:28 -0700
Commit: 9ac8393, github.com/apache/spark/pull/6895
[SPARK-7180] [SPARK-8090] [SPARK-8091] Fix a number of SerializationDebugger bugs and limitations
Tathagata Das <tathagata.das1565@gmail.com>
2015-06-19 10:52:30 -0700
Commit: 4b2c793, github.com/apache/spark/pull/6625
[SPARK-5836] [DOCS] [STREAMING] Clarify what may cause long-running Spark apps to preserve shuffle files
Sean Owen <sowen@cloudera.com>
2015-06-19 11:03:04 -0700
Commit: 3415fb9, github.com/apache/spark/pull/6901
[SPARK-8451] [SPARK-7287] SparkSubmitSuite should check exit code
Andrew Or <andrew@databricks.com>
2015-06-19 10:56:19 -0700
Commit: aedd893, github.com/apache/spark/pull/6886
[SPARK-8430] ExternalShuffleBlockResolver of shuffle service should support UnsafeShuffleManager
Lianhui Wang <lianhuiwang09@gmail.com>
2015-06-19 10:47:07 -0700
Commit: 6f2e411, github.com/apache/spark/pull/6873
[SPARK-8151] [MLLIB] pipeline components should correctly implement copy
Xiangrui Meng <meng@databricks.com>
2015-06-19 09:46:51 -0700
Commit: 1f2dafb, github.com/apache/spark/pull/6622
[SPARK-8339] [PYSPARK] integer division for python 3
Kevin Conor <kevin@discoverybayconsulting.com>
2015-06-19 00:12:20 -0700
Commit: 164b9d3, github.com/apache/spark/pull/6794
[SPARK-8458] [SQL] Don't strip scheme part of output path when writing ORC files
Cheng Lian <lian@databricks.com>
2015-06-18 22:01:52 -0700
Commit: f48f3a2, github.com/apache/spark/pull/6892
[SPARK-8080] [STREAMING] Receiver.store with Iterator does not give correct count at Spark UI
Dibyendu Bhattacharya <dibyendu.bhattacharya1@pearson.com>, U-PEROOT\UBHATD1 <UBHATD1@PIN-L-PI046.PEROOT.com>
2015-06-18 19:58:47 -0700
Commit: b55e4b9, github.com/apache/spark/pull/6707
[SPARK-8462] [DOCS] Documentation fixes for Spark SQL
Lars Francke <lars.francke@gmail.com>
2015-06-18 19:40:32 -0700
Commit: bd9bbd6, github.com/apache/spark/pull/6890
[SPARK-8446] [SQL] Add helper functions for testing SparkPlan physical operators
Josh Rosen <joshrosen@databricks.com>, Josh Rosen <rosenville@gmail.com>, Michael Armbrust <michael@databricks.com>
2015-06-18 16:45:14 -0700
Commit: 152f446, github.com/apache/spark/pull/6885
[SPARK-8376] [DOCS] Add common lang3 to the Spark Flume Sink doc
zsxwing <zsxwing@gmail.com>
2015-06-18 16:00:27 -0700
Commit: 9f293a9, github.com/apache/spark/pull/6829
[SPARK-8353] [DOCS] Show anchor links when hovering over documentation headers
Josh Rosen <joshrosen@databricks.com>
2015-06-18 15:10:09 -0700
Commit: c1da5cf, github.com/apache/spark/pull/6808
[SPARK-8202] [PYSPARK] fix infinite loop during external sort in PySpark
Davies Liu <davies@databricks.com>
2015-06-18 13:45:58 -0700
Commit: ca23c3b, github.com/apache/spark/pull/6714
[SPARK-8095] Resolve dependencies of --packages in local ivy cache
Burak Yavuz <brkyvz@gmail.com>
2015-06-17 22:33:37 -0700
Commit: 9dabc12, github.com/apache/spark/pull/6788
[SPARK-8392] RDDOperationGraph: getting cached nodes is slow
xutingjun <xutingjun@huawei.com>
2015-06-17 22:31:01 -0700
Commit: 67ad12d, github.com/apache/spark/pull/6839
[SPARK-8306] [SQL] AddJar command needs to set the new class loader to the HiveConf inside executionHive.state.
Yin Huai <yhuai@databricks.com>
2015-06-17 14:52:43 -0700
Commit: 73cf5de, github.com/apache/spark/pull/6758
[SPARK-8404] [STREAMING] [TESTS] Use thread-safe collections to make the tests more reliable
zsxwing <zsxwing@gmail.com>
2015-06-17 15:00:03 -0700
Commit: 5aedfa2, github.com/apache/spark/pull/6852
[SPARK-8373] [PYSPARK] Add emptyRDD to pyspark and fix the issue when calling sum on an empty RDD
zsxwing <zsxwing@gmail.com>
2015-06-17 13:59:39 -0700
Commit: 5e7973d, github.com/apache/spark/pull/6826
[SPARK-8372] History server shows incorrect information for application not started
Carson Wang <carson.wang@intel.com>
2015-06-17 13:41:36 -0700
Commit: f051373, github.com/apache/spark/pull/6827
[SPARK-8161] Set externalBlockStoreInitialized to be true, after ExternalBlockStore is initialized
Mingfei <mingfei.shi@intel.com>
2015-06-17 13:40:07 -0700
Commit: d75c53d, github.com/apache/spark/pull/6702
[SPARK-7515] [DOC] Update documentation for PySpark on YARN with cluster mode
Kousuke Saruta <sarutakoss.nttdata.co.jp>, Punya Biswal <pbiswal@palantir.com>, Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-06-17 13:37:20 -0700
Commit: a7f6979, github.com/apache/spark/pull/6040
[SPARK-8395] [DOCS] start-slave.sh docs incorrect
Sean Owen <sowen@cloudera.com>
2015-06-17 13:31:10 -0700
Commit: 320c442, github.com/apache/spark/pull/6855
[SPARK-8309] [CORE] Support for more than 12M items in OpenHashMap
Vyacheslav Baranov <slavik.baranov@gmail.com>
2015-06-17 09:42:29 +0100
Commit: a5f602e, github.com/apache/spark/pull/6763
Fix break introduced by backport
Punya Biswal <pbiswal@palantir.com>
2015-06-16 22:31:49 -0700
Commit: 877deb0, github.com/apache/spark/pull/6850
[SPARK-7916] [MLLIB] MLlib Python doc parity check for classification and regression
Yanbo Liang <ybliang8@gmail.com>
2015-06-16 14:30:30 -0700
Commit: 15d973f, github.com/apache/spark/pull/6460
[SPARK-8126] [BUILD] Make sure temp dir exists when running tests.
Marcelo Vanzin <vanzin@cloudera.com>
2015-06-16 21:10:18 +0100
Commit: b9e5d3c, github.com/apache/spark/pull/6805
[SQL] [DOC] improved a comment
Radek Ostrowski <dest.hawaii@gmail.com>, radek <radek@radeks-MacBook-Pro-2.local>
2015-06-16 21:04:26 +0100
Commit: 4da0686, github.com/apache/spark/pull/6332
[SPARK-DOCS] [SPARK-SQL] Update sql-programming-guide.md
Moussa Taifi <moutai10@gmail.com>
2015-06-16 20:59:22 +0100
Commit: 1378bdc, github.com/apache/spark/pull/6847
[SPARK-8367] [STREAMING] Add a limit for `spark.streaming.blockInterval` since a data loss bug.
huangzhaowei <carlmartinmax@gmail.com>, huangzhaowei <SaintBacchus@users.noreply.github.com>
2015-06-16 08:16:09 +0200
Commit: f287f7e, github.com/apache/spark/pull/6818
SPARK-8336 Fix NullPointerException with functions.rand()
tedyu <yuzhihong@gmail.com>
2015-06-15 17:00:38 -0700
Commit: fff8d7e, github.com/apache/spark/pull/6793
fix read/write mixup
Peter Hoffmann <ph@peter-hoffmann.com>
2015-06-14 11:41:16 -0700
Commit: 0ffbf08, github.com/apache/spark/pull/6815
[SPARK-8358] [SQL] Wait for child resolution when resolving generators
Michael Armbrust <michael@databricks.com>
2015-06-14 11:21:42 -0700
Commit: 2805d14, github.com/apache/spark/pull/6811
[SPARK-8354] [SQL] Fix off-by-factor-of-8 error when allocating scratch space in UnsafeFixedWidthAggregationMap
Josh Rosen <joshrosen@databricks.com>
2015-06-14 09:34:35 -0700
Commit: 4634be5, github.com/apache/spark/pull/6809
[Spark-8343] [Streaming] [Docs] Improve Spark Streaming Guides.
Mike Dusenberry <dusenberrymw@gmail.com>
2015-06-13 21:22:46 -0700
Commit: 187a3d5, github.com/apache/spark/pull/6801
[SPARK-8329][SQL] Allow _ in DataSource options
Michael Armbrust <michael@databricks.com>
2015-06-12 23:11:16 -0700
Commit: 1ca431e, github.com/apache/spark/pull/6786
[SPARK-7284] [STREAMING] Updated streaming documentation
Tathagata Das <tathagata.das1565@gmail.com>
2015-06-12 15:22:59 -0700
Commit: 7c11ccf, github.com/apache/spark/pull/6781
[SPARK-8330] DAG visualization: trim whitespace from input
Andrew Or <andrew@databricks.com>
2015-06-12 11:14:55 -0700
Commit: 7608373, github.com/apache/spark/pull/6787
[SPARK-8322] [EC2] Added spark 1.4.0 into the VALID_SPARK_VERSIONS and…
Mark Smith <mark.smith@bronto.com>
2015-06-12 10:28:30 -0700
Commit: 141eab7, github.com/apache/spark/pull/6777
[SPARK-6511] [docs] Fix example command in hadoop-provided docs.
Marcelo Vanzin <vanzin@cloudera.com>
2015-06-11 15:29:03 -0700
Commit: 8b25f62, github.com/apache/spark/pull/6766
[SPARK-8310] [EC2] Update spark-ec2 branch to 1.4
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-06-11 13:22:08 -0700
Commit: 3a62569, github.com/apache/spark/pull/6765
[SPARK-8289] Specify stack size for consistency with Java tests - resolves test failures
Adam Roberts <aroberts@uk.ibm.com>, a-roberts <aroberts@uk.ibm.com>
2015-06-11 08:40:46 +0100
Commit: b313920, github.com/apache/spark/pull/6727
[SPARK-8285] [SQL] CombineSum should be calculated as unlimited decimal first
navis.ryu <navis@apache.org>
2015-06-10 18:19:12 -0700
Commit: 5c05b5c, github.com/apache/spark/pull/6736
[SPARK-8200] [MLLIB] Check for empty RDDs in StreamingLinearAlgorithm
Paavo <pparkkin@gmail.com>
2015-06-10 23:17:42 +0100
Commit: 59fc3f1, github.com/apache/spark/pull/6713
[SPARK-8273] Driver hangs up when yarn shutdown in client mode
WangTaoTheTonic <wangtao111@huawei.com>
2015-06-10 13:34:19 -0700
Commit: 2846a35, github.com/apache/spark/pull/6717
[SPARK-7756] CORE RDDOperationScope fix for IBM Java
Adam Roberts <aroberts@uk.ibm.com>, a-roberts <aroberts@uk.ibm.com>
2015-06-10 13:21:01 -0700
Commit: 568d1d5, github.com/apache/spark/pull/6740
[SPARK-8282] [SPARKR] Make number of threads used in RBackend configurable
Hossein <hossein@databricks.com>
2015-06-10 13:18:48 -0700
Commit: 28e8a6e, github.com/apache/spark/pull/6730
[SQL] [MINOR] Fixes a minor Java example error in SQL programming guide
Cheng Lian <lian@databricks.com>
2015-06-10 11:48:14 -0700
Commit: 7b88e6a, github.com/apache/spark/pull/6749
[SPARK-6511] [DOCUMENTATION] Explain how to use Hadoop provided builds
Patrick Wendell <patrick@databricks.com>
2015-06-09 16:14:21 -0700
Commit: a0a7f2f, github.com/apache/spark/pull/6729
[MINOR] [UI] DAG visualization: trim whitespace from input
Andrew Or <andrew@databricks.com>
2015-06-09 15:44:02 -0700
Commit: 1175cfe, github.com/apache/spark/pull/6732
[SPARK-8274] [DOCUMENTATION-MLLIB] Fix wrong URLs in MLlib Frequent Pattern Mining Documentation
FavioVazquez <favio.vazquezp@gmail.com>
2015-06-09 15:02:18 +0100
Commit: a7b7a19, github.com/apache/spark/pull/6722
[SPARK-6820] [SPARKR] Convert NAs to null type in SparkR DataFrames
hqzizania <qian.huang@intel.com>
2015-06-08 21:40:12 -0700
Commit: 0a9383d, github.com/apache/spark/pull/6190
[SPARK-8162] [HOTFIX] Fix NPE in spark-shell
Andrew Or <andrew@databricks.com>
2015-06-08 18:09:21 -0700
Commit: e9a8372, github.com/apache/spark/pull/6711
[SPARK-8126] [BUILD] Use custom temp directory during build.
Marcelo Vanzin <vanzin@cloudera.com>
2015-06-08 15:37:28 +0100
Commit: 99c2a57, github.com/apache/spark/pull/6674
[SPARK-8121] [SQL] Fixes InsertIntoHadoopFsRelation job initialization for Hadoop 1.x (branch 1.4 backport based on https://github.com/apache/spark/pull/6669)
Yin Huai <yhuai@databricks.com>
2015-06-08 11:35:30 -0700
Commit: 69197c3
[SPARK-7705] [YARN] Cleanup of .sparkStaging directory fails if application is killed
linweizhong <linweizhong@huawei.com>
2015-06-08 09:34:16 +0100
Commit: a3afc2c, github.com/apache/spark/pull/6409
[SPARK-4761] [DOC] [SQL] kryo default setting in SQL Thrift server
Daoyuan Wang <daoyuan.wang@intel.com>
2015-06-08 01:07:50 -0700
Commit: 58bfdd6, github.com/apache/spark/pull/6639
[SPARK-8004][SQL] Quote identifier in JDBC data source.
Reynold Xin <rxin@databricks.com>
2015-06-07 10:52:02 -0700
Commit: b9c046f, github.com/apache/spark/pull/6689
[SPARK-8146] DataFrame Python API: Alias replace in df.na
Reynold Xin <rxin@databricks.com>
2015-06-07 01:21:02 -0700
Commit: ff26767, github.com/apache/spark/pull/6688
[SPARK-8141] [SQL] Precompute datatypes for partition columns and reuse it
Liang-Chi Hsieh <viirya@gmail.com>
2015-06-07 15:33:48 +0800
Commit: b4d5441, github.com/apache/spark/pull/6687
[SPARK-8145] [WEBUI] Trigger a double click on the span to show full job description.
979969786 <q79969786@gmail.com>
2015-06-06 23:15:27 -0700
Commit: 9d1f4d6, github.com/apache/spark/pull/6646
[SPARK-8004][SQL] Enclose column names by JDBC Dialect
Liang-Chi Hsieh <viirya@gmail.com>
2015-06-06 22:59:31 -0700
Commit: b6fdc6c, github.com/apache/spark/pull/6577
[SPARK-7955] [CORE] Ensure executors with cached RDD blocks are not re…
Hari Shreedharan <hshreedharan@apache.org>
2015-06-06 21:13:26 -0700
Commit: 6faaf15, github.com/apache/spark/pull/6508
[SPARK-8079] [SQL] Makes InsertIntoHadoopFsRelation job/task abortion more robust
Cheng Lian <lian@databricks.com>
2015-06-06 17:23:12 +0800
Commit: d8a53fb, github.com/apache/spark/pull/6612
[SPARK-7991] [PySpark] Adding support for passing lists to describe.
amey <amey@skytree.net>
2015-06-05 13:49:33 -0700
Commit: 84523fc, github.com/apache/spark/pull/6655
[SPARK-7747] [SQL] [DOCS] spark.sql.planner.externalSort
Luca Martinetti <luca@luca.io>
2015-06-05 13:40:11 -0700
Commit: 94f65bc, github.com/apache/spark/pull/6272
[SPARK-8112] [STREAMING] Fix the negative event count issue
zsxwing <zsxwing@gmail.com>
2015-06-05 12:46:02 -0700
Commit: 200c980, github.com/apache/spark/pull/6659
Revert "[MINOR] [BUILD] Use custom temp directory during build."
Andrew Or <andrew@databricks.com>
2015-06-05 10:54:06 -0700
Commit: 429c658
[SPARK-8085] [SPARKR] Support user-specified schema in read.df
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-06-05 10:19:03 -0700
Commit: 3e3151e, github.com/apache/spark/pull/6620
[STREAMING] Update streaming-kafka-integration.md
Akhil Das <akhld@darktech.ca>
2015-06-05 14:23:23 +0200
Commit: 0ef2e9d, github.com/apache/spark/pull/6666
[MINOR] [BUILD] Use custom temp directory during build.
Marcelo Vanzin <vanzin@cloudera.com>
2015-06-05 14:11:38 +0200
Commit: 9b3e4c1, github.com/apache/spark/pull/6653
[MINOR] remove unused interpolation var in log message
Sean Owen <sowen@cloudera.com>
2015-06-05 00:32:46 -0700
Commit: 90cf686, github.com/apache/spark/pull/6650
[SPARK-8116][PYSPARK] Allow sc.range() to take a single argument.
Ted Blackman <ted.blackman@gmail.com>
2015-06-04 22:21:11 -0700
Commit: f02af7c, github.com/apache/spark/pull/6656
[SPARK-8098] [WEBUI] Show correct length of bytes on log page
Carson Wang <carson.wang@intel.com>
2015-06-04 16:24:50 -0700
Commit: 3ba6fc5, github.com/apache/spark/pull/6640
[SPARK-8027] [SPARKR] Move man pages creation to install-dev.sh
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-06-04 12:52:16 -0700
Commit: 0b71b85, github.com/apache/spark/pull/6593
[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column reference.
Mike Dusenberry <dusenberrymw@gmail.com>
2015-06-04 11:30:07 -0700
Commit: 81ff7a9, github.com/apache/spark/pull/6585
Fix maxTaskFailures comment
Daniel Darabos <darabos.daniel@gmail.com>
2015-06-04 13:46:49 +0200
Commit: daf9451, github.com/apache/spark/pull/6621
[BUILD] Fix Maven build for Kinesis
Andrew Or <andrew@databricks.com>
2015-06-03 20:45:31 -0700
Commit: 84da653
[SPARK-7558] Demarcate tests in unit-tests.log (1.4)
Andrew Or <andrew@databricks.com>
2015-06-03 20:46:44 -0700
Commit: bfe74b3, github.com/apache/spark/pull/6598
[BUILD] Use right branch when checking against Hive (1.4)
Andrew Or <andrew@databricks.com>
2015-06-03 18:09:14 -0700
Commit: 584a2ba, github.com/apache/spark/pull/6630
[BUILD] Increase Jenkins test timeout
Andrew Or <andrew@databricks.com>
2015-06-03 17:40:14 -0700
Commit: 96f71b1
[SPARK-8084] [SPARKR] Make SparkR scripts fail on error
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-06-03 17:02:16 -0700
Commit: c2c1290, github.com/apache/spark/pull/6623
[SPARK-8088] don't attempt to lower number of executors by 0
Ryan Williams <ryan.blake.williams@gmail.com>
2015-06-03 16:54:46 -0700
Commit: 1674869, github.com/apache/spark/pull/6624
[HOTFIX] [TYPO] Fix typo in #6546
Andrew Or <andrew@databricks.com>
2015-06-03 16:04:02 -0700
Commit: 0bc9a3e
[HOTFIX] Unbreak build from backporting #6546
Andrew Or <andrew@databricks.com>
2015-06-03 15:25:35 -0700
Commit: d0be950
[SPARK-8051] [MLLIB] make StringIndexerModel silent if input column does not exist
Xiangrui Meng <meng@databricks.com>
2015-06-03 15:16:24 -0700
Commit: b2a22a6, github.com/apache/spark/pull/6595
[SPARK-3674] [EC2] Clear SPARK_WORKER_INSTANCES when using YARN
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-06-03 15:14:38 -0700
Commit: ca21fff, github.com/apache/spark/pull/6424
[SPARK-7989] [CORE] [TESTS] Fix flaky tests in ExternalShuffleServiceSuite and SparkListenerWithClusterSuite
zsxwing <zsxwing@gmail.com>
2015-06-03 15:04:20 -0700
Commit: 7e46ea0, github.com/apache/spark/pull/6546
[SPARK-8001] [CORE] Make AsynchronousListenerBus.waitUntilEmpty throw TimeoutException if timeout
zsxwing <zsxwing@gmail.com>
2015-06-03 15:03:07 -0700
Commit: 306837e, github.com/apache/spark/pull/6550
[SPARK-8083] [MESOS] Use the correct base path in mesos driver page.
Timothy Chen <tnachen@gmail.com>
2015-06-03 14:57:23 -0700
Commit: 59399a8, github.com/apache/spark/pull/6615
[MINOR] [UI] Improve confusing message on log page
Andrew Or <andrew@databricks.com>
2015-06-03 12:10:12 -0700
Commit: 31e0ae9
[SPARK-8054] [MLLIB] Added several Java-friendly APIs + unit tests
Joseph K. Bradley <joseph@databricks.com>
2015-06-03 14:34:20 -0700
Commit: bfab61f, github.com/apache/spark/pull/6562
[SPARK-8074] Parquet should throw AnalysisException during setup for data type/name related failures.
Reynold Xin <rxin@databricks.com>
2015-06-03 13:57:57 -0700
Commit: 1f90a06, github.com/apache/spark/pull/6608
[SPARK-8063] [SPARKR] Spark master URL conflict between MASTER env variable and --master command line option.
Sun Rui <rui.sun@intel.com>
2015-06-03 11:56:35 -0700
Commit: f67a27d, github.com/apache/spark/pull/6605
[SPARK-7980] [SQL] Support SQLContext.range(end)
animesh <animesh@apache.spark>
2015-06-03 11:28:18 -0700
Commit: 0a1dad6, github.com/apache/spark/pull/6609
[SPARK-7973] [SQL] Increase the timeout of two CliSuite tests.
Yin Huai <yhuai@databricks.com>
2015-06-03 09:26:21 -0700
Commit: 54a4ea4, github.com/apache/spark/pull/6525
[SPARK-8060] Improve DataFrame Python test coverage and documentation.
Reynold Xin <rxin@databricks.com>
2015-06-03 00:23:34 -0700
Commit: ee7f365, github.com/apache/spark/pull/6601
[SPARK-8032] [PYSPARK] Make version checking for NumPy in MLlib more robust
MechCoder <manojkumarsivaraj334@gmail.com>
2015-06-02 23:24:47 -0700
Commit: bd57af3, github.com/apache/spark/pull/6579
[SPARK-8043] [MLLIB] [DOC] update NaiveBayes and SVM examples in doc
Yuhao Yang <hhbyyh@gmail.com>
2015-06-02 23:15:38 -0700
Commit: 33edb2b, github.com/apache/spark/pull/6584
[SPARK-8053] [MLLIB] renamed scalingVector to scalingVec
Joseph K. Bradley <joseph@databricks.com>
2015-06-02 22:56:56 -0700
Commit: 88399c3, github.com/apache/spark/pull/6596
[SPARK-7547] [ML] Scala Example code for ElasticNet
DB Tsai <dbt@netflix.com>
2015-06-02 19:12:08 -0700
Commit: 6391be8, github.com/apache/spark/pull/6576
[SPARK-7387] [ML] [DOC] CrossValidator example code in Python
Ram Sriharsha <rsriharsha@hw11853.local>
2015-06-02 18:53:04 -0700
Commit: 6a3e32a, github.com/apache/spark/pull/6358
Preparing development version 1.4.0-SNAPSHOT
Patrick Wendell <pwendell@gmail.com>
2015-06-02 18:06:41 -0700
Commit: ab713af
Release 1.4.0
[HOTFIX] Revert "[SPARK-7092] Update spark scala version to 2.11.6"
Patrick Wendell <patrick@databricks.com>
2015-05-19 02:28:41 -0700
Commit: 31f5d53
Revert "Preparing Spark release v1.4.0-rc1"
Patrick Wendell <patrick@databricks.com>
2015-05-19 02:27:14 -0700
Commit: 586ede6
Revert "Preparing development version 1.4.1-SNAPSHOT"
Patrick Wendell <patrick@databricks.com>
2015-05-19 02:27:07 -0700
Commit: e7309ec
Fixing a few basic typos in the Programming Guide.
Mike Dusenberry <dusenberrymw@gmail.com>
2015-05-19 08:59:45 +0100
Commit: 0748263, github.com/apache/spark/pull/6240
Preparing development version 1.4.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-05-19 07:13:24 +0000
Commit: a1d896b
Preparing Spark release v1.4.0-rc1
Patrick Wendell <patrick@databricks.com>
2015-05-19 07:13:24 +0000
Commit: 79fb01a
Updating CHANGES.txt for Spark 1.4
Patrick Wendell <patrick@databricks.com>
2015-05-19 00:12:20 -0700
Commit: 30bf333
Revert "Preparing Spark release v1.4.0-rc1"
Patrick Wendell <patrick@databricks.com>
2015-05-19 00:10:39 -0700
Commit: b0c63d2
Revert "Preparing development version 1.4.1-SNAPSHOT"
Patrick Wendell <patrick@databricks.com>
2015-05-19 00:10:37 -0700
Commit: 198a186
[SPARK-7581] [ML] [DOC] User guide for spark.ml PolynomialExpansion
Xusen Yin <yinxusen@gmail.com>
2015-05-19 00:06:33 -0700
Commit: 38a3fc8, github.com/apache/spark/pull/6113
[HOTFIX] Fixing style failures in Kinesis source
Patrick Wendell <patrick@databricks.com>
2015-05-19 00:02:06 -0700
Commit: de60c2e
Preparing development version 1.4.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-05-19 06:06:41 +0000
Commit: 40190ce
Preparing Spark release v1.4.0-rc1
Patrick Wendell <patrick@databricks.com>
2015-05-19 06:06:40 +0000
Commit: 38ccef3
Revert "Preparing Spark release v1.4.0-rc1"
Patrick Wendell <patrick@databricks.com>
2015-05-18 23:06:15 -0700
Commit: 152b029
Revert "Preparing development version 1.4.1-SNAPSHOT"
Patrick Wendell <patrick@databricks.com>
2015-05-18 23:06:13 -0700
Commit: 4d098bc
[HOTFIX]: Java 6 Build Breaks
Patrick Wendell <patrick@databricks.com>
2015-05-19 06:00:13 +0000
Commit: be1fc93
Preparing development version 1.4.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-05-19 05:01:11 +0000
Commit: 758ca74
Preparing Spark release v1.4.0-rc1
Patrick Wendell <patrick@databricks.com>
2015-05-19 05:01:11 +0000
Commit: e8e97e3
[SPARK-7687] [SQL] DataFrame.describe() should cast all aggregates to String
Josh Rosen <joshrosen@databricks.com>
2015-05-18 21:53:44 -0700
Commit: 99436bd, github.com/apache/spark/pull/6218
CHANGES.txt and changelist updates for Spark 1.4.
Patrick Wendell <patrick@databricks.com>
2015-05-18 21:44:13 -0700
Commit: 914ecd0
[SPARK-7150] SparkContext.range() and SQLContext.range()
Daoyuan Wang <daoyuan.wang@intel.com>, Davies Liu <davies@databricks.com>
2015-05-18 21:43:12 -0700
Commit: 7fcbb2c, github.com/apache/spark/pull/6081
Version updates for Spark 1.4.0
Patrick Wendell <patrick@databricks.com>
2015-05-18 21:38:37 -0700
Commit: 9d0b7fb
[SPARK-7681] [MLLIB] Add SparseVector support for gemv
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-18 21:32:36 -0700
Commit: dd9f873, github.com/apache/spark/pull/6209
[SPARK-7692] Updated Kinesis examples
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-18 18:24:15 -0700
Commit: 9c48548, github.com/apache/spark/pull/6249
[SPARK-7621] [STREAMING] Report Kafka errors to StreamingListeners
jerluc <jeremyalucas@gmail.com>
2015-05-18 18:13:29 -0700
Commit: 9188ad8, github.com/apache/spark/pull/6204
[SPARK-7624] Revert #4147
Davies Liu <davies@databricks.com>
2015-05-18 16:55:45 -0700
Commit: 60cb33d, github.com/apache/spark/pull/6172
[SQL] Fix serializability of ORC table scan
Michael Armbrust <michael@databricks.com>
2015-05-18 15:24:31 -0700
Commit: f8f23c4, github.com/apache/spark/pull/6247
[SPARK-7501] [STREAMING] DAG visualization: show DStream operations
Andrew Or <andrew@databricks.com>
2015-05-18 14:33:33 -0700
Commit: a475cbc, github.com/apache/spark/pull/6034
[HOTFIX] Fix ORC build break
Michael Armbrust <michael@databricks.com>
2015-05-18 14:04:04 -0700
Commit: ba502ab, github.com/apache/spark/pull/6244
[SPARK-7658] [STREAMING] [WEBUI] Update the mouse behaviors for the timeline graphs
zsxwing <zsxwing@gmail.com>
2015-05-18 13:34:43 -0700
Commit: 39add3d, github.com/apache/spark/pull/6168
[SPARK-6216] [PYSPARK] check python version of worker with driver
Davies Liu <davies@databricks.com>
2015-05-18 12:55:13 -0700
Commit: a833209, github.com/apache/spark/pull/6203
[SPARK-7673] [SQL] WIP: HadoopFsRelation and ParquetRelation2 performance optimizations
Cheng Lian <lian@databricks.com>
2015-05-18 12:45:37 -0700
Commit: 3962348, github.com/apache/spark/pull/6225
[SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis
Yin Huai <yhuai@databricks.com>
2015-05-18 12:17:10 -0700
Commit: a385f4b, github.com/apache/spark/pull/6130
[SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals)
Wenchen Fan <cloud0fan@outlook.com>
2015-05-18 12:08:28 -0700
Commit: d6f5f37, github.com/apache/spark/pull/6173
[SPARK-7631] [SQL] treenode argString should not print children
scwf <wangfei1@huawei.com>
2015-05-18 12:05:14 -0700
Commit: dbd4ec8, github.com/apache/spark/pull/6144
[SPARK-2883] [SQL] ORC data source for Spark SQL
Zhan Zhang <zhazhan@gmail.com>, Cheng Lian <lian@databricks.com>
2015-05-18 12:03:27 -0700
Commit: 65d71bd, github.com/apache/spark/pull/6194
[SPARK-7380] [MLLIB] pipeline stages should be copyable in Python
Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
2015-05-18 12:02:18 -0700
Commit: cf4e04a, github.com/apache/spark/pull/6088
[SQL] [MINOR] [THIS] use private for internal field in ScalaUdf
Wenchen Fan <cloud0fan@outlook.com>
2015-05-18 12:01:30 -0700
Commit: 7d44c01, github.com/apache/spark/pull/6235
[SPARK-7570] [SQL] Ignores _temporary during partition discovery
Cheng Lian <lian@databricks.com>
2015-05-18 11:59:44 -0700
Commit: c7623a2, github.com/apache/spark/pull/6091
[SPARK-6888] [SQL] Make the jdbc driver handling user-definable
Rene Treffer <treffer@measite.de>
2015-05-18 11:55:36 -0700
Commit: b41301a, github.com/apache/spark/pull/5555
[SPARK-7627] [SPARK-7472] DAG visualization: style skipped stages
Andrew Or <andrew@databricks.com>
2015-05-18 10:59:35 -0700
Commit: a0ae8ce, github.com/apache/spark/pull/6171
[SPARK-7272] [MLLIB] User guide for PMML model export
Vincenzo Selvaggio <vselvaggio@hotmail.it>
2015-05-18 08:46:33 -0700
Commit: a95d4e1, github.com/apache/spark/pull/6219
[SPARK-6657] [PYSPARK] Fix doc warnings
Xiangrui Meng <meng@databricks.com>
2015-05-18 08:35:14 -0700
Commit: 2c94ffe, github.com/apache/spark/pull/6221
[SPARK-7299][SQL] Set precision and scale for Decimal according to JDBC metadata instead of returned BigDecimal
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-18 01:10:55 -0700
Commit: 0e7cd8f, github.com/apache/spark/pull/5833
[SPARK-7694] [MLLIB] Use getOrElse for getting the threshold of LR model
Shuo Xiang <shuoxiangpub@gmail.com>
2015-05-17 21:16:52 -0700
Commit: 0b6bc8a, github.com/apache/spark/pull/6224
[SPARK-7693][Core] Remove "import scala.concurrent.ExecutionContext.Implicits.global"
zsxwing <zsxwing@gmail.com>
2015-05-17 20:37:19 -0700
Commit: 2a42d2d, github.com/apache/spark/pull/6223
[SQL] [MINOR] use catalyst type converter in ScalaUdf
Wenchen Fan <cloud0fan@outlook.com>
2015-05-17 16:51:57 -0700
Commit: be66d19, github.com/apache/spark/pull/6182
[SPARK-6514] [SPARK-5960] [SPARK-6656] [SPARK-7679] [STREAMING] [KINESIS] Updates to the Kinesis API
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-17 16:49:07 -0700
Commit: e0632ff, github.com/apache/spark/pull/6147
[SPARK-7491] [SQL] Allow configuration of classloader isolation for hive
Michael Armbrust <michael@databricks.com>
2015-05-17 12:43:15 -0700
Commit: a855608, github.com/apache/spark/pull/6167
[SPARK-7686] [SQL] DescribeCommand is assigned wrong output attributes in SparkStrategies
Josh Rosen <joshrosen@databricks.com>
2015-05-17 11:59:28 -0700
Commit: 53d6ab5, github.com/apache/spark/pull/6217
[SPARK-7660] Wrap SnappyOutputStream to work around snappy-java bug
Josh Rosen <joshrosen@databricks.com>
2015-05-17 09:30:49 -0700
Commit: 6df71eb, github.com/apache/spark/pull/6176
[SPARK-7669] Builds against Hadoop 2.6+ get inconsistent curator depend…
Steve Loughran <stevel@hortonworks.com>
2015-05-17 17:03:11 +0100
Commit: 0feb3de, github.com/apache/spark/pull/6191
[SPARK-7447] [SQL] Don't re-merge Parquet schema when the relation is deserialized
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-17 15:42:21 +0800
Commit: 898be62, github.com/apache/spark/pull/6012
[MINOR] Add 1.3, 1.3.1 to master branch EC2 scripts
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-05-17 00:12:20 -0700
Commit: 0ed376a, github.com/apache/spark/pull/6215
[MINOR] [SQL] Removes an unreachable case clause
Cheng Lian <lian@databricks.com>
2015-05-16 23:20:09 -0700
Commit: 671a6bc, github.com/apache/spark/pull/6214
[SPARK-7654][SQL] Move JDBC into DataFrame's reader/writer interface.
Reynold Xin <rxin@databricks.com>
2015-05-16 22:01:53 -0700
Commit: 17e0786, github.com/apache/spark/pull/6210
[SPARK-7655][Core] Deserializing value should not hold the TaskSchedulerImpl lock
zsxwing <zsxwing@gmail.com>
2015-05-16 21:03:22 -0700
Commit: 8494910, github.com/apache/spark/pull/6195
[SPARK-7654][MLlib] Migrate MLlib to the DataFrame reader/writer API.
Reynold Xin <rxin@databricks.com>
2015-05-16 15:03:57 -0700
Commit: bd057f8, github.com/apache/spark/pull/6211
[BUILD] update jblas dependency version to 1.2.4
Matthew Brandyberry <mbrandy@us.ibm.com>
2015-05-16 18:17:48 +0100
Commit: 8bde352, github.com/apache/spark/pull/6199
[HOTFIX] [SQL] Fixes DataFrameWriter.mode(String)
Cheng Lian <lian@databricks.com>
2015-05-16 20:55:10 +0800
Commit: 856619d, github.com/apache/spark/pull/6212
[SPARK-7655][Core][SQL] Remove 'scala.concurrent.ExecutionContext.Implicits.global' in 'ask' and 'BroadcastHashJoin'
zsxwing <zsxwing@gmail.com>
2015-05-16 00:44:29 -0700
Commit: ad5b0b1, github.com/apache/spark/pull/6200
[SPARK-7672] [CORE] Use int conversion in translating kryoserializer.buffer.mb to kryoserializer.buffer
Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
2015-05-16 08:24:21 +0100
Commit: e7607e5, github.com/apache/spark/pull/6198
[SPARK-4556] [BUILD] binary distribution assembly can't run in local mode
Sean Owen <sowen@cloudera.com>
2015-05-16 08:18:41 +0100
Commit: 1fc3560, github.com/apache/spark/pull/6186
[SPARK-7671] Fix wrong URLs in MLlib Data Types Documentation
FavioVazquez <favio.vazquezp@gmail.com>
2015-05-16 08:07:03 +0100
Commit: 7e3f9fe, github.com/apache/spark/pull/6196
[SPARK-7654][SQL] DataFrameReader and DataFrameWriter for input/output API
Reynold Xin <rxin@databricks.com>
2015-05-15 22:00:31 -0700
Commit: 9da55b5, github.com/apache/spark/pull/6175
[SPARK-7473] [MLLIB] Add reservoir sample in RandomForest
AiHe <ai.he@ussuning.com>
2015-05-15 20:42:35 -0700
Commit: f41be8f, github.com/apache/spark/pull/5988
[SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files
Davies Liu <davies@databricks.com>
2015-05-15 20:09:15 -0700
Commit: 8164fbc, github.com/apache/spark/pull/6201
[SPARK-7073] [SQL] [PySpark] Clean up SQL data type hierarchy in Python
Davies Liu <davies@databricks.com>
2015-05-15 20:05:26 -0700
Commit: 61806f6, github.com/apache/spark/pull/6206
[SPARK-7575] [ML] [DOC] Example code for OneVsRest
Ram Sriharsha <rsriharsha@hw11853.local>
2015-05-15 19:33:20 -0700
Commit: 04323ba, github.com/apache/spark/pull/6115
[SPARK-7563] OutputCommitCoordinator.stop() should only run on the driver
Josh Rosen <joshrosen@databricks.com>
2015-05-15 18:06:01 -0700
Commit: ed75cc0, github.com/apache/spark/pull/6197
[SPARK-7676] Bug fix and cleanup of stage timeline view
Kay Ousterhout <kayousterhout@gmail.com>
2015-05-15 17:45:14 -0700
Commit: 6f78d03, github.com/apache/spark/pull/6202
[SPARK-7556] [ML] [DOC] Add user guide for spark.ml Binarizer, including Scala, Java and Python examples
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-15 15:05:04 -0700
Commit: e847d86, github.com/apache/spark/pull/6116
[SPARK-7677] [STREAMING] Add Kafka modules to the 2.11 build.
Iulian Dragos <jaguarul@gmail.com>
2015-05-15 14:57:29 -0700
Commit: 31e6404, github.com/apache/spark/pull/6149
[SPARK-7226] [SPARKR] Support math functions in R DataFrame
qhuang <qian.huang@intel.com>
2015-05-15 14:06:16 -0700
Commit: 9ef6d74, github.com/apache/spark/pull/6170
[SPARK-7296] Add timeline visualization for stages in the UI.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-15 13:54:09 -0700
Commit: a5f7b3b, github.com/apache/spark/pull/5843
[SPARK-7504] [YARN] NullPointerException when initializing SparkContext in YARN-cluster mode
ehnalis <zoltan.zvara@gmail.com>
2015-05-15 12:14:02 -0700
Commit: 7dc0ff3, github.com/apache/spark/pull/6083
[SPARK-7664] [WEBUI] DAG visualization: Fix incorrect link paths of DAG.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-15 11:54:13 -0700
Commit: e319719, github.com/apache/spark/pull/6184
[SPARK-5412] [DEPLOY] Cannot bind Master to a specific hostname as per the documentation
Sean Owen <sowen@cloudera.com>
2015-05-15 11:30:19 -0700
Commit: fe3c734, github.com/apache/spark/pull/6185
[CORE] Protect additional test vars from early GC
Tim Ellison <t.p.ellison@gmail.com>
2015-05-15 11:27:24 -0700
Commit: 866e4b5, github.com/apache/spark/pull/6187
[SPARK-7233] [CORE] Detect REPL mode once
Oleksii Kostyliev <etander@gmail.com>, Oleksii Kostyliev <okostyliev@thunderhead.com>
2015-05-15 11:19:56 -0700
Commit: c58b9c6, github.com/apache/spark/pull/5835
[SPARK-7651] [MLLIB] [PYSPARK] GMM predict, predictSoft should raise error on bad input
FlytxtRnD <meethu.mathew@flytxt.com>
2015-05-15 10:43:18 -0700
Commit: dfdae58, github.com/apache/spark/pull/6180
[SPARK-7668] [MLLIB] Preserve isTransposed property for Matrix after calling map function
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-15 10:03:29 -0700
Commit: d1f5651, github.com/apache/spark/pull/6188
[SPARK-7503] [YARN] Resources in .sparkStaging directory can't be cleaned up on error
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-15 11:37:34 +0100
Commit: a17a0ee, github.com/apache/spark/pull/6026
[SPARK-7591] [SQL] Partitioning support API tweaks
Cheng Lian <lian@databricks.com>
2015-05-15 16:20:49 +0800
Commit: bcb2c5d, github.com/apache/spark/pull/6150
[SPARK-6258] [MLLIB] GaussianMixture Python API parity check
Yanbo Liang <ybliang8@gmail.com>
2015-05-15 00:18:39 -0700
Commit: c0bb974, github.com/apache/spark/pull/6087
[SPARK-7650] [STREAMING] [WEBUI] Move streaming css and js files to the streaming project
zsxwing <zsxwing@gmail.com>
2015-05-14 23:51:41 -0700
Commit: 0ba99f0, github.com/apache/spark/pull/6160
[CORE] Remove unreachable Heartbeat message from Worker
Kan Zhang <kzhang@apache.org>
2015-05-14 23:50:50 -0700
Commit: 6742b4e, github.com/apache/spark/pull/6163
[HOTFIX] Add workaround for SPARK-7660 to fix JavaAPISuite failures.
Josh Rosen <joshrosen@databricks.com>
2015-05-14 23:17:41 -0700
Commit: 1206a55
[SQL] When creating partitioned table scan, explicitly create UnionRDD.
Yin Huai <yhuai@databricks.com>
2015-05-15 12:04:26 +0800
Commit: 7aa269f, github.com/apache/spark/pull/6162
[SPARK-7098][SQL] Make the WHERE clause with timestamp show consistent result
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-14 20:49:21 -0700
Commit: bac4522, github.com/apache/spark/pull/5682
[SPARK-7548] [SQL] Add explode function for DataFrames
Michael Armbrust <michael@databricks.com>
2015-05-14 19:49:44 -0700
Commit: 778a054, github.com/apache/spark/pull/6107
[SPARK-7619] [PYTHON] fix docstring signature
Xiangrui Meng <meng@databricks.com>
2015-05-14 18:16:22 -0700
Commit: a238c23, github.com/apache/spark/pull/6161
[SPARK-7648] [MLLIB] Add weights and intercept to GLM wrappers in spark.ml
Xiangrui Meng <meng@databricks.com>
2015-05-14 18:13:58 -0700
Commit: f91bb57, github.com/apache/spark/pull/6156
[SPARK-7645] [STREAMING] [WEBUI] Show milliseconds in the UI if the batch interval < 1 second
zsxwing <zsxwing@gmail.com>
2015-05-14 16:58:36 -0700
Commit: 79983f1, github.com/apache/spark/pull/6154
[SPARK-7649] [STREAMING] [WEBUI] Use window.localStorage to store the status rather than the url
zsxwing <zsxwing@gmail.com>
2015-05-14 16:57:33 -0700
Commit: 3358485, github.com/apache/spark/pull/6158
[SPARK-7643] [UI] use the correct size in RDDPage for storage info and partitions
Xiangrui Meng <meng@databricks.com>
2015-05-14 16:56:32 -0700
Commit: 8d8876d, github.com/apache/spark/pull/6157
[SPARK-7598] [DEPLOY] Add aliveWorkers metrics in Master
Rex Xiong <pengx@microsoft.com>
2015-05-14 16:55:31 -0700
Commit: 894214f, github.com/apache/spark/pull/6117
Make SPARK prefix a variable
tedyu <yuzhihong@gmail.com>
2015-05-14 15:26:35 -0700
Commit: fceaffc, github.com/apache/spark/pull/6153
[SPARK-7278] [PySpark] DateType should find datetime.datetime acceptable
ksonj <kson@siberie.de>
2015-05-14 15:10:58 -0700
Commit: a49a145, github.com/apache/spark/pull/6057
[SQL][minor] rename apply for QueryPlanner
Wenchen Fan <cloud0fan@outlook.com>
2015-05-14 10:25:18 -0700
Commit: aa8a0f9, github.com/apache/spark/pull/6142
[SPARK-7249] Updated Hadoop dependencies due to inconsistency in the versions
FavioVazquez <favio.vazquezp@gmail.com>
2015-05-14 15:22:58 +0100
Commit: 67ed0aa, github.com/apache/spark/pull/5786
[SPARK-7568] [ML] ml.LogisticRegression doesn't output the right prediction
DB Tsai <dbt@netflix.com>
2015-05-14 01:26:08 -0700
Commit: 58534b0, github.com/apache/spark/pull/6109
[SPARK-7407] [MLLIB] use uid + name to identify parameters
Xiangrui Meng <meng@databricks.com>
2015-05-14 01:22:15 -0700
Commit: e45cd9f, github.com/apache/spark/pull/6019
[SPARK-7595] [SQL] Window will cause resolve failed with self join
linweizhong <linweizhong@huawei.com>
2015-05-14 00:23:27 -0700
Commit: c80e0cf, github.com/apache/spark/pull/6114
[SPARK-7620] [ML] [MLLIB] Removed calling size, length in while condition to avoid extra JVM call
DB Tsai <dbt@netflix.com>
2015-05-13 22:23:21 -0700
Commit: 9ab4db2, github.com/apache/spark/pull/6137
[SPARK-7612] [MLLIB] update NB training to use mllib's BLAS
Xiangrui Meng <meng@databricks.com>
2015-05-13 21:27:17 -0700
Commit: 82f387f, github.com/apache/spark/pull/6128
[HOT FIX #6125] Do not wait for all stages to start rendering
Andrew Or <andrew@databricks.com>
2015-05-13 21:04:13 -0700
Commit: 2d4a961, github.com/apache/spark/pull/6138
[HOTFIX] Use 'new Job' in fsBasedParquet.scala
zsxwing <zsxwing@gmail.com>
2015-05-13 17:58:29 -0700
Commit: d518c03, github.com/apache/spark/pull/6136
[SPARK-6752] [STREAMING] [REVISED] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-13 17:33:15 -0700
Commit: aec8394, github.com/apache/spark/pull/6096
[SPARK-7601] [SQL] Support Insert into JDBC Datasource
Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-05-13 17:24:04 -0700
Commit: 820aaa6, github.com/apache/spark/pull/6121
[SPARK-7081] Faster sort-based shuffle path using binary processing cache-aware sort
Josh Rosen <joshrosen@databricks.com>
2015-05-13 17:07:31 -0700
Commit: c53ebea, github.com/apache/spark/pull/5868
[SPARK-7356] [STREAMING] Fix flakey tests in FlumePollingStreamSuite using SparkSink's batch CountDownLatch.
Hari Shreedharan <hshreedharan@apache.org>
2015-05-13 16:43:30 -0700
Commit: 6c0644a, github.com/apache/spark/pull/5918
[STREAMING] [MINOR] Keep streaming.UIUtils private
Andrew Or <andrew@databricks.com>
2015-05-13 16:31:24 -0700
Commit: e499a1e, github.com/apache/spark/pull/6134
[SPARK-7502] DAG visualization: gracefully handle removed stages
Andrew Or <andrew@databricks.com>
2015-05-13 16:29:52 -0700
Commit: 895d46a, github.com/apache/spark/pull/6132
[SPARK-7464] DAG visualization: highlight the same RDDs on hover
Andrew Or <andrew@databricks.com>
2015-05-13 16:29:10 -0700
Commit: 4b4f10b, github.com/apache/spark/pull/6100
[SPARK-7399] Spark compilation error for scala 2.11
Andrew Or <andrew@databricks.com>
2015-05-13 16:28:37 -0700
Commit: e6b8cef, github.com/apache/spark/pull/6129
[SPARK-7608] Clean up old state in RDDOperationGraphListener
Andrew Or <andrew@databricks.com>
2015-05-13 16:27:48 -0700
Commit: ec34230, github.com/apache/spark/pull/6125
[SQL] Move some classes into packages that are more appropriate.
Reynold Xin <rxin@databricks.com>
2015-05-13 16:15:31 -0700
Commit: acd872b, github.com/apache/spark/pull/6108
[SPARK-7303] [SQL] push down project if possible when the child is sort
scwf <wangfei1@huawei.com>
2015-05-13 16:13:48 -0700
Commit: d5c52d9, github.com/apache/spark/pull/5838
[SPARK-7382] [MLLIB] Feature Parity in PySpark for ml.classification
Burak Yavuz <brkyvz@gmail.com>
2015-05-13 15:13:09 -0700
Commit: 51230f2, github.com/apache/spark/pull/6106
[SPARK-7545] [MLLIB] Added check in Bernoulli Naive Bayes to make sure that both training and predict features have values of 0 or 1
leahmcguire <lmcguire@salesforce.com>
2015-05-13 14:13:19 -0700
Commit: d9fb905, github.com/apache/spark/pull/6073
[SPARK-7593] [ML] Python Api for ml.feature.Bucketizer
Burak Yavuz <brkyvz@gmail.com>
2015-05-13 13:21:36 -0700
Commit: 11911b0, github.com/apache/spark/pull/6124
[SPARK-7551][DataFrame] support backticks for DataFrame attribute resolution
Wenchen Fan <cloud0fan@outlook.com>
2015-05-13 12:47:48 -0700
Commit: 3a60bcb, github.com/apache/spark/pull/6074
[SPARK-7567] [SQL] Migrating Parquet data source to FSBasedRelation
Cheng Lian <lian@databricks.com>
2015-05-13 11:04:10 -0700
Commit: 90f304b, github.com/apache/spark/pull/6090
[SPARK-7589] [STREAMING] [WEBUI] Make "Input Rate" in the Streaming page consistent with other pages
zsxwing <zsxwing@gmail.com>
2015-05-13 10:01:26 -0700
Commit: 10007fb, github.com/apache/spark/pull/6102
[SPARK-6734] [SQL] Add UDTF.close support in Generate
Cheng Hao <hao.cheng@intel.com>
2015-05-14 00:14:59 +0800
Commit: 42cf4a2, github.com/apache/spark/pull/5383
[MINOR] [SQL] Removes debugging println
Cheng Lian <lian@databricks.com>
2015-05-13 23:40:13 +0800
Commit: d78f0e1, github.com/apache/spark/pull/6123
[SQL] In InsertIntoFSBasedRelation.insert, log cause before abort job/task.
Yin Huai <yhuai@databricks.com>
2015-05-13 23:36:19 +0800
Commit: 9ca28d9, github.com/apache/spark/pull/6105
[SPARK-7599] [SQL] Don't restrict customized output committers to be subclasses of FileOutputCommitter
Cheng Lian <lian@databricks.com>
2015-05-13 07:35:55 -0700
Commit: cb1fe81, github.com/apache/spark/pull/6118
[SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>, Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-13 09:43:40 +0100
Commit: bfdecac, github.com/apache/spark/pull/5447
[SPARK-7526] [SPARKR] Specify ip of RBackend, MonitorServer and RRDD Socket server
linweizhong <linweizhong@huawei.com>
2015-05-12 23:55:44 -0700
Commit: 7bd5274, github.com/apache/spark/pull/6053
[SPARK-7482] [SPARKR] Rename some DataFrame API methods in SparkR to match their counterparts in Scala.
Sun Rui <rui.sun@intel.com>
2015-05-12 23:52:30 -0700
Commit: b18f1c6, github.com/apache/spark/pull/6007
[SPARK-7566][SQL] Add type to HiveContext.analyzer
Santiago M. Mola <santi@mola.io>
2015-05-12 23:44:21 -0700
Commit: 6ff3379, github.com/apache/spark/pull/6086
[SPARK-7321][SQL] Add Column expression for conditional statements (when/otherwise)
Reynold Xin <rxin@databricks.com>, kaka1992 <kaka_1992@163.com>
2015-05-12 21:43:34 -0700
Commit: 219a904, github.com/apache/spark/pull/6072
[SPARK-7588] Document all SQL/DataFrame public methods with @since tag
Reynold Xin <rxin@databricks.com>
2015-05-12 18:37:02 -0700
Commit: bdd5db9, github.com/apache/spark/pull/6101
[HOTFIX] Use the old Job API to support old Hadoop versions
zsxwing <zsxwing@gmail.com>
2015-05-13 08:33:24 +0800
Commit: 2cc3301, github.com/apache/spark/pull/6095
[SPARK-7572] [MLLIB] do not import Param/Params under pyspark.ml
Xiangrui Meng <meng@databricks.com>
2015-05-12 17:15:39 -0700
Commit: 08ec1af, github.com/apache/spark/pull/6094
[SPARK-7554] [STREAMING] Throw exception when an active/stopped StreamingContext is used to create DStreams and output operations
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-12 17:07:21 -0700
Commit: bb81b15, github.com/apache/spark/pull/6099
[SPARK-7528] [MLLIB] make RankingMetrics Java-friendly
Xiangrui Meng <meng@databricks.com>
2015-05-12 16:53:47 -0700
Commit: 6c292a2, github.com/apache/spark/pull/6098
[SPARK-7553] [STREAMING] Added methods to maintain a singleton StreamingContext
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-12 16:44:14 -0700
Commit: 91fbd93, github.com/apache/spark/pull/6070
[SPARK-7573] [ML] OneVsRest cleanups
Joseph K. Bradley <joseph@databricks.com>
2015-05-12 16:42:30 -0700
Commit: 612247f, github.com/apache/spark/pull/6097
[SPARK-7557] [ML] [DOC] User guide for spark.ml HashingTF, Tokenizer
Joseph K. Bradley <joseph@databricks.com>
2015-05-12 16:39:56 -0700
Commit: d080df1, github.com/apache/spark/pull/6093
[SPARK-7496] [MLLIB] Update Programming guide with Online LDA
Yuhao Yang <hhbyyh@gmail.com>
2015-05-12 15:12:29 -0700
Commit: fe34a59, github.com/apache/spark/pull/6046
[SPARK-7406] [STREAMING] [WEBUI] Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay"
zsxwing <zsxwing@gmail.com>
2015-05-12 14:41:21 -0700
Commit: 221375e, github.com/apache/spark/pull/5952
[SPARK-7571] [MLLIB] rename Math to math
Xiangrui Meng <meng@databricks.com>
2015-05-12 14:39:03 -0700
Commit: 2555517, github.com/apache/spark/pull/6092
[SPARK-7484][SQL] Support jdbc connection properties
Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-05-12 14:37:23 -0700
Commit: 32819fc, github.com/apache/spark/pull/6009
[SPARK-7559] [MLLIB] Bucketizer should include the right most boundary in the last bucket.
Xiangrui Meng <meng@databricks.com>
2015-05-12 14:24:26 -0700
Commit: 98ccd93, github.com/apache/spark/pull/6075
[SPARK-7569][SQL] Better error for invalid binary expressions
Michael Armbrust <michael@databricks.com>
2015-05-12 13:36:55 -0700
Commit: c68485e, github.com/apache/spark/pull/6089
[SPARK-7015] [MLLIB] [WIP] Multiclass to Binary Reduction: One Against All
Ram Sriharsha <rsriharsha@hw11853.local>
2015-05-12 13:35:12 -0700
Commit: fd16709, github.com/apache/spark/pull/5830
[SPARK-2018] [CORE] Upgrade LZF library to fix endian serialization p…
Tim Ellison <t.p.ellison@gmail.com>
2015-05-12 20:48:26 +0100
Commit: eadda92, github.com/apache/spark/pull/6077
[SPARK-7487] [ML] Feature Parity in PySpark for ml.regression
Burak Yavuz <brkyvz@gmail.com>
2015-05-12 12:17:05 -0700
Commit: 432694c, github.com/apache/spark/pull/6016
[HOT FIX #6076] DAG visualization: curve the edges
Andrew Or <andrew@databricks.com>
2015-05-12 12:06:30 -0700
Commit: ce6c400
[SPARK-7276] [DATAFRAME] speed up DataFrame.select by collapsing Project
Wenchen Fan <cloud0fan@outlook.com>
2015-05-12 11:51:55 -0700
Commit: 8be43f8, github.com/apache/spark/pull/5831
[SPARK-7500] DAG visualization: move cluster labeling to dagre-d3
Andrew Or <andrew@databricks.com>
2015-05-12 11:17:59 -0700
Commit: a236104, github.com/apache/spark/pull/6076
[DataFrame][minor] support column in field accessor
Wenchen Fan <cloud0fan@outlook.com>
2015-05-12 10:37:57 -0700
Commit: ec89286, github.com/apache/spark/pull/6080
[SPARK-3928] [SPARK-5182] [SQL] Partitioning support for the data sources API
Cheng Lian <lian@databricks.com>
2015-05-13 01:32:28 +0800
Commit: d232813, github.com/apache/spark/pull/5526
[DataFrame][minor] cleanup unapply methods in DataTypes
Wenchen Fan <cloud0fan@outlook.com>
2015-05-12 10:28:40 -0700
Commit: a9d84a9, github.com/apache/spark/pull/6079
[SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark
Daoyuan Wang <daoyuan.wang@intel.com>
2015-05-12 10:23:41 -0700
Commit: 653db0a, github.com/apache/spark/pull/6003
[SPARK-7532] [STREAMING] StreamingContext.start() made to logWarning and not throw exception
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-12 08:48:24 -0700
Commit: 2bbb685, github.com/apache/spark/pull/6060
[SPARK-7467] DAG visualization: treat checkpoint as an RDD operation
Andrew Or <andrew@databricks.com>
2015-05-12 01:40:55 -0700
Commit: 5601632, github.com/apache/spark/pull/6004
[SPARK-7485] [BUILD] Remove pyspark files from assembly.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-12 01:39:21 -0700
Commit: afe54b7, github.com/apache/spark/pull/6022
[MINOR] [PYSPARK] Set PYTHONPATH to python/lib/pyspark.zip rather than python/pyspark
linweizhong <linweizhong@huawei.com>
2015-05-12 01:36:27 -0700
Commit: 4092a2e, github.com/apache/spark/pull/6047
[SPARK-7534] [CORE] [WEBUI] Fix the Stage table when a stage is missing
zsxwing <zsxwing@gmail.com>
2015-05-12 01:34:33 -0700
Commit: af374ed, github.com/apache/spark/pull/6061
[SPARK-6994][SQL] Update docs for fetching Row fields by name
vidmantas zemleris <vidmantas@vinted.com>
2015-05-11 22:29:24 -0700
Commit: 6523fb8, github.com/apache/spark/pull/6030
[SQL] Rename Dialect -> ParserDialect.
Reynold Xin <rxin@databricks.com>
2015-05-11 22:06:56 -0700
Commit: c6b8148, github.com/apache/spark/pull/6071
[SPARK-7435] [SPARKR] Make DataFrame.show() consistent with that of Scala and pySpark
Joshi <rekhajoshm@gmail.com>, Rekha Joshi <rekhajoshm@gmail.com>
2015-05-11 21:02:34 -0700
Commit: 835a770, github.com/apache/spark/pull/5989
[SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.
Reynold Xin <rxin@databricks.com>
2015-05-11 20:04:36 -0700
Commit: ed40ab5, github.com/apache/spark/pull/6068
[SPARK-7437] [SQL] Fold "literal in (item1, item2, ..., literal, ...)" into true or false directly
Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
2015-05-11 19:22:44 -0700
Commit: c30982d, github.com/apache/spark/pull/5972
[SPARK-7411] [SQL] Support SerDe for HiveQl in CTAS
Cheng Hao <hao.cheng@intel.com>
2015-05-11 19:21:16 -0700
Commit: 1a664a0, github.com/apache/spark/pull/5963
[SPARK-7324] [SQL] DataFrame.dropDuplicates
Reynold Xin <rxin@databricks.com>
2015-05-11 19:15:14 -0700
Commit: 8a9d234, github.com/apache/spark/pull/6066
[SPARK-7530] [STREAMING] Added StreamingContext.getState() to expose the current state of the context
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-11 18:53:50 -0700
Commit: c16b47f, github.com/apache/spark/pull/6058
[SPARK-5893] [ML] Add bucketizer
Xusen Yin <yinxusen@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
2015-05-11 18:41:22 -0700
Commit: f188815, github.com/apache/spark/pull/5980
Updated DataFrame.saveAsTable Hive warning to include SPARK-7550 ticket.
Reynold Xin <rxin@databricks.com>
2015-05-11 18:10:45 -0700
Commit: e1e599d, github.com/apache/spark/pull/6067
[SPARK-7462][SQL] Update documentation for retaining grouping columns in DataFrames.
Reynold Xin <rxin@databricks.com>
2015-05-11 18:07:12 -0700
Commit: eaa6116, github.com/apache/spark/pull/6062
[SPARK-7084] improve saveAsTable documentation
madhukar <phatak.dev@gmail.com>
2015-05-11 17:04:11 -0700
Commit: 0dbfe16, github.com/apache/spark/pull/5654
[SQL] Show better error messages for incorrect join types in DataFrames.
Reynold Xin <rxin@databricks.com>
2015-05-11 17:02:11 -0700
Commit: 0ff34f80, github.com/apache/spark/pull/6064
Update Documentation: leftsemi instead of semijoin
LCY Vincent <lauchunyin@gmail.com>
2015-05-11 14:48:10 -0700
Commit: 788503a, github.com/apache/spark/pull/5944
[STREAMING] [MINOR] Close files correctly when iterator is finished in streaming WAL recovery
jerryshao <saisai.shao@intel.com>
2015-05-11 14:38:58 -0700
Commit: 9e226e1, github.com/apache/spark/pull/6050
[SPARK-7516] [Minor] [DOC] Replace deprecated inferSchema() with createDataFrame()
gchen <chenguancheng@gmail.com>
2015-05-11 14:37:18 -0700
Commit: 1538b10, github.com/apache/spark/pull/6041
[SPARK-7508] JettyUtils-generated servlets to log & report all errors
Steve Loughran <stevel@hortonworks.com>
2015-05-11 13:35:06 -0700
Commit: 779174a, github.com/apache/spark/pull/6033
[SPARK-7462] By default retain group by columns in aggregate
Reynold Xin <rxin@databricks.com>, Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-05-11 11:35:16 -0700
Commit: 9c35f02, github.com/apache/spark/pull/5996
[SPARK-7361] [STREAMING] Throw unambiguous exception when attempting to start multiple StreamingContexts in the same JVM
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-11 10:58:56 -0700
Commit: 11648fa, github.com/apache/spark/pull/5907
[SPARK-7522] [EXAMPLES] Removed angle brackets from dataFormat option
Bryan Cutler <bjcutler@us.ibm.com>
2015-05-11 09:23:47 -0700
Commit: c234d78, github.com/apache/spark/pull/6049
[SPARK-6092] [MLLIB] Add RankingMetrics in PySpark/MLlib
Yanbo Liang <ybliang8@gmail.com>
2015-05-11 09:14:20 -0700
Commit: 017f9fa, github.com/apache/spark/pull/6044
[SPARK-7326] [STREAMING] Performing window() on a WindowedDStream doesn't work all the time
Wesley Miao <wesley.miao@gmail.com>, Wesley <wesley.miao@autodesk.com>
2015-05-11 12:20:06 +0100
Commit: da1be15, github.com/apache/spark/pull/5871
[SPARK-7519] [SQL] fix minor bugs in thrift server UI
tianyi <tianyi.asiainfo@gmail.com>
2015-05-11 14:08:15 +0800
Commit: fff3c86, github.com/apache/spark/pull/6048
[SPARK-7512] [SPARKR] Fix RDD's show method to use getJRDD
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-05-10 19:49:42 -0700
Commit: 5f227fd, github.com/apache/spark/pull/6035
[SPARK-7427] [PYSPARK] Make sharedParams match in Scala, Python
Glenn Weidner <gweidner@us.ibm.com>
2015-05-10 19:18:32 -0700
Commit: 051864e, github.com/apache/spark/pull/6023
[SPARK-5521] PCA wrapper for easy transform vectors
Kirill A. Korinskiy <catap@catap.ru>, Joseph K. Bradley <joseph@databricks.com>
2015-05-10 13:34:00 -0700
Commit: 193ff69, github.com/apache/spark/pull/4304
[SPARK-7431] [ML] [PYTHON] Made CrossValidatorModel call parent init in PySpark
Joseph K. Bradley <joseph@databricks.com>
2015-05-10 13:29:27 -0700
Commit: d49b72c, github.com/apache/spark/pull/5968
[MINOR] [SQL] Fixes variable name typo
Cheng Lian <lian@databricks.com>
2015-05-10 21:26:36 +0800
Commit: fd87b2a, github.com/apache/spark/pull/6038
[SPARK-7345][SQL] Spark cannot detect renamed columns using JDBC connector
Oleg Sidorkin <oleg.sidorkin@gmail.com>
2015-05-10 01:31:34 -0700
Commit: 5c40403, github.com/apache/spark/pull/6032
[SPARK-6091] [MLLIB] Add MulticlassMetrics in PySpark/MLlib
Yanbo Liang <ybliang8@gmail.com>
2015-05-10 00:57:14 -0700
Commit: fe46374, github.com/apache/spark/pull/6011
[SPARK-7475] [MLLIB] adjust ldaExample for online LDA
Yuhao Yang <hhbyyh@gmail.com>
2015-05-09 15:40:46 -0700
Commit: e96fc86, github.com/apache/spark/pull/6000
[BUILD] Reference fasterxml.jackson.version in sql/core/pom.xml
tedyu <yuzhihong@gmail.com>
2015-05-09 13:19:07 -0700
Commit: 5110f3e, github.com/apache/spark/pull/6031
Upgrade version of jackson-databind in sql/core/pom.xml
tedyu <yuzhihong@gmail.com>
2015-05-09 10:41:30 -0700
Commit: 6c5b9ff, github.com/apache/spark/pull/6028
[STREAMING] [DOCS] Fix wrong url about API docs of StreamingListener
dobashim <dobashim@oss.nttdata.co.jp>
2015-05-09 10:14:46 +0100
Commit: 5dbc7bb, github.com/apache/spark/pull/6024
[SPARK-7403] [WEBUI] Link URL in objects on Timeline View is wrong in case of running on YARN
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-09 10:10:29 +0100
Commit: 869a52d, github.com/apache/spark/pull/5947
[SPARK-7438] [SPARK CORE] Fixed validation of relativeSD in countApproxDistinct
Vinod K C <vinod.kc@huawei.com>
2015-05-09 10:03:15 +0100
Commit: b0460f4, github.com/apache/spark/pull/5974
[SPARK-7498] [ML] removed varargs annotation from Params.setDefaults
Joseph K. Bradley <joseph@databricks.com>
2015-05-08 21:55:54 -0700
Commit: 25972d3, github.com/apache/spark/pull/6021
[SPARK-7262] [ML] Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package
DB Tsai <dbt@netflix.com>
2015-05-08 21:43:05 -0700
Commit: 80bbe72, github.com/apache/spark/pull/5967
[SPARK-7375] [SQL] Avoid row copying in exchange when sort.serializeMapOutputs takes effect
Josh Rosen <joshrosen@databricks.com>
2015-05-08 22:09:55 -0400
Commit: 21212a2, github.com/apache/spark/pull/5948
[SPARK-7231] [SPARKR] Changes to make SparkR DataFrame dplyr friendly.
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-05-08 18:29:57 -0700
Commit: 448ff33, github.com/apache/spark/pull/6005
[SPARK-7451] [YARN] Preemption of executors is counted as failure causing Spark job to fail
Ashwin Shankar <ashankar@netflix.com>
2015-05-08 17:51:00 -0700
Commit: 959c7b6, github.com/apache/spark/pull/5993
[SPARK-7488] [ML] Feature Parity in PySpark for ml.recommendation
Burak Yavuz <brkyvz@gmail.com>
2015-05-08 17:24:32 -0700
Commit: 85cab34, github.com/apache/spark/pull/6015
[SPARK-7237] Clean function in several RDD methods
tedyu <yuzhihong@gmail.com>
2015-05-08 17:16:38 -0700
Commit: 45b6215, github.com/apache/spark/pull/5959
[SPARK-7469] [SQL] DAG visualization: show SQL query operators
Andrew Or <andrew@databricks.com>
2015-05-08 17:15:10 -0700
Commit: cafffd0, github.com/apache/spark/pull/5999
[SPARK-6955] Perform port retries at NettyBlockTransferService level
Aaron Davidson <aaron@databricks.com>
2015-05-08 17:13:55 -0700
Commit: 1eae476, github.com/apache/spark/pull/5575
updated ec2 instance types
Brendan Collins <bcollins@blueraster.com>
2015-05-08 15:59:34 -0700
Commit: 6e35cb5, github.com/apache/spark/pull/6014
[SPARK-5913] [MLLIB] Python API for ChiSqSelector
Yanbo Liang <ybliang8@gmail.com>
2015-05-08 15:48:39 -0700
Commit: ab48df3, github.com/apache/spark/pull/5939
[SPARK-4699] [SQL] Make caseSensitive configurable in spark sql analyzer
Jacky Li <jacky.likun@huawei.com>, wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
2015-05-08 15:25:54 -0700
Commit: 21bd722, github.com/apache/spark/pull/5806
[SPARK-7390] [SQL] Only merge other CovarianceCounter when its count is greater than zero
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-08 14:41:16 -0700
Commit: 5205eb4, github.com/apache/spark/pull/5931
[SPARK-7378] [CORE] Handle deep links to unloaded apps.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-08 14:12:58 -0700
Commit: 3024f6b, github.com/apache/spark/pull/5922
[MINOR] [CORE] Allow History Server to read kerberos opts from config file.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-08 14:10:27 -0700
Commit: 3da5f8b, github.com/apache/spark/pull/5998
[SPARK-7466] DAG visualization: fix orphan nodes
Andrew Or <andrew@databricks.com>
2015-05-08 14:09:39 -0700
Commit: ca2f1c5, github.com/apache/spark/pull/6002
[MINOR] Defeat early garbage collection of test suite variable
Tim Ellison <t.p.ellison@gmail.com>
2015-05-08 14:08:52 -0700
Commit: f734c58, github.com/apache/spark/pull/6010
[SPARK-7489] [SPARK SHELL] Spark shell crashes when compiled with scala 2.11
vinodkc <vinod.kc.in@gmail.com>
2015-05-08 14:07:53 -0700
Commit: 3b7fb7a, github.com/apache/spark/pull/6013
[WEBUI] Remove debug feature for vis.js
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-08 14:06:37 -0700
Commit: 1dde3b3, github.com/apache/spark/pull/5994
[MINOR] Ignore python/lib/pyspark.zip
zsxwing <zsxwing@gmail.com>
2015-05-08 14:06:02 -0700
Commit: ab0caa0, github.com/apache/spark/pull/6017
[SPARK-7490] [CORE] [Minor] MapOutputTracker.deserializeMapStatuses: close input streams
Evan Jones <ejones@twitter.com>
2015-05-08 22:00:39 +0100
Commit: 6230809, github.com/apache/spark/pull/5982
[SPARK-6627] Finished rename to ShuffleBlockResolver
Kay Ousterhout <kayousterhout@gmail.com>
2015-05-08 12:24:06 -0700
Commit: 82be68f, github.com/apache/spark/pull/5764
[SPARK-7133] [SQL] Implement struct, array, and map field accessor
Wenchen Fan <cloud0fan@outlook.com>
2015-05-08 11:49:38 -0700
Commit: f8468c4, github.com/apache/spark/pull/5744
[SPARK-7298] Harmonize style of new visualizations
Matei Zaharia <matei@databricks.com>
2015-05-08 14:41:42 -0400
Commit: 0b2c252, github.com/apache/spark/pull/5942
[SPARK-7436] Fixed instantiation of custom recovery mode factory and added tests
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-05-08 11:38:09 -0700
Commit: 89d9487, github.com/apache/spark/pull/5976
[SPARK-6824] Fill the docs for DataFrame API in SparkR
hqzizania <qian.huang@intel.com>, qhuang <qian.huang@intel.com>
2015-05-08 11:25:04 -0700
Commit: 4f01f5b, github.com/apache/spark/pull/5969
[SPARK-7474] [MLLIB] update ParamGridBuilder doctest
Xiangrui Meng <meng@databricks.com>
2015-05-08 11:16:04 -0700
Commit: 75fed0c, github.com/apache/spark/pull/6001
[SPARK-7383] [ML] Feature Parity in PySpark for ml.features
Burak Yavuz <brkyvz@gmail.com>
2015-05-08 11:14:39 -0700
Commit: 85e1154, github.com/apache/spark/pull/5991
[SPARK-3454] separate json endpoints for data in the UI
Imran Rashid <irashid@cloudera.com>
2015-05-08 16:54:32 +0100
Commit: 532bfda, github.com/apache/spark/pull/5940
[SPARK-6869] [PYSPARK] Add pyspark archives path to PYTHONPATH
Lianhui Wang <lianhuiwang09@gmail.com>
2015-05-08 08:44:46 -0500
Commit: acf4bc1, github.com/apache/spark/pull/5580
[SPARK-7392] [CORE] bugfix: Kryo buffer size cannot be larger than 2M
Zhang, Liye <liye.zhang@intel.com>
2015-05-08 09:10:58 +0100
Commit: f5e9678, github.com/apache/spark/pull/5934
[SPARK-7232] [SQL] Add a Substitution batch for spark sql analyzer
wangfei <wangfei1@huawei.com>
2015-05-07 22:55:42 -0700
Commit: bb5872f, github.com/apache/spark/pull/5776
[SPARK-7470] [SQL] Spark shell SQLContext crashes without hive
Andrew Or <andrew@databricks.com>
2015-05-07 22:32:13 -0700
Commit: 1a3e9e9, github.com/apache/spark/pull/5997
[SPARK-6986] [SQL] Use Serializer2 in more cases.
Yin Huai <yhuai@databricks.com>
2015-05-07 20:59:42 -0700
Commit: 9d0d289, github.com/apache/spark/pull/5849
[SPARK-7452] [MLLIB] fix bug in topByKey and update test
Shuo Xiang <shuoxiangpub@gmail.com>
2015-05-07 20:55:08 -0700
Commit: 28d4238, github.com/apache/spark/pull/5990
[SPARK-6908] [SQL] Use isolated Hive client
Michael Armbrust <michael@databricks.com>
2015-05-07 19:36:24 -0700
Commit: 05454fd, github.com/apache/spark/pull/5876
[SPARK-7305] [STREAMING] [WEBUI] Make BatchPage show friendly information when jobs are dropped by SparkListener
zsxwing <zsxwing@gmail.com>
2015-05-07 17:34:44 -0700
Commit: 2e8a141, github.com/apache/spark/pull/5840
[SPARK-7450] Use UNSAFE.getLong() to speed up BitSetMethods#anySet()
tedyu <yuzhihong@gmail.com>
2015-05-07 16:53:59 -0700
Commit: 99897fe, github.com/apache/spark/pull/5897
[SPARK-2155] [SQL] add CaseKeyWhen for "CASE a WHEN b THEN c [WHEN d THEN e]* [ELSE f] END"
Wenchen Fan <cloud0fan@outlook.com>
2015-05-07 16:26:49 -0700
Commit: 622a0c5, github.com/apache/spark/pull/5979
[SPARK-5281] [SQL] Registering table on RDD is giving MissingRequirementError
Iulian Dragos <jaguarul@gmail.com>
2015-05-07 16:24:11 -0700
Commit: 9fd25f7, github.com/apache/spark/pull/5981
[SPARK-7277] [SQL] Throw exception if the property mapred.reduce.tasks is set to -1
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-07 16:22:45 -0700
Commit: 7064ea0, github.com/apache/spark/pull/5811
[SQL] [MINOR] make star and multialias extend NamedExpression
scwf <wangfei1@huawei.com>
2015-05-07 16:21:24 -0700
Commit: 2425e4d, github.com/apache/spark/pull/5928
[SPARK-6948] [MLLIB] compress vectors in VectorAssembler
Xiangrui Meng <meng@databricks.com>
2015-05-07 15:45:37 -0700
Commit: 475143a, github.com/apache/spark/pull/5985
[SPARK-5726] [MLLIB] Elementwise (Hadamard) Vector Product Transformer
Octavian Geagla <ogeagla@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
2015-05-07 14:49:55 -0700
Commit: 76e58b5, github.com/apache/spark/pull/4580
[SPARK-7328] [MLLIB] [PYSPARK] Pyspark.mllib.linalg.Vectors: Missing items
MechCoder <manojkumarsivaraj334@gmail.com>
2015-05-07 14:02:05 -0700
Commit: 4436e26, github.com/apache/spark/pull/5872
[SPARK-7347] DAG visualization: add tooltips to RDDs
Andrew Or <andrew@databricks.com>
2015-05-07 12:29:56 -0700
Commit: 1b742a4, github.com/apache/spark/pull/5957
[SPARK-7391] DAG visualization: auto expand if linked from another viz
Andrew Or <andrew@databricks.com>
2015-05-07 12:29:18 -0700
Commit: 800c0fc, github.com/apache/spark/pull/5958
[SPARK-7373] [MESOS] Add docker support for launching drivers in mesos cluster mode.
Timothy Chen <tnachen@gmail.com>
2015-05-07 12:23:16 -0700
Commit: 226033c, github.com/apache/spark/pull/5917
[SPARK-7399] [SPARK CORE] Fixed compilation error in scala 2.11
Tijo Thomas <tijoparacka@gmail.com>
2015-05-07 12:21:09 -0700
Commit: d4e31bf, github.com/apache/spark/pull/5966
[SPARK-5213] [SQL] Remove the duplicated SparkSQLParser
Cheng Hao <hao.cheng@intel.com>
2015-05-07 12:09:54 -0700
Commit: 2b0c423, github.com/apache/spark/pull/5965
[SPARK-7116] [SQL] [PYSPARK] Remove cache() causing memory leak
ksonj <kson@siberie.de>
2015-05-07 12:04:19 -0700
Commit: 86f141c, github.com/apache/spark/pull/5973
[SPARK-1442] [SQL] [FOLLOW-UP] Address minor comments in Window Function PR (#5604).
Yin Huai <yhuai@databricks.com>
2015-05-07 11:46:49 -0700
Commit: 9dcf4f7, github.com/apache/spark/pull/5945
[SPARK-6093] [MLLIB] Add RegressionMetrics in PySpark/MLlib
Yanbo Liang <ybliang8@gmail.com>
2015-05-07 11:18:32 -0700
Commit: ef835dc, github.com/apache/spark/pull/5941
[SPARK-7118] [Python] Add the coalesce Spark SQL function available in PySpark
Olivier Girardot <o.girardot@lateral-thoughts.com>
2015-05-07 10:58:35 -0700
Commit: 3038b26, github.com/apache/spark/pull/5698
[SPARK-7388] [SPARK-7383] wrapper for VectorAssembler in Python
Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>
2015-05-07 10:25:41 -0700
Commit: 6b9737a, github.com/apache/spark/pull/5930
[SPARK-7330] [SQL] avoid NPE at jdbc rdd
Daoyuan Wang <daoyuan.wang@intel.com>
2015-05-07 10:05:01 -0700
Commit: 84ee348, github.com/apache/spark/pull/5877
[SPARK-7429] [ML] Params cleanups
Joseph K. Bradley <joseph@databricks.com>
2015-05-07 01:28:44 -0700
Commit: 91ce131, github.com/apache/spark/pull/5960
[SPARK-7421] [MLLIB] OnlineLDA cleanups
Joseph K. Bradley <joseph@databricks.com>
2015-05-07 01:12:14 -0700
Commit: a038c51, github.com/apache/spark/pull/5956
[SPARK-7035] Encourage __getitem__ over __getattr__ on column access in the Python DataFrame API
ksonj <kson@siberie.de>
2015-05-07 01:02:00 -0700
Commit: b929a75, github.com/apache/spark/pull/5971
[SPARK-7295][SQL] bitwise operations for DataFrame DSL
Shiti <ssaxena.ece@gmail.com>
2015-05-07 01:00:29 -0700
Commit: 703211b, github.com/apache/spark/pull/5867
[SPARK-7217] [STREAMING] Add configuration to control the default behavior of StreamingContext.stop() implicitly calling SparkContext.stop()
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-07 00:24:44 -0700
Commit: cb13c98, github.com/apache/spark/pull/5929
[SPARK-7430] [STREAMING] [TEST] General improvements to streaming tests to increase debuggability
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-07 00:21:10 -0700
Commit: 065d114, github.com/apache/spark/pull/5961
[SPARK-5938] [SPARK-5443] [SQL] Improve JsonRDD performance
Nathan Howell <nhowell@godaddy.com>
2015-05-06 22:56:53 -0700
Commit: 2337ccc1, github.com/apache/spark/pull/5801
[SPARK-6812] [SPARKR] filter() on DataFrame does not work as expected.
Sun Rui <rui.sun@intel.com>
2015-05-06 22:48:16 -0700
Commit: 4948f42, github.com/apache/spark/pull/5938
[SPARK-7432] [MLLIB] disable cv doctest
Xiangrui Meng <meng@databricks.com>
2015-05-06 22:29:07 -0700
Commit: fb4967b, github.com/apache/spark/pull/5962
[SPARK-7405] [STREAMING] Fix the bug that ReceiverInputDStream doesn't report InputInfo
zsxwing <zsxwing@gmail.com>
2015-05-06 18:07:00 -0700
Commit: d6e76cb, github.com/apache/spark/pull/5950
[HOT FIX] For DAG visualization #5954
Andrew Or <andrew@databricks.com>
2015-05-06 18:02:08 -0700
Commit: 85a644b
[SPARK-7371] [SPARK-7377] [SPARK-7408] DAG visualization addendum (#5729)
Andrew Or <andrew@databricks.com>
2015-05-06 17:52:34 -0700
Commit: 76e8344, github.com/apache/spark/pull/5954
[SPARK-7396] [STREAMING] [EXAMPLE] Update KafkaWordCountProducer to use new Producer API
jerryshao <saisai.shao@intel.com>
2015-05-06 17:44:43 -0700
Commit: ba24dfa, github.com/apache/spark/pull/5936
[SPARK-6799] [SPARKR] Remove SparkR RDD examples, add dataframe examples
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-05-06 17:28:11 -0700
Commit: 4b91e18, github.com/apache/spark/pull/5949
[HOT FIX] [SPARK-7418] Ignore flaky SparkSubmitUtilsSuite test
Andrew Or <andrew@databricks.com>
2015-05-06 17:08:39 -0700
Commit: c0ec20a
[SPARK-5995] [ML] Make Prediction dev API public
Joseph K. Bradley <joseph@databricks.com>
2015-05-06 16:15:51 -0700
Commit: b681b93, github.com/apache/spark/pull/5913
[HOT-FIX] Move HiveWindowFunctionQuerySuite.scala to hive compatibility dir.
Yin Huai <yhuai@databricks.com>
2015-05-06 14:48:25 -0700
Commit: 14bcb84, github.com/apache/spark/pull/5951
Add `Private` annotation.
Josh Rosen <joshrosen@databricks.com>
2015-05-06 11:03:17 -0700
Commit: 2163367
[SPARK-7311] Introduce internal Serializer API for determining if serializers support object relocation
Josh Rosen <joshrosen@databricks.com>
2015-05-06 10:52:55 -0700
Commit: d651e28, github.com/apache/spark/pull/5924
[SPARK-1442] [SQL] Window Function Support for Spark SQL
Yin Huai <yhuai@databricks.com>
2015-05-06 10:43:00 -0700
Commit: b521a3b, github.com/apache/spark/pull/5604
[SPARK-6201] [SQL] promote string and do widen types for IN
Daoyuan Wang <daoyuan.wang@intel.com>
2015-05-06 10:30:42 -0700
Commit: 7212897, github.com/apache/spark/pull/4945
[SPARK-5456] [SQL] fix decimal compare for jdbc rdd
Daoyuan Wang <daoyuan.wang@intel.com>
2015-05-06 10:05:10 -0700
Commit: f1a5caf, github.com/apache/spark/pull/5803
[SQL] JavaDoc update for various DataFrame functions.
Reynold Xin <rxin@databricks.com>
2015-05-06 08:50:56 -0700
Commit: 389b755, github.com/apache/spark/pull/5935
[SPARK-6940] [MLLIB] Add CrossValidator to Python ML pipeline API
Xiangrui Meng <meng@databricks.com>
2015-05-06 01:28:43 -0700
Commit: 3e27a54, github.com/apache/spark/pull/5926
[SPARK-7384][Core][Tests] Fix flaky tests for distributed mode in BroadcastSuite
zsxwing <zsxwing@gmail.com>
2015-05-05 23:25:28 -0700
Commit: 20f9237, github.com/apache/spark/pull/5925
[SPARK-6267] [MLLIB] Python API for IsotonicRegression
Yanbo Liang <ybliang8@gmail.com>, Xiangrui Meng <meng@databricks.com>
2015-05-05 22:57:13 -0700
Commit: 384ac3c, github.com/apache/spark/pull/5890
[SPARK-7358][SQL] Move DataFrame mathfunctions into functions
Burak Yavuz <brkyvz@gmail.com>
2015-05-05 22:56:01 -0700
Commit: 8aa6681, github.com/apache/spark/pull/5923
[SPARK-6841] [SPARKR] add support for mean, median, stdev etc.
qhuang <qian.huang@intel.com>
2015-05-05 20:39:56 -0700
Commit: b5cd7dc, github.com/apache/spark/pull/5446
Revert "[SPARK-3454] separate json endpoints for data in the UI"
Reynold Xin <rxin@databricks.com>
2015-05-05 19:28:35 -0700
Commit: 765f6e1
[SPARK-6231][SQL/DF] Automatically resolve join condition ambiguity for self-joins.
Reynold Xin <rxin@databricks.com>
2015-05-05 18:59:46 -0700
Commit: e61083c, github.com/apache/spark/pull/5919
Some minor cleanup after SPARK-4550.
Sandy Ryza <sandy@cloudera.com>
2015-05-05 18:32:16 -0700
Commit: 762ff2e, github.com/apache/spark/pull/5916
[SPARK-7230] [SPARKR] Make RDD private in SparkR.
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-05-05 14:40:33 -0700
Commit: 4afb578, github.com/apache/spark/pull/5895
[SQL][Minor] make StringComparison extend ExpectsInputTypes
wangfei <wangfei1@huawei.com>
2015-05-05 14:24:37 -0700
Commit: b6566a2, github.com/apache/spark/pull/5905
[SPARK-7351] [STREAMING] [DOCS] Add spark.streaming.ui.retainedBatches to docs
zsxwing <zsxwing@gmail.com>
2015-05-05 13:42:23 -0700
Commit: 4c95fe5, github.com/apache/spark/pull/5899
[SPARK-7294][SQL] ADD BETWEEN
云峤 <chensong.cs@alibaba-inc.com>, kaka1992 <kaka_1992@163.com>
2015-05-05 13:23:53 -0700
Commit: c68d0e2, github.com/apache/spark/pull/5839
[SPARK-6939] [STREAMING] [WEBUI] Add timeline and histogram graphs for streaming statistics
zsxwing <zsxwing@gmail.com>
2015-05-05 12:52:16 -0700
Commit: 8109c9e, github.com/apache/spark/pull/5533
[SPARK-5888] [MLLIB] Add OneHotEncoder as a Transformer
Sandy Ryza <sandy@cloudera.com>
2015-05-05 12:34:02 -0700
Commit: 94ac9eb, github.com/apache/spark/pull/5500
[SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySpark
Xiangrui Meng <meng@databricks.com>
2015-05-05 11:45:37 -0700
Commit: dfb6bfc, github.com/apache/spark/pull/5885
[SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-05-05 11:01:25 -0700
Commit: 598902b, github.com/apache/spark/pull/5900
[SPARK-7007] [CORE] Add a metric source for ExecutorAllocationManager
jerryshao <saisai.shao@intel.com>
2015-05-05 09:43:49 -0700
Commit: 29350ee, github.com/apache/spark/pull/5589
[SPARK-7318] [STREAMING] DStream cleans objects that are not closures
Andrew Or <andrew@databricks.com>
2015-05-05 09:37:49 -0700
Commit: acc877a, github.com/apache/spark/pull/5860
[SPARK-7237] Many user provided closures are not actually cleaned
Andrew Or <andrew@databricks.com>
2015-05-05 09:37:04 -0700
Commit: 01d4022, github.com/apache/spark/pull/5787
[SPARK-6612] [MLLIB] [PYSPARK] Python KMeans parity
Hrishikesh Subramonian <hrishikesh.subramonian@flytxt.com>
2015-05-05 07:57:39 -0700
Commit: 8b63103, github.com/apache/spark/pull/5647
[SPARK-7202] [MLLIB] [PYSPARK] Add SparseMatrixPickler to SerDe
MechCoder <manojkumarsivaraj334@gmail.com>
2015-05-05 07:53:11 -0700
Commit: cd55e9a, github.com/apache/spark/pull/5775
[SPARK-7350] [STREAMING] [WEBUI] Attach the Streaming tab when calling ssc.start()
zsxwing <zsxwing@gmail.com>
2015-05-05 15:09:58 +0100
Commit: 49923f7, github.com/apache/spark/pull/5898
[SPARK-5074] [CORE] [TESTS] Fix the flaky test 'run shuffle with map stage failure' in DAGSchedulerSuite
zsxwing <zsxwing@gmail.com>
2015-05-05 15:04:14 +0100
Commit: 6f35dac, github.com/apache/spark/pull/5903
[MINOR] Minor update for document
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-05 14:44:02 +0100
Commit: d288322, github.com/apache/spark/pull/5906
[SPARK-3454] separate json endpoints for data in the UI
Imran Rashid <irashid@cloudera.com>
2015-05-05 07:25:40 -0500
Commit: ff8b449, github.com/apache/spark/pull/4435
[SPARK-5112] Expose SizeEstimator as a developer api
Sandy Ryza <sandy@cloudera.com>
2015-05-05 12:38:46 +0100
Commit: 0327ca2, github.com/apache/spark/pull/3913
[SPARK-6653] [YARN] New config to specify port for sparkYarnAM actor system
shekhar.bansal <shekhar.bansal@guavus.com>
2015-05-05 11:09:51 +0100
Commit: 93af96a, github.com/apache/spark/pull/5719
[SPARK-7341] [STREAMING] [TESTS] Fix the flaky test: org.apache.spark.stre...
zsxwing <zsxwing@gmail.com>
2015-05-05 02:15:39 -0700
Commit: 0634510, github.com/apache/spark/pull/5891
[SPARK-7113] [STREAMING] Support input information reporting for Direct Kafka stream
jerryshao <saisai.shao@intel.com>
2015-05-05 02:01:06 -0700
Commit: becdb81, github.com/apache/spark/pull/5879
[HOTFIX] [TEST] Ignoring flaky tests
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-05 01:58:51 -0700
Commit: e8f847a, github.com/apache/spark/pull/5901
[SPARK-7139] [STREAMING] Allow received block metadata to be saved to WAL and recovered on driver failure
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-05 01:45:19 -0700
Commit: ae27c0e, github.com/apache/spark/pull/5732
[MINOR] [BUILD] Declare ivy dependency in root pom.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-05 08:56:16 +0100
Commit: 5160437, github.com/apache/spark/pull/5893
[SPARK-7314] [SPARK-3524] [PYSPARK] upgrade Pyrolite to 4.4
Xiangrui Meng <meng@databricks.com>
2015-05-04 23:52:42 -0700
Commit: 21ed108, github.com/apache/spark/pull/5850
[SPARK-7236] [CORE] Fix to prevent AkkaUtils askWithReply from sleeping on final attempt
Bryan Cutler <bjcutler@us.ibm.com>
2015-05-04 18:29:22 -0700
Commit: 48655d1, github.com/apache/spark/pull/5896
[SPARK-7266] Add ExpectsInputTypes to expressions when possible.
Reynold Xin <rxin@databricks.com>
2015-05-04 18:03:07 -0700
Commit: 1388a46, github.com/apache/spark/pull/5796
[SPARK-7243][SQL] Contingency Tables for DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-05-04 17:02:49 -0700
Commit: ecf0d8a, github.com/apache/spark/pull/5842
[SPARK-6943] [SPARK-6944] DAG visualization on SparkUI
Andrew Or <andrew@databricks.com>
2015-05-04 16:21:36 -0700
Commit: 863ec0c, github.com/apache/spark/pull/5729
[SPARK-7319][SQL] Improve the output from DataFrame.show()
云峤 <chensong.cs@alibaba-inc.com>
2015-05-04 12:08:38 -0700
Commit: 34edaa8, github.com/apache/spark/pull/5865
[SPARK-5956] [MLLIB] Pipeline components should be copyable.
Xiangrui Meng <meng@databricks.com>
2015-05-04 11:28:59 -0700
Commit: 893b310, github.com/apache/spark/pull/5820
[SPARK-5100] [SQL] add webui for thriftserver
tianyi <tianyi.asiainfo@gmail.com>
2015-05-04 16:59:34 +0800
Commit: 343d3bf, github.com/apache/spark/pull/5730
[SPARK-5563] [MLLIB] LDA with online variational inference
Yuhao Yang <hhbyyh@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
2015-05-04 00:06:25 -0700
Commit: 3539cb7, github.com/apache/spark/pull/4419
[SPARK-7241] Pearson correlation for DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-05-03 21:44:39 -0700
Commit: 9646018, github.com/apache/spark/pull/5858
[SPARK-7329] [MLLIB] simplify ParamGridBuilder impl
Xiangrui Meng <meng@databricks.com>
2015-05-03 18:06:48 -0700
Commit: 1ffa8cb, github.com/apache/spark/pull/5873
[SPARK-7302] [DOCS] SPARK building documentation still mentions building for yarn 0.23
Sean Owen <sowen@cloudera.com>
2015-05-03 21:22:31 +0100
Commit: 9e25b09, github.com/apache/spark/pull/5863
[SPARK-6907] [SQL] Isolated client for HiveMetastore
Michael Armbrust <michael@databricks.com>
2015-05-03 13:12:50 -0700
Commit: daa70bf, github.com/apache/spark/pull/5851
[SPARK-7022] [PYSPARK] [ML] Add ML.Tuning.ParamGridBuilder to PySpark
Omede Firouz <ofirouz@palantir.com>, Omede <omedefirouz@gmail.com>
2015-05-03 11:42:02 -0700
Commit: f4af925, github.com/apache/spark/pull/5601
[SPARK-7031] [THRIFTSERVER] let thrift server take SPARK_DAEMON_MEMORY and SPARK_DAEMON_JAVA_OPTS
WangTaoTheTonic <wangtao111@huawei.com>
2015-05-03 00:47:47 +0100
Commit: 49549d5, github.com/apache/spark/pull/5609
[SPARK-7255] [STREAMING] [DOCUMENTATION] Added documentation for spark.streaming.kafka.maxRetries
BenFradet <benjamin.fradet@gmail.com>
2015-05-02 23:41:14 +0100
Commit: ea841ef, github.com/apache/spark/pull/5808
[SPARK-5213] [SQL] Pluggable SQL Parser Support
Cheng Hao <hao.cheng@intel.com>, scwf <wangfei1@huawei.com>
2015-05-02 15:20:07 -0700
Commit: 5d6b90d, github.com/apache/spark/pull/5827
[MINOR] [HIVE] Fix QueryPartitionSuite.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-02 23:10:35 +0100
Commit: 82c8c37, github.com/apache/spark/pull/5854
[SPARK-6030] [CORE] Using simulated field layout method to compute class shellSize
Ye Xianjin <advancedxy@gmail.com>
2015-05-02 23:08:09 +0100
Commit: bfcd528, github.com/apache/spark/pull/4783
[SPARK-7323] [SPARK CORE] Use insertAll instead of insert while merging combiners in reducer
Mridul Muralidharan <mridulm@yahoo-inc.com>
2015-05-02 23:05:51 +0100
Commit: da30352, github.com/apache/spark/pull/5862
[SPARK-3444] Fix typo in Dataframes.py introduced in []
Dean Chen <deanchen5@gmail.com>
2015-05-02 23:04:13 +0100
Commit: 856a571, github.com/apache/spark/pull/5866
[SPARK-7315] [STREAMING] [TEST] Fix flaky WALBackedBlockRDDSuite
Tathagata Das <tathagata.das1565@gmail.com>
2015-05-02 01:53:14 -0700
Commit: ecc6eb5, github.com/apache/spark/pull/5853
[SPARK-7120] [SPARK-7121] Closure cleaner nesting + documentation + tests
Andrew Or <andrew@databricks.com>
2015-05-01 23:57:58 -0700
Commit: 7394e7a, github.com/apache/spark/pull/5685
[SPARK-7242] added python api for freqItems in DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-05-01 23:43:24 -0700
Commit: 2e0f357, github.com/apache/spark/pull/5859
[SPARK-7317] [Shuffle] Expose shuffle handle
Mridul Muralidharan <mridulm@yahoo-inc.com>
2015-05-01 21:23:42 -0700
Commit: b79aeb9, github.com/apache/spark/pull/5857
[SPARK-6229] Add SASL encryption to network library.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-01 19:01:46 -0700
Commit: 38d4e9e, github.com/apache/spark/pull/5377
[SPARK-2691] [MESOS] Support for Mesos DockerInfo
Chris Heller <hellertime@gmail.com>
2015-05-01 18:41:22 -0700
Commit: 8f50a07, github.com/apache/spark/pull/3074
[SPARK-6443] [SPARK SUBMIT] Could not submit app in standalone cluster mode when HA is enabled
WangTaoTheTonic <wangtao111@huawei.com>
2015-05-01 18:38:20 -0700
Commit: b4b43df, github.com/apache/spark/pull/5116
[SPARK-7216] [MESOS] Add driver details page to Mesos cluster UI.
Timothy Chen <tnachen@gmail.com>
2015-05-01 18:36:42 -0700
Commit: 2022193, github.com/apache/spark/pull/5763
[SPARK-6954] [YARN] ExecutorAllocationManager can end up requesting a negative n...
Sandy Ryza <sandy@cloudera.com>
2015-05-01 18:32:46 -0700
Commit: 099327d, github.com/apache/spark/pull/5704
[SPARK-3444] Provide an easy way to change log level
Holden Karau <holden@pigscanfly.ca>
2015-05-01 18:02:10 -0700
Commit: ae98eec, github.com/apache/spark/pull/5791
[SPARK-2808][Streaming][Kafka] update kafka to 0.8.2
cody koeninger <cody@koeninger.org>, Helena Edelson <helena.edelson@datastax.com>
2015-05-01 17:54:56 -0700
Commit: 4786484, github.com/apache/spark/pull/4537
[SPARK-7112][Streaming][WIP] Add an InputInfoTracker to track all the input streams
jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
2015-05-01 17:46:06 -0700
Commit: b88c275, github.com/apache/spark/pull/5680
[SPARK-7309] [CORE] [STREAMING] Shutdown the thread pools in ReceivedBlockHandler and DAGScheduler
zsxwing <zsxwing@gmail.com>
2015-05-01 17:41:55 -0700
Commit: ebc25a4, github.com/apache/spark/pull/5845
[SPARK-6999] [SQL] Remove the infinite recursive method (useless)
Cheng Hao <hao.cheng@intel.com>
2015-05-01 19:39:30 -0500
Commit: 98e7045, github.com/apache/spark/pull/5804
[SPARK-7304] [BUILD] Include $@ in call to mvn consistently in make-distribution.sh
Rajendra Gokhale (rvgcentos) <rvg@cloudera.com>
2015-05-01 17:01:36 -0700
Commit: e6fb377, github.com/apache/spark/pull/5846
[SPARK-7312][SQL] SPARK-6913 broke jdk6 build
Yin Huai <yhuai@databricks.com>
2015-05-01 16:47:00 -0700
Commit: 41c6a44, github.com/apache/spark/pull/5847
Ignore flaky test in SparkSubmitUtilsSuite
Patrick Wendell <patrick@databricks.com>
2015-05-01 14:42:58 -0700
Commit: 5c1faba
[SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS
Hari Shreedharan <hshreedharan@apache.org>
2015-05-01 15:32:09 -0500
Commit: b1f4ca8, github.com/apache/spark/pull/5823
[SPARK-7240][SQL] Single pass covariance calculation for dataframes
Burak Yavuz <brkyvz@gmail.com>
2015-05-01 13:29:17 -0700
Commit: 4dc8d74, github.com/apache/spark/pull/5825
[SPARK-7281] [YARN] Add option to set AM's lib path in client mode.
Marcelo Vanzin <vanzin@cloudera.com>
2015-05-01 21:20:46 +0100
Commit: 7b5dd3e, github.com/apache/spark/pull/5813
[SPARK-7213] [YARN] Check for read permissions before copying a Hadoop config file
Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
2015-05-01 21:14:16 +0100
Commit: f53a488, github.com/apache/spark/pull/5760
Revert "[SPARK-7224] added mock repository generator for --packages tests"
Patrick Wendell <patrick@databricks.com>
2015-05-01 13:01:43 -0700
Commit: c6d9a42
Revert "[SPARK-7287] enabled fixed test"
Patrick Wendell <patrick@databricks.com>
2015-05-01 13:01:14 -0700
Commit: 58d6584
[SPARK-7274] [SQL] Create Column expression for array/struct creation.
Reynold Xin <rxin@databricks.com>
2015-05-01 12:49:02 -0700
Commit: 3753776, github.com/apache/spark/pull/5802
[SPARK-7183] [NETWORK] Fix memory leak of TransportRequestHandler.streamIds
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-01 11:59:12 -0700
Commit: 1686032, github.com/apache/spark/pull/5743
[SPARK-6846] [WEBUI] [HOTFIX] return to GET for kill link in UI since YARN AM won't proxy POST
Sean Owen <sowen@cloudera.com>
2015-05-01 19:57:37 +0100
Commit: 1262e31, github.com/apache/spark/pull/5837
[SPARK-5854] personalized page rank
Dan McClary <dan.mcclary@gmail.com>, dwmclary <dan.mcclary@gmail.com>
2015-05-01 11:55:43 -0700
Commit: 7d42722, github.com/apache/spark/pull/4774
changing persistence engine trait to an abstract class
niranda <niranda.perera@gmail.com>
2015-05-01 11:27:45 -0700
Commit: 27de6fe, github.com/apache/spark/pull/5832
Limit help option regex
Chris Biow <chris.biow@10gen.com>
2015-05-01 19:26:55 +0100
Commit: c8c481d, github.com/apache/spark/pull/5816
[SPARK-5891] [ML] Add Binarizer ML Transformer
Liang-Chi Hsieh <viirya@gmail.com>
2015-05-01 08:31:01 -0700
Commit: 7630213, github.com/apache/spark/pull/5699
[SPARK-3066] [MLLIB] Support recommendAll in matrix factorization model
Debasish Das <debasish.das@one.verizon.com>, Xiangrui Meng <meng@databricks.com>
2015-05-01 08:27:46 -0700
Commit: 3b514af, github.com/apache/spark/pull/3098
[SPARK-4705] Handle multiple app attempts event logs, history server.
Marcelo Vanzin <vanzin@cloudera.com>, twinkle sachdeva <twinkle@kite.ggn.in.guavus.com>, twinkle.sachdeva <twinkle.sachdeva@guavus.com>, twinkle sachdeva <twinkle.sachdeva@guavus.com>
2015-05-01 09:50:55 -0500
Commit: 3052f49, github.com/apache/spark/pull/5432
[SPARK-3468] [WEBUI] Timeline-View feature
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-05-01 01:39:56 -0700
Commit: 7fe0f3f, github.com/apache/spark/pull/2342
[SPARK-6257] [PYSPARK] [MLLIB] MLlib API missing items in Recommendation
MechCoder <manojkumarsivaraj334@gmail.com>
2015-04-30 23:51:00 -0700
Commit: c24aeb6, github.com/apache/spark/pull/5807
[SPARK-7291] [CORE] Fix a flaky test in AkkaRpcEnvSuite
zsxwing <zsxwing@gmail.com>
2015-04-30 23:44:33 -0700
Commit: 14b3288, github.com/apache/spark/pull/5822
[SPARK-7287] enabled fixed test
Burak Yavuz <brkyvz@gmail.com>
2015-04-30 23:39:58 -0700
Commit: 7cf1eb7, github.com/apache/spark/pull/5826
[SPARK-4550] In sort-based shuffle, store map outputs in serialized form
Sandy Ryza <sandy@cloudera.com>
2015-04-30 23:14:14 -0700
Commit: 0a2b15c, github.com/apache/spark/pull/4450
HOTFIX: Disable buggy dependency checker
Patrick Wendell <patrick@databricks.com>
2015-04-30 22:39:58 -0700
Commit: a9fc505
[SPARK-6479] [BLOCK MANAGER] Create off-heap block storage API
Zhan Zhang <zhazhan@gmail.com>
2015-04-30 22:24:31 -0700
Commit: 36a7a68, github.com/apache/spark/pull/5430
[SPARK-7248] implemented random number generators for DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-04-30 21:56:03 -0700
Commit: b5347a4, github.com/apache/spark/pull/5819
[SPARK-7282] [STREAMING] Fix the race conditions in StreamingListenerSuite
zsxwing <zsxwing@gmail.com>
2015-04-30 21:32:11 -0700
Commit: 69a739c, github.com/apache/spark/pull/5812
Revert "[SPARK-5213] [SQL] Pluggable SQL Parser Support"
Patrick Wendell <patrick@databricks.com>
2015-04-30 20:33:36 -0700
Commit: beeafcf
[SPARK-7123] [SQL] support table.star in sqlcontext
scwf <wangfei1@huawei.com>
2015-04-30 18:50:14 -0700
Commit: 473552f, github.com/apache/spark/pull/5690
[SPARK-5213] [SQL] Pluggable SQL Parser Support
Cheng Hao <hao.cheng@intel.com>
2015-04-30 18:49:06 -0700
Commit: 3ba5aaa, github.com/apache/spark/pull/4015
[SPARK-6913][SQL] Fixed "java.sql.SQLException: No suitable driver found"
Vyacheslav Baranov <slavik.baranov@gmail.com>
2015-04-30 18:45:14 -0700
Commit: e991255, github.com/apache/spark/pull/5782
[SPARK-7109] [SQL] Push down left side filter for left semi join
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
2015-04-30 18:18:54 -0700
Commit: a0d8a61, github.com/apache/spark/pull/5677
[SPARK-7093] [SQL] Using newPredicate in NestedLoopJoin to enable code generation
scwf <wangfei1@huawei.com>
2015-04-30 18:15:56 -0700
Commit: 0797338, github.com/apache/spark/pull/5665
[SPARK-7280][SQL] Add "drop" column/s on a data frame
rakeshchalasani <vnit.rakesh@gmail.com>
2015-04-30 17:42:50 -0700
Commit: ee04413, github.com/apache/spark/pull/5818
[SPARK-7242][SQL][MLLIB] Frequent items for DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-04-30 16:40:32 -0700
Commit: 149b3ee, github.com/apache/spark/pull/5799
[SPARK-7279] Removed diffSum, which is theoretically zero, in LinearRegression and fixed code formatting
DB Tsai <dbt@netflix.com>
2015-04-30 16:26:51 -0700
Commit: 1c3e402, github.com/apache/spark/pull/5809
[Build] Enable MiMa checks for SQL
Josh Rosen <joshrosen@databricks.com>
2015-04-30 16:23:01 -0700
Commit: fa01bec, github.com/apache/spark/pull/5727
[SPARK-7267][SQL] Push down Project when its child is Limit
Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
2015-04-30 15:22:13 -0700
Commit: 77cc25f, github.com/apache/spark/pull/5797
[SPARK-7288] Suppress compiler warnings due to use of sun.misc.Unsafe; add facade in front of Unsafe; remove use of Unsafe.setMemory
Josh Rosen <joshrosen@databricks.com>
2015-04-30 15:21:00 -0700
Commit: 07a8620, github.com/apache/spark/pull/5814
[SPARK-7196][SQL] Support precision and scale of decimal type for JDBC
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-30 15:13:43 -0700
Commit: 6702324, github.com/apache/spark/pull/5777
Revert "[SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS"
Patrick Wendell <patrick@databricks.com>
2015-04-30 14:59:20 -0700
Commit: e0628f2
[SPARK-7207] [ML] [BUILD] Added ml.recommendation, ml.regression to SparkBuild
Joseph K. Bradley <joseph@databricks.com>
2015-04-30 14:39:27 -0700
Commit: adbdb19, github.com/apache/spark/pull/5758
[SPARK-5342] [YARN] Allow long running Spark apps to run on secure YARN/HDFS
Hari Shreedharan <hshreedharan@apache.org>
2015-04-30 13:03:23 -0500
Commit: 6c65da6, github.com/apache/spark/pull/4688
[SPARK-7224] added mock repository generator for --packages tests
Burak Yavuz <brkyvz@gmail.com>
2015-04-30 10:19:08 -0700
Commit: 7dacc08, github.com/apache/spark/pull/5790
[HOTFIX] Disabling flaky test (fix in progress as part of SPARK-7224)
Patrick Wendell <patrick@databricks.com>
2015-04-30 01:02:33 -0700
Commit: 47bf406
[SPARK-1406] MLlib PMML model export
Vincenzo Selvaggio <vselvaggio@hotmail.it>, Xiangrui Meng <meng@databricks.com>, selvinsource <vselvaggio@hotmail.it>
2015-04-29 23:21:21 -0700
Commit: 254e050, github.com/apache/spark/pull/3062
[SPARK-7225][SQL] CombineLimits optimizer does not work
Zhongshuai Pei <799203320@qq.com>, DoingDone9 <799203320@qq.com>
2015-04-29 22:44:14 -0700
Commit: 4459514, github.com/apache/spark/pull/5770
Some code clean up.
DB Tsai <dbt@netflix.com>
2015-04-29 21:44:41 -0700
Commit: ba49eb1, github.com/apache/spark/pull/5794
[SPARK-7156][SQL] Addressed follow up comments for randomSplit
Burak Yavuz <brkyvz@gmail.com>
2015-04-29 19:13:47 -0700
Commit: 5553198, github.com/apache/spark/pull/5795
[SPARK-7234][SQL] Fix DateType mismatch when codegen on.
云峤 <chensong.cs@alibaba-inc.com>
2015-04-29 18:23:42 -0700
Commit: 7143f6e, github.com/apache/spark/pull/5778
[SPARK-6862] [STREAMING] [WEBUI] Add BatchPage to display details of a batch
zsxwing <zsxwing@gmail.com>
2015-04-29 18:22:14 -0700
Commit: 1b7106b, github.com/apache/spark/pull/5473
[SPARK-7176] [ML] Add validation functionality to Param
Joseph K. Bradley <joseph@databricks.com>
2015-04-29 17:26:46 -0700
Commit: 114bad6, github.com/apache/spark/pull/5740
[SQL] [Minor] Print detailed query execution info when the Spark answer is not right
wangfei <wangfei1@huawei.com>
2015-04-29 17:00:24 -0700
Commit: 1fdfdb4, github.com/apache/spark/pull/5774
[SPARK-7259] [ML] VectorIndexer: do not copy non-ML metadata to output column
Joseph K. Bradley <joseph@databricks.com>
2015-04-29 16:35:17 -0700
Commit: b1ef6a6, github.com/apache/spark/pull/5789
[SPARK-7229] [SQL] SpecificMutableRow should take integer type as internal representation for Date
Cheng Hao <hao.cheng@intel.com>
2015-04-29 16:23:34 -0700
Commit: f8cbb0a, github.com/apache/spark/pull/5772
[SPARK-7155] [CORE] Allow newAPIHadoopFile to support comma-separated list of files as input
yongtang <yongtang@users.noreply.github.com>
2015-04-29 23:55:51 +0100
Commit: 3fc6cfd, github.com/apache/spark/pull/5708
[SPARK-7181] [CORE] fix infinite loop in ExternalSorter's mergeWithAggregation
Qiping Li <liqiping1991@gmail.com>
2015-04-29 23:52:16 +0100
Commit: 7f4b583, github.com/apache/spark/pull/5737
[SPARK-7156][SQL] support RandomSplit in DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-04-29 15:34:05 -0700
Commit: d7dbce8, github.com/apache/spark/pull/5761
[SPARK-6529] [ML] Add Word2Vec transformer
Xusen Yin <yinxusen@gmail.com>
2015-04-29 14:55:32 -0700
Commit: c9d530e, github.com/apache/spark/pull/5596
[SPARK-7222] [ML] Added mathematical derivation in comment and compressed the model, removed the correction terms in LinearRegression with ElasticNet
DB Tsai <dbt@netflix.com>
2015-04-29 14:53:37 -0700
Commit: 15995c8, github.com/apache/spark/pull/5767
[SPARK-6629] cancelJobGroup() may not work for jobs whose job groups are inherited from parent threads
Josh Rosen <joshrosen@databricks.com>
2015-04-29 13:31:52 -0700
Commit: 3a180c1, github.com/apache/spark/pull/5288
[SPARK-6752] [STREAMING] [REOPENED] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
Tathagata Das <tathagata.das1565@gmail.com>
2015-04-29 13:10:31 -0700
Commit: a9c4e29, github.com/apache/spark/pull/5773
[SPARK-7056] [STREAMING] Make the Write Ahead Log pluggable
Tathagata Das <tathagata.das1565@gmail.com>
2015-04-29 13:06:11 -0700
Commit: 1868bd4, github.com/apache/spark/pull/5645
Fix a typo of "threshold"
Xusen Yin <yinxusen@gmail.com>
2015-04-29 10:13:48 -0700
Commit: c0c0ba6, github.com/apache/spark/pull/5769
[SQL][Minor] fix java doc for DataFrame.agg
Wenchen Fan <cloud0fan@outlook.com>
2015-04-29 09:49:24 -0700
Commit: 81ea42b, github.com/apache/spark/pull/5712
Better error message on access to non-existing attribute
ksonj <kson@siberie.de>
2015-04-29 09:48:47 -0700
Commit: 3df9c5d, github.com/apache/spark/pull/5771
[SPARK-7223] Rename RPC askWithReply -> askWithRetry, sendWithReply -> ask.
Reynold Xin <rxin@databricks.com>
2015-04-29 09:46:37 -0700
Commit: 687273d, github.com/apache/spark/pull/5768
[SPARK-6918] [YARN] Secure HBase support.
Dean Chen <deanchen5@gmail.com>
2015-04-29 08:58:33 -0500
Commit: baed3f2, github.com/apache/spark/pull/5586
[SPARK-7076][SPARK-7077][SPARK-7080][SQL] Use managed memory for aggregations
Josh Rosen <joshrosen@databricks.com>
2015-04-29 01:07:26 -0700
Commit: f49284b, github.com/apache/spark/pull/5725
[SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations
Patrick Wendell <patrick@databricks.com>
2015-04-29 00:35:08 -0700
Commit: 1fd6ed9, github.com/apache/spark/pull/5757
[SPARK-7188] added python support for math DataFrame functions
Burak Yavuz <brkyvz@gmail.com>
2015-04-29 00:09:24 -0700
Commit: fe917f5, github.com/apache/spark/pull/5750
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-04-28 23:38:59 -0700
Commit: 8dee274, github.com/apache/spark/pull/3205
[SPARK-7205] Support `.ivy2/local` and `.m2/repositories/` in --packages
Burak Yavuz <brkyvz@gmail.com>
2015-04-28 23:05:02 -0700
Commit: f98773a, github.com/apache/spark/pull/5755
[SPARK-7215] made coalesce and repartition a part of the query plan
Burak Yavuz <brkyvz@gmail.com>
2015-04-28 22:48:04 -0700
Commit: 271c4c6, github.com/apache/spark/pull/5762
[SPARK-6756] [MLLIB] add toSparse, toDense, numActives, numNonzeros, and compressed to Vector
Xiangrui Meng <meng@databricks.com>
2015-04-28 21:49:53 -0700
Commit: 5ef006f, github.com/apache/spark/pull/5756
[SPARK-7208] [ML] [PYTHON] Added Matrix, SparseMatrix to __all__ list in linalg.py
Joseph K. Bradley <joseph@databricks.com>
2015-04-28 21:15:47 -0700
Commit: a8aeadb, github.com/apache/spark/pull/5759
[SPARK-7138] [STREAMING] Add method to BlockGenerator to add multiple records to BlockGenerator with single callback
Tathagata Das <tathagata.das1565@gmail.com>
2015-04-28 19:31:57 -0700
Commit: 5c8f4bd, github.com/apache/spark/pull/5695
[SPARK-6965] [MLLIB] StringIndexer handles numeric input.
Xiangrui Meng <meng@databricks.com>
2015-04-28 17:41:09 -0700
Commit: d36e673, github.com/apache/spark/pull/5753
Closes #4807 Closes #5055 Closes #3583
Xiangrui Meng <meng@databricks.com>
2015-04-28 14:21:25 -0700
Commit: 555213e
[SPARK-7201] [MLLIB] move Identifiable to ml.util
Xiangrui Meng <meng@databricks.com>
2015-04-28 14:07:26 -0700
Commit: f0a1f90, github.com/apache/spark/pull/5749
[MINOR] [CORE] Warn users who try to cache RDDs with dynamic allocation on.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-28 13:49:29 -0700
Commit: 28b1af7, github.com/apache/spark/pull/5751
[SPARK-5338] [MESOS] Add cluster mode support for Mesos
Timothy Chen <tnachen@gmail.com>, Luc Bourlier <luc.bourlier@typesafe.com>
2015-04-28 13:31:08 -0700
Commit: 53befac, github.com/apache/spark/pull/5144
[SPARK-6314] [CORE] handle JsonParseException for history server
Zhang, Liye <liye.zhang@intel.com>
2015-04-28 12:33:48 -0700
Commit: 8009810, github.com/apache/spark/pull/5736
[SPARK-5932] [CORE] Use consistent naming for size properties
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-04-28 12:18:55 -0700
Commit: 2d222fb, github.com/apache/spark/pull/5574
[SPARK-4286] Add an external shuffle service that can be run as a daemon.
Iulian Dragos <jaguarul@gmail.com>
2015-04-28 12:08:18 -0700
Commit: 8aab94d, github.com/apache/spark/pull/4990
[Core][test][minor] replace try finally block with tryWithSafeFinally
Zhang, Liye <liye.zhang@intel.com>
2015-04-28 10:24:00 -0700
Commit: 52ccf1d, github.com/apache/spark/pull/5739
[SPARK-7140] [MLLIB] only scan the first 16 entries in Vector.hashCode
Xiangrui Meng <meng@databricks.com>
2015-04-28 09:59:36 -0700
Commit: b14cd23, github.com/apache/spark/pull/5697
[SPARK-5253] [ML] LinearRegression with L1/L2 (ElasticNet) using OWLQN
DB Tsai <dbt@netflix.com>, DB Tsai <dbtsai@alpinenow.com>
2015-04-28 09:46:08 -0700
Commit: 6a827d5, github.com/apache/spark/pull/4259
[SPARK-6435] spark-shell --jars option does not add all jars to classpath
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-04-28 07:55:21 -0400
Commit: 268c419, github.com/apache/spark/pull/5227
[SPARK-7100] [MLLIB] Fix persisted RDD leak in GradientBoostedTrees
Jim Carroll <jim@dontcallme.com>
2015-04-28 07:51:02 -0400
Commit: 75905c5, github.com/apache/spark/pull/5669
[SPARK-7168] [BUILD] Update plugin versions in Maven build and centralize versions
Sean Owen <sowen@cloudera.com>
2015-04-28 07:48:34 -0400
Commit: 7f3b3b7, github.com/apache/spark/pull/5720
[SPARK-6352] [SQL] Custom parquet output committer
Pei-Lun Lee <pllee@appier.com>
2015-04-28 16:50:18 +0800
Commit: e13cd86, github.com/apache/spark/pull/5525
[SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs.
Reynold Xin <rxin@databricks.com>
2015-04-28 00:39:08 -0700
Commit: d94cd1a, github.com/apache/spark/pull/5709
[SPARK-7187] SerializationDebugger should not crash user code
Andrew Or <andrew@databricks.com>
2015-04-28 00:38:14 -0700
Commit: bf35edd, github.com/apache/spark/pull/5734
[SPARK-5946] [STREAMING] Add Python API for direct Kafka stream
jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
2015-04-27 23:48:02 -0700
Commit: 9e4e82b, github.com/apache/spark/pull/4723
[SPARK-6829] Added math functions for DataFrames
Burak Yavuz <brkyvz@gmail.com>
2015-04-27 23:10:14 -0700
Commit: 29576e7, github.com/apache/spark/pull/5616
[SPARK-7174][Core] Move calling `TaskScheduler.executorHeartbeatReceived` to another thread
zsxwing <zsxwing@gmail.com>
2015-04-27 21:45:40 -0700
Commit: 874a2ca, github.com/apache/spark/pull/5723
[SPARK-7090] [MLLIB] Introduce LDAOptimizer to LDA to further improve extensibility
Yuhao Yang <hhbyyh@gmail.com>
2015-04-27 19:02:51 -0700
Commit: 4d9e560, github.com/apache/spark/pull/5661
[SPARK-7162] [YARN] Launcher error in yarn-client
GuoQiang Li <witgo@qq.com>
2015-04-27 19:52:41 -0400
Commit: 62888a4, github.com/apache/spark/pull/5716
[SPARK-7145] [CORE] commons-lang (2.x) classes used instead of commons-lang3 (3.x); commons-io used without dependency
Sean Owen <sowen@cloudera.com>
2015-04-27 19:50:55 -0400
Commit: ab5adb7, github.com/apache/spark/pull/5703
[SPARK-3090] [CORE] Stop SparkContext if user forgets to.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-27 19:46:17 -0400
Commit: 5d45e1f, github.com/apache/spark/pull/5696
[SPARK-6738] [CORE] Improve estimation of the size of a large array
Hong Shen <hongshen@tencent.com>
2015-04-27 18:57:31 -0400
Commit: 8e1c00d, github.com/apache/spark/pull/5608
[SPARK-7103] Fix crash with SparkContext.union when RDD has no partitioner
Steven She <steven@canopylabs.com>
2015-04-27 18:55:02 -0400
Commit: b9de9e0, github.com/apache/spark/pull/5679
[SPARK-6991] [SPARKR] Adds support for zipPartitions.
hlin09 <hlin09pu@gmail.com>
2015-04-27 15:04:37 -0700
Commit: ca9f4eb, github.com/apache/spark/pull/5568
SPARK-7107 Add parameter for zookeeper.znode.parent to hbase_inputformat...
tedyu <yuzhihong@gmail.com>
2015-04-27 14:42:40 -0700
Commit: ef82bdd, github.com/apache/spark/pull/5673
[SPARK-6856] [R] Make RDD information more useful in SparkR
Jeff Harrison <jeffrharrison@gmail.com>
2015-04-27 13:38:25 -0700
Commit: 7078f60, github.com/apache/spark/pull/5667
[SPARK-4925] Publish Spark SQL hive-thriftserver maven artifact
Misha Chernetsov <chernetsov@gmail.com>
2015-04-27 11:27:56 -0700
Commit: 998aac2, github.com/apache/spark/pull/5429
[SPARK-6505] [SQL] Remove the reflection call in HiveFunctionWrapper
baishuo <vc_java@hotmail.com>
2015-04-27 14:08:05 +0800
Commit: 82bb7fd, github.com/apache/spark/pull/5660
[SQL][Minor] rename DataTypeParser.apply to DataTypeParser.parse
wangfei <wangfei1@huawei.com>
2015-04-26 21:08:47 -0700
Commit: d188b8b, github.com/apache/spark/pull/5710
[SPARK-7152][SQL] Add a Column expression for partition ID.
Reynold Xin <rxin@databricks.com>
2015-04-26 11:46:58 -0700
Commit: ca55dc9, github.com/apache/spark/pull/5705
[MINOR] [MLLIB] Refactor toString method in MLLIB
Alain <aihe@usc.edu>
2015-04-26 07:14:24 -0400
Commit: 9a5bbe0, github.com/apache/spark/pull/5687
[SPARK-6014] [CORE] [HOTFIX] Add try-catch block around ShutDownHook
Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
2015-04-25 20:02:23 -0400
Commit: f5473c2, github.com/apache/spark/pull/5672
[SPARK-7092] Update spark scala version to 2.11.6
Prashant Sharma <prashant.s@imaginea.com>
2015-04-25 18:07:34 -0400
Commit: a11c868, github.com/apache/spark/pull/5662
[SQL] Update SQL readme to include instructions on generating golden answer files based on Hive 0.13.1.
Yin Huai <yhuai@databricks.com>
2015-04-25 13:43:39 -0700
Commit: aa6966f, github.com/apache/spark/pull/5702
[SPARK-6113] [ML] Tree ensembles for Pipelines API
Joseph K. Bradley <joseph@databricks.com>
2015-04-25 12:27:19 -0700
Commit: a7160c4, github.com/apache/spark/pull/5626
Revert "[SPARK-6752][Streaming] Allow StreamingContext to be recreated from checkpoint and existing SparkContext"
Patrick Wendell <patrick@databricks.com>
2015-04-25 10:37:34 -0700
Commit: a61d65f
update the deprecated CountMinSketchMonoid function to TopPctCMS function
KeheCAI <caikehe@gmail.com>
2015-04-25 08:42:38 -0400
Commit: cca9905, github.com/apache/spark/pull/5629
[SPARK-7136][Docs] Spark SQL and DataFrame Guide fix example file and paths
Deborah Siegel <deborah.siegel@gmail.com>, DEBORAH SIEGEL <deborahsiegel@d-140-142-0-49.dhcp4.washington.edu>, DEBORAH SIEGEL <deborahsiegel@DEBORAHs-MacBook-Pro.local>, DEBORAH SIEGEL <deborahsiegel@d-69-91-154-197.dhcp4.washington.edu>
2015-04-24 20:25:07 -0700
Commit: 59b7cfc, github.com/apache/spark/pull/5693
[PySpark][Minor] Update sql example, so that can read file correctly
linweizhong <linweizhong@huawei.com>
2015-04-24 20:23:19 -0700
Commit: d874f8b, github.com/apache/spark/pull/5684
[SPARK-6122] [CORE] Upgrade tachyon-client version to 0.6.3
Calvin Jia <jia.calvin@gmail.com>
2015-04-24 17:57:41 -0400
Commit: 438859e, github.com/apache/spark/pull/5354
[SPARK-6852] [SPARKR] Accept numeric as numPartitions in SparkR.
Sun Rui <rui.sun@intel.com>
2015-04-24 12:52:07 -0700
Commit: caf0136, github.com/apache/spark/pull/5613
[SPARK-7033] [SPARKR] Clean usage of split. Use partition instead where applicable.
Sun Rui <rui.sun@intel.com>
2015-04-24 11:00:19 -0700
Commit: ebb77b2, github.com/apache/spark/pull/5628
[SPARK-6528] [ML] Add IDF transformer
Xusen Yin <yinxusen@gmail.com>
2015-04-24 08:29:49 -0700
Commit: 6e57d57, github.com/apache/spark/pull/5266
[SPARK-7115] [MLLIB] skip the very first 1 in poly expansion
Xiangrui Meng <meng@databricks.com>
2015-04-24 08:27:48 -0700
Commit: 78b39c7, github.com/apache/spark/pull/5681
[SPARK-5894] [ML] Add polynomial mapper
Xusen Yin <yinxusen@gmail.com>, Xiangrui Meng <meng@databricks.com>
2015-04-24 00:39:29 -0700
Commit: 8509519, github.com/apache/spark/pull/5245
Fixed a typo from the previous commit.
Reynold Xin <rxin@databricks.com>
2015-04-23 22:39:00 -0700
Commit: 4c722d7
[SQL] Fixed expression data type matching.
Reynold Xin <rxin@databricks.com>
2015-04-23 21:21:03 -0700
Commit: d3a302d, github.com/apache/spark/pull/5675
Update sql-programming-guide.md
Ken Geis <geis.ken@gmail.com>
2015-04-23 20:45:33 -0700
Commit: 67bccbd, github.com/apache/spark/pull/5674
[SPARK-7060][SQL] Add alias function to python dataframe
Yin Huai <yhuai@databricks.com>
2015-04-23 18:52:55 -0700
Commit: 2d010f7, github.com/apache/spark/pull/5634
[SPARK-7037] [CORE] Inconsistent behavior for non-spark config properties in spark-shell and spark-submit
Cheolsoo Park <cheolsoop@netflix.com>
2015-04-23 20:10:55 -0400
Commit: 336f7f5, github.com/apache/spark/pull/5617
[SPARK-6818] [SPARKR] Support column deletion in SparkR DataFrame API.
Sun Rui <rui.sun@intel.com>
2015-04-23 16:08:14 -0700
Commit: 73db132, github.com/apache/spark/pull/5655
[SQL] Break dataTypes.scala into multiple files.
Reynold Xin <rxin@databricks.com>
2015-04-23 14:48:19 -0700
Commit: 6220d93, github.com/apache/spark/pull/5670
[SPARK-7070] [MLLIB] LDA.setBeta should call setTopicConcentration.
Xiangrui Meng <meng@databricks.com>
2015-04-23 14:46:54 -0700
Commit: 1ed46a6, github.com/apache/spark/pull/5649
[SPARK-7087] [BUILD] Fix path issue change version script
Tijo Thomas <tijoparacka@gmail.com>
2015-04-23 17:23:15 -0400
Commit: 6d0749c, github.com/apache/spark/pull/5656
[SPARK-6879] [HISTORYSERVER] check if app is completed before clean it up
WangTaoTheTonic <wangtao111@huawei.com>
2015-04-23 17:20:17 -0400
Commit: baa83a9, github.com/apache/spark/pull/5491
[SPARK-7085][MLlib] Fix miniBatchFraction parameter in train method called with 4 arguments
wizz <wizz@wizz-dev01.kawasaki.flab.fujitsu.com>
2015-04-23 14:00:07 -0700
Commit: 3e91cc2, github.com/apache/spark/pull/5658
[SPARK-7058] Include RDD deserialization time in "task deserialization time" metric
Josh Rosen <joshrosen@databricks.com>
2015-04-23 13:19:03 -0700
Commit: 6afde2c, github.com/apache/spark/pull/5635
[SPARK-7055][SQL]Use correct ClassLoader for JDBC Driver in JDBCRDD.getConnector
Vinod K C <vinod.kc@huawei.com>
2015-04-23 12:00:23 -0700
Commit: c1213e6, github.com/apache/spark/pull/5633
[SPARK-6752][Streaming] Allow StreamingContext to be recreated from checkpoint and existing SparkContext
Tathagata Das <tathagata.das1565@gmail.com>
2015-04-23 11:29:34 -0700
Commit: 534f2a4, github.com/apache/spark/pull/5428
[SPARK-7044] [SQL] Fix the deadlock in script transformation
Cheng Hao <hao.cheng@intel.com>
2015-04-23 10:35:22 -0700
Commit: cc48e63, github.com/apache/spark/pull/5625
[minor][streaming]fixed scala string interpolation error
Prabeesh K <prabeesh.k@namshi.com>
2015-04-23 10:33:13 -0700
Commit: 975f53e, github.com/apache/spark/pull/5653
[HOTFIX] [SQL] Fix compilation for scala 2.11.
Prashant Sharma <prashant.s@imaginea.com>
2015-04-23 16:45:26 +0530
Commit: a7d65d3, github.com/apache/spark/pull/5652
[SPARK-7069][SQL] Rename NativeType -> AtomicType.
Reynold Xin <rxin@databricks.com>
2015-04-23 01:43:40 -0700
Commit: f60bece, github.com/apache/spark/pull/5651
[SPARK-7068][SQL] Remove PrimitiveType
Reynold Xin <rxin@databricks.com>
2015-04-22 23:55:20 -0700
Commit: 29163c5, github.com/apache/spark/pull/5646
[MLlib] Add support for BooleanType to VectorAssembler.
Reynold Xin <rxin@databricks.com>
2015-04-22 23:54:48 -0700
Commit: 2d33323, github.com/apache/spark/pull/5648
[HOTFIX][SQL] Fix broken cached test
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-22 22:18:56 -0700
Commit: d9e70f3, github.com/apache/spark/pull/5640
[SPARK-7046] Remove InputMetrics from BlockResult
Kay Ousterhout <kayousterhout@gmail.com>
2015-04-22 21:42:09 -0700
Commit: 03e85b4, github.com/apache/spark/pull/5627
[SPARK-7066][MLlib] VectorAssembler should use NumericType not NativeType.
Reynold Xin <rxin@databricks.com>
2015-04-22 21:35:42 -0700
Commit: d206860, github.com/apache/spark/pull/5642
[MLlib] UnaryTransformer nullability should not depend on PrimitiveType.
Reynold Xin <rxin@databricks.com>
2015-04-22 21:35:12 -0700
Commit: 1b85e08, github.com/apache/spark/pull/5644
Disable flaky test: ReceiverSuite "block generator throttling".
Reynold Xin <rxin@databricks.com>
2015-04-22 21:24:22 -0700
Commit: b69c4f9
[SPARK-6967] [SQL] fix date type conversion in jdbcrdd
Daoyuan Wang <daoyuan.wang@intel.com>
2015-04-22 19:14:28 -0700
Commit: 04525c0, github.com/apache/spark/pull/5590
[SPARK-6827] [MLLIB] Wrap FPGrowthModel.freqItemsets and make it consistent with Java API
Yanbo Liang <ybliang8@gmail.com>
2015-04-22 17:22:26 -0700
Commit: f4f3998, github.com/apache/spark/pull/5614
[SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.
Reynold Xin <rxin@databricks.com>
2015-04-22 15:26:58 -0700
Commit: baf865d, github.com/apache/spark/pull/5638
[SPARK-7039][SQL]JDBCRDD: Add support on type NVARCHAR
szheng79 <szheng.code@gmail.com>
2015-04-22 13:02:55 -0700
Commit: fbe7106, github.com/apache/spark/pull/5618
[SQL] Rename some apply functions.
Reynold Xin <rxin@databricks.com>
2015-04-22 11:18:01 -0700
Commit: cdf0328, github.com/apache/spark/pull/5624
[SPARK-7052][Core] Add ThreadUtils and move thread methods from Utils to ThreadUtils
zsxwing <zsxwing@gmail.com>
2015-04-22 11:08:59 -0700
Commit: 33b8562, github.com/apache/spark/pull/5631
[SPARK-6889] [DOCS] CONTRIBUTING.md updates to accompany contribution doc updates
Sean Owen <sowen@cloudera.com>
2015-04-21 22:34:31 -0700
Commit: bdc5c16, github.com/apache/spark/pull/5623
[SPARK-6113] [ML] Small cleanups after original tree API PR
Joseph K. Bradley <joseph@databricks.com>
2015-04-21 21:44:44 -0700
Commit: 607eff0, github.com/apache/spark/pull/5567
[MINOR] Comment improvements in ExternalSorter.
Patrick Wendell <patrick@databricks.com>
2015-04-21 21:04:04 -0700
Commit: 70f9f8f, github.com/apache/spark/pull/5620
[SPARK-6490][Docs] Add docs for rpc configurations
zsxwing <zsxwing@gmail.com>
2015-04-21 18:37:53 -0700
Commit: 3a3f710, github.com/apache/spark/pull/5607
[SPARK-1684] [PROJECT INFRA] Merge script should standardize SPARK-XXX prefix
texasmichelle <texasmichelle@gmail.com>
2015-04-21 18:08:29 -0700
Commit: a0761ec, github.com/apache/spark/pull/5149
Closes #5427
Reynold Xin <rxin@databricks.com>
2015-04-21 17:52:52 -0700
Commit: 41ef78a
[SPARK-6953] [PySpark] speed up python tests
Reynold Xin <rxin@databricks.com>, Xiangrui Meng <meng@databricks.com>
2015-04-21 17:49:55 -0700
Commit: 3134c3f, github.com/apache/spark/pull/5605
[SPARK-6014] [core] Revamp Spark shutdown hooks, fix shutdown races.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-21 20:33:57 -0400
Commit: e72c16e, github.com/apache/spark/pull/5560
Avoid warning message about invalid refuse_seconds value in Mesos >=0.21...
mweindel <m.weindel@usu-software.de>
2015-04-21 20:19:33 -0400
Commit: b063a61, github.com/apache/spark/pull/5597
[Minor][MLLIB] Fix a minor formatting bug in toString method in Node.scala
Alain <aihe@usc.edu>
2015-04-21 16:46:17 -0700
Commit: ae036d0, github.com/apache/spark/pull/5621
[SPARK-7036][MLLIB] ALS.train should support DataFrames in PySpark
Xiangrui Meng <meng@databricks.com>
2015-04-21 16:44:52 -0700
Commit: 686dd74, github.com/apache/spark/pull/5619
[SPARK-6065] [MLlib] Optimize word2vec.findSynonyms using blas calls
MechCoder <manojkumarsivaraj334@gmail.com>
2015-04-21 16:42:45 -0700
Commit: 7fe6142, github.com/apache/spark/pull/5467
[minor] [build] Set java options when generating mima ignores.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-21 16:35:37 -0700
Commit: a70e849, github.com/apache/spark/pull/5615
[SPARK-3386] Share and reuse SerializerInstances in shuffle paths
Josh Rosen <joshrosen@databricks.com>
2015-04-21 16:24:15 -0700
Commit: f83c0f1, github.com/apache/spark/pull/5606
[SPARK-5817] [SQL] Fix bug of udtf with column names
Cheng Hao <hao.cheng@intel.com>
2015-04-21 15:11:15 -0700
Commit: 7662ec2, github.com/apache/spark/pull/4602
[SPARK-6996][SQL] Support map types in java beans
Punya Biswal <pbiswal@palantir.com>
2015-04-21 14:50:02 -0700
Commit: 2a24bf9, github.com/apache/spark/pull/5578
[SPARK-6969][SQL] Refresh the cached table when REFRESH TABLE is used
Yin Huai <yhuai@databricks.com>
2015-04-21 14:48:42 -0700
Commit: 6265cba, github.com/apache/spark/pull/5583
[SQL][minor] make it more clear that we only need to re-throw GetField exception for UnresolvedAttribute
Wenchen Fan <cloud0fan@outlook.com>
2015-04-21 14:48:02 -0700
Commit: 03fd921, github.com/apache/spark/pull/5588
[SPARK-6994] Allow to fetch field values by name in sql.Row
vidmantas zemleris <vidmantas@vinted.com>
2015-04-21 14:47:09 -0700
Commit: 2e8c6ca, github.com/apache/spark/pull/5573
[SPARK-7011] Build(compilation) fails with scala 2.11 option, because a protected[sql] type is accessed in ml package.
Prashant Sharma <prashant.s@imaginea.com>
2015-04-21 14:43:46 -0700
Commit: 04bf34e, github.com/apache/spark/pull/5593
[SPARK-6845] [MLlib] [PySpark] Add isTransposed flag to DenseMatrix
MechCoder <manojkumarsivaraj334@gmail.com>
2015-04-21 14:36:50 -0700
Commit: 45c47fa, github.com/apache/spark/pull/5455
SPARK-3276 Added a new configuration spark.streaming.minRememberDuration
emres <emre.sevinc@gmail.com>
2015-04-21 16:39:56 -0400
Commit: c25ca7c, github.com/apache/spark/pull/5438
[SPARK-5360] [SPARK-6606] Eliminate duplicate objects in serialized CoGroupedRDD
Kay Ousterhout <kayousterhout@gmail.com>
2015-04-21 11:01:18 -0700
Commit: c035c0f, github.com/apache/spark/pull/4145
[SPARK-6985][streaming] Receiver maxRate over 1000 causes a StackOverflowError
David McGuire <david.mcguire2@nike.com>
2015-04-21 07:21:10 -0400
Commit: 5fea3e5, github.com/apache/spark/pull/5559
[SPARK-5990] [MLLIB] Model import/export for IsotonicRegression
Yanbo Liang <ybliang8@gmail.com>
2015-04-21 00:14:16 -0700
Commit: 1f2f723, github.com/apache/spark/pull/5270
[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression
Davies Liu <davies@databricks.com>
2015-04-21 00:08:18 -0700
Commit: ab9128f, github.com/apache/spark/pull/5570
[SPARK-6490][Core] Add spark.rpc.* and deprecate spark.akka.*
zsxwing <zsxwing@gmail.com>
2015-04-20 23:18:42 -0700
Commit: 8136810, github.com/apache/spark/pull/5595
[SPARK-6635][SQL] DataFrame.withColumn should replace columns with identical column names
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-20 18:54:01 -0700
Commit: c736220, github.com/apache/spark/pull/5541
[SPARK-6368][SQL] Build a specialized serializer for Exchange operator.
Yin Huai <yhuai@databricks.com>
2015-04-20 18:42:50 -0700
Commit: ce7ddab, github.com/apache/spark/pull/5497
[doc][streaming] Fixed broken link in mllib section
BenFradet <benjamin.fradet@gmail.com>
2015-04-20 13:46:55 -0700
Commit: 517bdf3, github.com/apache/spark/pull/5600
fixed doc
Eric Chiang <eric.chiang.m@gmail.com>
2015-04-20 13:11:21 -0700
Commit: 97fda73, github.com/apache/spark/pull/5599
[Minor][MLlib] Incorrect path to test data is used in DecisionTreeExample
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-20 10:47:37 -0700
Commit: 1ebceaa, github.com/apache/spark/pull/5594
[SPARK-6661] Python type errors should print type, not object
Elisey Zanko <elisey.zanko@gmail.com>
2015-04-20 10:44:09 -0700
Commit: 7717661, github.com/apache/spark/pull/5361
[SPARK-7003] Improve reliability of connection failure detection between Netty block transfer service endpoints
Aaron Davidson <aaron@databricks.com>
2015-04-20 09:54:21 -0700
Commit: 968ad97, github.com/apache/spark/pull/5584
[SPARK-5924] Add the ability to specify withMean or withStd parameters with StandardScaler
jrabary <Jaonary@gmail.com>
2015-04-20 09:47:56 -0700
Commit: 1be2070, github.com/apache/spark/pull/4704
[doc][mllib] Fix typo of the page title in Isotonic regression documents
dobashim <dobashim@oss.nttdata.co.jp>
2015-04-20 00:03:23 -0400
Commit: 6fe690d, github.com/apache/spark/pull/5581
[SPARK-6979][Streaming] Replace JobScheduler.eventActor and JobGenerator.eventActor with EventLoop
zsxwing <zsxwing@gmail.com>
2015-04-19 20:48:36 -0700
Commit: c776ee8, github.com/apache/spark/pull/5554
[SPARK-6983][Streaming] Update ReceiverTrackerActor to use the new Rpc interface
zsxwing <zsxwing@gmail.com>
2015-04-19 20:35:43 -0700
Commit: d8e1b7b, github.com/apache/spark/pull/5557
[SPARK-6998][MLlib] Make StreamingKMeans 'Serializable'
zsxwing <zsxwing@gmail.com>
2015-04-19 20:33:51 -0700
Commit: fa73da0, github.com/apache/spark/pull/5582
[SPARK-6963][CORE]Flaky test: o.a.s.ContextCleanerSuite automatically cleanup checkpoint
GuoQiang Li <witgo@qq.com>
2015-04-19 09:37:09 +0100
Commit: 0424da6, github.com/apache/spark/pull/5548
SPARK-6993 : Add default min, max methods for JavaDoubleRDD
Olivier Girardot <o.girardot@lateral-thoughts.com>
2015-04-18 18:21:44 -0700
Commit: 8fbd45c, github.com/apache/spark/pull/5571
Fixed doc
Gaurav Nanda <gaurav324@gmail.com>
2015-04-18 17:20:46 -0700
Commit: 729885e, github.com/apache/spark/pull/5576
[SPARK-6219] Reuse pep8.py
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-04-18 16:46:28 -0700
Commit: 28683b4, github.com/apache/spark/pull/5561
[core] [minor] Make sure ConnectionManager stops.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-18 10:14:56 +0100
Commit: 327ebf0, github.com/apache/spark/pull/5566
SPARK-6992 : Fix documentation example for Spark SQL on StructType
Olivier Girardot <o.girardot@lateral-thoughts.com>
2015-04-18 00:31:01 -0700
Commit: 5f095d5, github.com/apache/spark/pull/5569
[SPARK-6975][Yarn] Fix argument validation error
jerryshao <saisai.shao@intel.com>
2015-04-17 19:17:06 -0700
Commit: d850b4b, github.com/apache/spark/pull/5551
[SPARK-5933] [core] Move config deprecation warnings to SparkConf.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-17 19:02:07 -0700
Commit: 1991337, github.com/apache/spark/pull/5562
[SPARK-6350][Mesos] Make mesosExecutorCores configurable in mesos "fine-grained" mode
Jongyoul Lee <jongyoul@gmail.com>
2015-04-17 18:30:55 -0700
Commit: 6fbeb82, github.com/apache/spark/pull/5063
[SPARK-6703][Core] Provide a way to discover existing SparkContext's
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-04-17 18:28:42 -0700
Commit: c5ed510, github.com/apache/spark/pull/5501
Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.
Reynold Xin <rxin@databricks.com>
2015-04-17 16:30:13 -0500
Commit: a452c59, github.com/apache/spark/pull/5558
SPARK-6988 : Fix documentation regarding DataFrames using the Java API
Olivier Girardot <o.girardot@lateral-thoughts.com>
2015-04-17 16:23:10 -0500
Commit: d305e68, github.com/apache/spark/pull/5564
[SPARK-6807] [SparkR] Merge recent SparkR-pkg changes
cafreeman <cfreeman@alteryx.com>, Davies Liu <davies@databricks.com>, Zongheng Yang <zongheng.y@gmail.com>, Shivaram Venkataraman <shivaram.venkataraman@gmail.com>, Shivaram Venkataraman <shivaram@cs.berkeley.edu>, Sun Rui <rui.sun@intel.com>
2015-04-17 13:42:19 -0700
Commit: 59e206d, github.com/apache/spark/pull/5436
[SPARK-6113] [ml] Stabilize DecisionTree API
Joseph K. Bradley <joseph@databricks.com>
2015-04-17 13:15:36 -0700
Commit: a83571a, github.com/apache/spark/pull/5530
[SPARK-2669] [yarn] Distribute client configuration to AM.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-17 14:21:51 -0500
Commit: 50ab8a6, github.com/apache/spark/pull/4142
[SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas
Davies Liu <davies@databricks.com>
2015-04-17 11:29:27 -0500
Commit: c84d916, github.com/apache/spark/pull/5544
[SPARK-6604][PySpark]Specify ip of python server socket
linweizhong <linweizhong@huawei.com>
2015-04-17 12:04:02 +0100
Commit: dc48ba9, github.com/apache/spark/pull/5256
[SPARK-6952] Handle long args when detecting PID reuse
Punya Biswal <pbiswal@palantir.com>
2015-04-17 11:08:37 +0100
Commit: f6a9a57, github.com/apache/spark/pull/5535
[SPARK-6046] [core] Reorganize deprecated config support in SparkConf.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-17 11:06:01 +0100
Commit: 4527761, github.com/apache/spark/pull/5514
SPARK-6846 [WEBUI] Stage kill URL easy to accidentally trigger and possibility for security issue
Sean Owen <sowen@cloudera.com>
2015-04-17 11:02:31 +0100
Commit: f7a2564, github.com/apache/spark/pull/5528
[SPARK-6972][SQL] Add Coalesce to DataFrame
Michael Armbrust <michael@databricks.com>
2015-04-16 21:49:26 -0500
Commit: 8220d52, github.com/apache/spark/pull/5545
[SPARK-6966][SQL] Use correct ClassLoader for JDBC Driver
Michael Armbrust <michael@databricks.com>
2015-04-16 17:59:49 -0700
Commit: e5949c2, github.com/apache/spark/pull/5543
[SPARK-6899][SQL] Fix type mismatch when using codegen with Average on DecimalType
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-16 17:50:20 -0700
Commit: 1e43851, github.com/apache/spark/pull/5517
[SQL][Minor] Fix foreachUp of treenode
scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2015-04-16 17:35:51 -0700
Commit: d966086, github.com/apache/spark/pull/5518
[SPARK-6911] [SQL] improve accessor for nested types
Davies Liu <davies@databricks.com>
2015-04-16 17:33:57 -0700
Commit: 6183b5e, github.com/apache/spark/pull/5513
SPARK-6927 [SQL] Sorting Error when codegen on
云峤 <chensong.cs@alibaba-inc.com>
2015-04-16 17:32:42 -0700
Commit: 5fe4343, github.com/apache/spark/pull/5524
[SPARK-4897] [PySpark] Python 3 support
Davies Liu <davies@databricks.com>, twneale <twneale@gmail.com>, Josh Rosen <joshrosen@databricks.com>
2015-04-16 16:20:57 -0700
Commit: 04e44b3, github.com/apache/spark/pull/5173
[SPARK-6855] [SPARKR] Set R includes to get the right collate order.
Shivaram Venkataraman <shivaram@cs.berkeley.edu>
2015-04-16 13:06:34 -0700
Commit: 55f553a, github.com/apache/spark/pull/5462
[SPARK-6934][Core] Use 'spark.akka.askTimeout' for the ask timeout
zsxwing <zsxwing@gmail.com>
2015-04-16 13:45:55 -0500
Commit: ef3fb80, github.com/apache/spark/pull/5529
[SPARK-6694][SQL]SparkSQL CLI must be able to specify an option --database on the command line.
Jin Adachi <adachij2002@yahoo.co.jp>, adachij <adachij@nttdata.co.jp>
2015-04-16 23:41:04 +0800
Commit: 3ae37b9, github.com/apache/spark/pull/5345
[SPARK-4194] [core] Make SparkContext initialization exception-safe.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-16 10:48:31 +0100
Commit: de4fa6b, github.com/apache/spark/pull/5335
SPARK-4783 [CORE] System.exit() calls in SparkContext disrupt applications embedding Spark
Sean Owen <sowen@cloudera.com>
2015-04-16 10:45:32 +0100
Commit: 6179a94, github.com/apache/spark/pull/5492
[Streaming][minor] Remove additional quote and unneeded imports
jerryshao <saisai.shao@intel.com>
2015-04-16 10:39:02 +0100
Commit: 8370550, github.com/apache/spark/pull/5540
[SPARK-6893][ML] default pipeline parameter handling in python
Xiangrui Meng <meng@databricks.com>
2015-04-15 23:49:42 -0700
Commit: 57cd1e8, github.com/apache/spark/pull/5534
SPARK-6938: All require statements now have an informative error message.
Juliet Hougland <juliet@cloudera.com>
2015-04-15 21:52:25 -0700
Commit: 52c3439, github.com/apache/spark/pull/5532
[SPARK-5277][SQL] - SparkSqlSerializer doesn't always register user specified KryoRegistrators
Max Seiden <max@platfora.com>
2015-04-15 16:15:11 -0700
Commit: 8a53de1, github.com/apache/spark/pull/5237
[SPARK-2312] Logging Unhandled messages
Isaias Barroso <isaias.barroso@gmail.com>
2015-04-15 22:40:52 +0100
Commit: d5f1b96, github.com/apache/spark/pull/2055
[SPARK-2213] [SQL] sort merge join for spark sql
Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
2015-04-15 14:06:10 -0700
Commit: 585638e, github.com/apache/spark/pull/5208
[SPARK-6898][SQL] completely support special chars in column names
Wenchen Fan <cloud0fan@outlook.com>
2015-04-15 13:39:12 -0700
Commit: 4754e16, github.com/apache/spark/pull/5511
[SPARK-6937][MLLIB] Fixed bug in PICExample in which the radius were not being accepted on c...
sboeschhuawei <stephen.boesch@huawei.com>
2015-04-15 13:28:10 -0700
Commit: 557a797, github.com/apache/spark/pull/5531
[SPARK-6844][SQL] Clean up accumulators used in InMemoryRelation when it is uncached
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-15 13:15:58 -0700
Commit: cf38fe0, github.com/apache/spark/pull/5475
[SPARK-6638] [SQL] Improve performance of StringType in SQL
Davies Liu <davies@databricks.com>
2015-04-15 13:06:38 -0700
Commit: 8584276, github.com/apache/spark/pull/5350
[SPARK-6887][SQL] ColumnBuilder misses FloatType
Yin Huai <yhuai@databricks.com>
2015-04-15 13:04:03 -0700
Commit: 785f955, github.com/apache/spark/pull/5499
[SPARK-6800][SQL] Update doc for JDBCRelation's columnPartition
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-15 13:01:29 -0700
Commit: e3e4e9a, github.com/apache/spark/pull/5488
[SPARK-6730][SQL] Allow using keyword as identifier in OPTIONS
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-15 13:00:19 -0700
Commit: b75b307, github.com/apache/spark/pull/5520
[SPARK-6886] [PySpark] fix big closure with shuffle
Davies Liu <davies@databricks.com>
2015-04-15 12:58:02 -0700
Commit: f11288d, github.com/apache/spark/pull/5496
SPARK-6861 [BUILD] Scalastyle config prevents building Maven child modules alone
Sean Owen <sowen@cloudera.com>
2015-04-15 15:17:58 +0100
Commit: 6c5ed8a, github.com/apache/spark/pull/5471
[HOTFIX] [SPARK-6896] [SQL] fix compile error in hive-thriftserver
Daoyuan Wang <daoyuan.wang@intel.com>
2015-04-15 10:23:53 +0100
Commit: 29aabdd, github.com/apache/spark/pull/5507
[SPARK-6871][SQL] WITH clause in CTE cannot follow another WITH clause
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-14 23:47:16 -0700
Commit: 6be9189, github.com/apache/spark/pull/5480
[SPARK-5634] [core] Show correct message in HS when no incomplete apps f...
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-14 18:52:48 -0700
Commit: 30a6e0d, github.com/apache/spark/pull/5515
[SPARK-6890] [core] Fix launcher lib work with SPARK_PREPEND_CLASSES.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-14 18:51:39 -0700
Commit: 9717389, github.com/apache/spark/pull/5504
[SPARK-6796][Streaming][WebUI] Add "Active Batches" and "Completed Batches" lists to StreamingPage
zsxwing <zsxwing@gmail.com>
2015-04-14 16:51:36 -0700
Commit: 6de282e, github.com/apache/spark/pull/5434
Revert "[SPARK-6352] [SQL] Add DirectParquetOutputCommitter"
Josh Rosen <joshrosen@databricks.com>
2015-04-14 14:07:25 -0700
Commit: a76b921
[SPARK-6769][YARN][TEST] Usage of the ListenerBus in YarnClusterSuite is wrong
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-04-14 14:00:49 -0700
Commit: 4d4b249, github.com/apache/spark/pull/5417
[SPARK-5808] [build] Package pyspark files in sbt assembly.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-14 13:41:38 -0700
Commit: 6577437, github.com/apache/spark/pull/5461
[SPARK-6905] Upgrade to snappy-java 1.1.1.7
Josh Rosen <joshrosen@databricks.com>
2015-04-14 13:40:07 -0700
Commit: 6adb8bc, github.com/apache/spark/pull/5512
[SPARK-6700] [yarn] Re-enable flaky test.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-14 13:34:44 -0700
Commit: b075e4b, github.com/apache/spark/pull/5459
SPARK-1706: Allow multiple executors per worker in Standalone mode
CodingCat <zhunansjtu@gmail.com>
2015-04-14 13:32:06 -0700
Commit: 8f8dc45, github.com/apache/spark/pull/731
[SPARK-2033] Automatically cleanup checkpoint
GuoQiang Li <witgo@qq.com>
2015-04-14 12:56:47 -0700
Commit: 25998e4, github.com/apache/spark/pull/855
[CORE] SPARK-6880: Fixed null check when all the dependent stages are cancelled due to previous stage failure
pankaj arora <pankaj.arora@guavus.com>
2015-04-14 12:06:46 -0700
Commit: dcf8a9f, github.com/apache/spark/pull/5494
[SPARK-6894]spark.executor.extraLibraryOptions => spark.executor.extraLibraryPath
WangTaoTheTonic <wangtao111@huawei.com>
2015-04-14 12:02:11 -0700
Commit: f63b44a, github.com/apache/spark/pull/5506
[SPARK-6081] Support fetching http/https uris in driver runner.
Timothy Chen <tnachen@gmail.com>
2015-04-14 11:48:12 -0700
Commit: 320bca4, github.com/apache/spark/pull/4832
SPARK-6878 [CORE] Fix for sum on empty RDD fails with exception
Erik van Oosten <evanoosten@ebay.com>
2015-04-14 12:39:56 +0100
Commit: 51b306b, github.com/apache/spark/pull/5489
[SPARK-6731] Bump version of apache commons-math3
Punyashloka Biswal <punya.biswal@gmail.com>
2015-04-14 11:43:06 +0100
Commit: 628a72f, github.com/apache/spark/pull/5380
[WIP][HOTFIX][SPARK-4123]: Fix bug in PR dependency (all deps. removed issue)
Brennon York <brennon.york@capitalone.com>
2015-04-13 22:31:44 -0700
Commit: 77eeb10, github.com/apache/spark/pull/5443
[SPARK-5957][ML] better handling of parameters
Xiangrui Meng <meng@databricks.com>
2015-04-13 21:18:05 -0700
Commit: 971b95b, github.com/apache/spark/pull/5431
[Minor][SparkR] Minor refactor and removes redundancy related to cleanClosure.
hlin09 <hlin09pu@gmail.com>
2015-04-13 20:43:24 -0700
Commit: 0ba3fdd, github.com/apache/spark/pull/5495
[SPARK-5794] [SQL] fix add jar
Daoyuan Wang <daoyuan.wang@intel.com>
2015-04-13 18:26:00 -0700
Commit: b45059d, github.com/apache/spark/pull/4586
[SQL] [Minor] Fix for SqlApp.scala
Fei Wang <wangfei1@huawei.com>
2015-04-13 18:23:35 -0700
Commit: 3782e1f, github.com/apache/spark/pull/5485
[Spark-4848] Allow different Worker configurations in standalone cluster
Nathan Kronenfeld <nkronenfeld@oculusinfo.com>
2015-04-13 18:21:16 -0700
Commit: 435b877, github.com/apache/spark/pull/5140
[SPARK-6877][SQL] Add code generation support for Min
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-13 18:16:33 -0700
Commit: 4898dfa, github.com/apache/spark/pull/5487
[SPARK-6303][SQL] Remove unnecessary Average in GeneratedAggregate
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-13 18:15:29 -0700
Commit: 5b8b324, github.com/apache/spark/pull/4996
[SPARK-6881][SparkR] Changes the checkpoint directory name.
hlin09 <hlin09pu@gmail.com>
2015-04-13 16:53:50 -0700
Commit: d7f2c19, github.com/apache/spark/pull/5493
[SPARK-5931][CORE] Use consistent naming for time properties
Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
2015-04-13 16:28:07 -0700
Commit: c4ab255, github.com/apache/spark/pull/5236
[SPARK-5941] [SQL] Unit Test loads the table `src` twice for leftsemijoin.q
Cheng Hao <hao.cheng@intel.com>
2015-04-13 16:02:18 -0700
Commit: c5602bd, github.com/apache/spark/pull/4506
[SPARK-6872] [SQL] add copy in external sort
Daoyuan Wang <daoyuan.wang@intel.com>
2015-04-13 16:00:58 -0700
Commit: e63a86a, github.com/apache/spark/pull/5481
[SPARK-5972] [MLlib] Cache residuals and gradient in GBT during training and validation
MechCoder <manojkumarsivaraj334@gmail.com>
2015-04-13 15:36:33 -0700
Commit: 2a55cb4, github.com/apache/spark/pull/5330
[SQL][SPARK-6742]: Don't push down predicates which reference partition column(s)
Yash Datta <Yash.Datta@guavus.com>
2015-04-13 14:43:07 -0700
Commit: 3a205bb, github.com/apache/spark/pull/5390
[SPARK-6130] [SQL] support if not exists for insert overwrite into partition in hiveQl
Daoyuan Wang <daoyuan.wang@intel.com>
2015-04-13 14:29:07 -0700
Commit: 85ee0ca, github.com/apache/spark/pull/4865
[SPARK-5988][MLlib] add save/load for PowerIterationClusteringModel
Xusen Yin <yinxusen@gmail.com>
2015-04-13 11:53:17 -0700
Commit: 1e340c3, github.com/apache/spark/pull/5450
[SPARK-6662][YARN] Allow variable substitution in spark.yarn.historyServer.address
Cheolsoo Park <cheolsoop@netflix.com>
2015-04-13 13:45:10 -0500
Commit: 6cc5b3e, github.com/apache/spark/pull/5321
[SPARK-6765] Enable scalastyle on test code.
Reynold Xin <rxin@databricks.com>
2015-04-13 09:29:04 -0700
Commit: c5b0b29, github.com/apache/spark/pull/5486
[SPARK-6207] [YARN] [SQL] Adds delegation tokens for metastore to conf.
Doug Balog <doug.balogtarget.com>, Doug Balog <doug.balog@target.com>
2015-04-13 09:49:58 -0500
Commit: 77620be, github.com/apache/spark/pull/5031
[SPARK-6352] [SQL] Add DirectParquetOutputCommitter
Pei-Lun Lee <pllee@appier.com>
2015-04-13 21:52:00 +0800
Commit: b29663e, github.com/apache/spark/pull/5042
[SPARK-6870][Yarn] Catch InterruptedException when yarn application state monitor thread is interrupted
linweizhong <linweizhong@huawei.com>
2015-04-13 13:06:54 +0100
Commit: 202ebf0, github.com/apache/spark/pull/5479
[SPARK-6671] Add status command for spark daemons
Pradeep Chanumolu <pchanumolu@maprtech.com>
2015-04-13 13:02:55 +0100
Commit: 240ea03, github.com/apache/spark/pull/5327
[SPARK-6440][CORE]Handle IPv6 addresses properly when constructing URI
nyaapa <nyaapa@gmail.com>
2015-04-13 12:55:25 +0100
Commit: 9d117ce, github.com/apache/spark/pull/5424
[SPARK-6860][Streaming][WebUI] Fix the possible inconsistency of StreamingPage
zsxwing <zsxwing@gmail.com>
2015-04-13 12:21:29 +0100
Commit: 14ce3ea, github.com/apache/spark/pull/5470
[SPARK-6762]Fix potential resource leaks in CheckPoint CheckpointWriter and CheckpointReader
lisurprise <zhichao.li@intel.com>
2015-04-13 12:18:05 +0100
Commit: cadd7d7, github.com/apache/spark/pull/5407
[SPARK-6868][YARN] Fix broken container log link on executor page when HTTPS_ONLY.
Dean Chen <deanchen5@gmail.com>
2015-04-13 12:08:55 +0100
Commit: 950645d, github.com/apache/spark/pull/5477
[SPARK-6562][SQL] DataFrame.replace
Reynold Xin <rxin@databricks.com>
2015-04-12 22:56:12 -0700
Commit: 68d1faa, github.com/apache/spark/pull/5282
[SPARK-5885][MLLIB] Add VectorAssembler as a feature transformer
Xiangrui Meng <meng@databricks.com>
2015-04-12 22:42:01 -0700
Commit: 9294044, github.com/apache/spark/pull/5196
[SPARK-5886][ML] Add StringIndexer as a feature transformer
Xiangrui Meng <meng@databricks.com>
2015-04-12 22:41:05 -0700
Commit: 685ddcf, github.com/apache/spark/pull/4735
[SPARK-4081] [mllib] VectorIndexer
Joseph K. Bradley <joseph@databricks.com>
2015-04-12 22:38:27 -0700
Commit: d3792f5, github.com/apache/spark/pull/3000
[SPARK-6643][MLLIB] Implement StandardScalerModel missing methods
lewuathe <lewuathe@me.com>
2015-04-12 22:17:16 -0700
Commit: fc17661, github.com/apache/spark/pull/5310
[SPARK-6765] Fix test code style for core.
Reynold Xin <rxin@databricks.com>
2015-04-12 20:50:49 -0700
Commit: a1fe59d, github.com/apache/spark/pull/5484
[MINOR] a typo: coalesce
Daoyuan Wang <daoyuan.wang@intel.com>
2015-04-12 18:58:53 +0100
Commit: 04bcd67, github.com/apache/spark/pull/5482
[SPARK-6431][Streaming][Kafka] Error message for partition metadata requ...
cody koeninger <cody@koeninger.org>
2015-04-12 17:37:30 +0100
Commit: 6ac8eea, github.com/apache/spark/pull/5454
[SPARK-6843][core]Add volatile for the "state"
lisurprise <zhichao.li@intel.com>
2015-04-12 13:41:44 +0100
Commit: ddc1743, github.com/apache/spark/pull/5448
[SPARK-6866][Build] Remove duplicated dependency in launcher/pom.xml
Guancheng (G.C.) Chen <chenguancheng@gmail.com>
2015-04-12 11:36:41 +0100
Commit: e9445b1, github.com/apache/spark/pull/5476
[SPARK-6677] [SQL] [PySpark] fix cached classes
Davies Liu <davies@databricks.com>
2015-04-11 22:33:23 -0700
Commit: 5d8f7b9, github.com/apache/spark/pull/5445
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-04-11 22:12:56 -0700
Commit: 0cc8fcb, github.com/apache/spark/pull/4994
SPARK-6710 GraphX Fixed Wrong initial bias in GraphX SVDPlusPlus
Michael Malak <michaelmalak@yahoo.com>
2015-04-11 21:01:23 -0700
Commit: 1205f7e, github.com/apache/spark/pull/5464
[HOTFIX] Add explicit return types to fix lint errors
Josh Rosen <joshrosen@databricks.com>
2015-04-11 20:12:40 -0700
Commit: dea5dac
[SQL][minor] move `resolveGetField` into an object
Wenchen Fan <cloud0fan@outlook.com>
2015-04-11 19:35:56 -0700
Commit: 5c2844c, github.com/apache/spark/pull/5435
[SPARK-6367][SQL] Use the proper data type for those expressions that are hijacking existing data types.
Yin Huai <yhuai@databricks.com>
2015-04-11 19:26:15 -0700
Commit: 6d4e854, github.com/apache/spark/pull/5094
[SQL] Handle special characters in the authority of a Path's URI.
Yin Huai <yhuai@databricks.com>
2015-04-11 18:44:54 -0700
Commit: d2383fb, github.com/apache/spark/pull/5381
[SPARK-6379][SQL] Support a function to call user-defined functions registered in SQLContext
Takeshi YAMAMURO <linguin.m.s@gmail.com>
2015-04-11 18:41:12 -0700
Commit: 352a5da, github.com/apache/spark/pull/5061
[SPARK-6179][SQL] Add token for "SHOW PRINCIPALS role_name" and "SHOW TRANSACTIONS" and "SHOW COMPACTIONS"
DoingDone9 <799203320@qq.com>, Zhongshuai Pei <799203320@qq.com>, Xu Tingjun <xutingjun@huawei.com>
2015-04-11 18:34:17 -0700
Commit: 48cc840, github.com/apache/spark/pull/4902
[Spark-5068][SQL]Fix bug query data when path doesn't exist for HiveContext
lazymam500 <lazyman500@gmail.com>, lazyman <lazyman500@gmail.com>
2015-04-11 18:33:14 -0700
Commit: 1f39a61, github.com/apache/spark/pull/5059
[SPARK-6199] [SQL] Support CTE in HiveContext and SQLContext
haiyang <huhaiyang@huawei.com>
2015-04-11 18:30:17 -0700
Commit: 2f53588, github.com/apache/spark/pull/4929
[Minor][SQL] Fix typo in sql
Guancheng (G.C.) Chen <chenguancheng@gmail.com>
2015-04-11 15:43:12 -0700
Commit: 7dbd371, github.com/apache/spark/pull/5474
[SPARK-6863] Fix formatting on SQL programming guide.
Santiago M. Mola <santiago.mola@sap.com>
2015-04-11 15:42:03 -0700
Commit: 6437e7c, github.com/apache/spark/pull/5472
[SPARK-6611][SQL] Add support for INTEGER as synonym of INT.
Santiago M. Mola <santiago.mola@sap.com>
2015-04-11 14:52:49 -0700
Commit: 5f7b7cd, github.com/apache/spark/pull/5271
[SPARK-6858][SQL] Register Java HashMap for SparkSqlSerializer
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-11 14:50:50 -0700
Commit: 198cf2a, github.com/apache/spark/pull/5465
[SPARK-6835] [SQL] Fix bug of Hive UDTF in Lateral View (ClassNotFound)
Cheng Hao <hao.cheng@intel.com>
2015-04-11 22:11:03 +0800
Commit: 3ceb810, github.com/apache/spark/pull/5444
[hotfix] [build] Make sure JAVA_HOME is set for tests.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-11 13:10:01 +0100
Commit: 694aef0, github.com/apache/spark/pull/5441
[Minor][Core] Fix typo
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-11 13:07:41 +0100
Commit: 95a0759, github.com/apache/spark/pull/5466
[SQL] [SPARK-6620] Speed up toDF() and rdd() functions by constructing converters in ScalaReflection
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-04-10 16:27:56 -0700
Commit: 67d0688, github.com/apache/spark/pull/5279
[SPARK-6851][SQL] Create new instance for each converted parquet relation
Michael Armbrust <michael@databricks.com>
2015-04-10 16:05:14 -0700
Commit: 23d5f88, github.com/apache/spark/pull/5458
[SPARK-6850] [SparkR] use one partition when we need to compare the whole result
Davies Liu <davies@databricks.com>
2015-04-10 15:35:45 -0700
Commit: 68ecdb7, github.com/apache/spark/pull/5460
[SPARK-6216] [PySpark] check the python version in worker
Davies Liu <davies@databricks.com>
2015-04-10 14:04:53 -0700
Commit: 4740d6a, github.com/apache/spark/pull/5404
[SPARK-5969][PySpark] Fix descending pyspark.rdd.sortByKey.
Milan Straka <fox@ucw.cz>
2015-04-10 13:50:32 -0700
Commit: 0375134, github.com/apache/spark/pull/4761
[SQL] [SPARK-6794] Use kryo-based SparkSqlSerializer for GeneralHashedRelation
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-04-10 12:09:54 -0700
Commit: b9baa4c, github.com/apache/spark/pull/5433
[SPARK-6773][Tests] Fix RAT checks still passing when downloading the rat jar failed
June.He <jun.hejun@huawei.com>
2015-04-10 20:02:35 +0100
Commit: 9f5ed99, github.com/apache/spark/pull/5421
[SPARK-6766][Streaming] Fix issue about StreamingListenerBatchSubmitted and StreamingListenerBatchStarted
zsxwing <zsxwing@gmail.com>
2015-04-10 01:51:42 -0700
Commit: 18ca089, github.com/apache/spark/pull/5414
[SPARK-6211][Streaming] Add Python Kafka API unit test
jerryshao <saisai.shao@intel.com>, Saisai Shao <saisai.shao@intel.com>
2015-04-09 23:14:24 -0700
Commit: 3290d2d, github.com/apache/spark/pull/4961
[SPARK-6577] [MLlib] [PySpark] SparseMatrix should be supported in PySpark
MechCoder <manojkumarsivaraj334@gmail.com>
2015-04-09 23:10:13 -0700
Commit: e236081, github.com/apache/spark/pull/5355
[SPARK-3074] [PySpark] support groupByKey() with single huge key
Davies Liu <davies.liu@gmail.com>, Davies Liu <davies@databricks.com>
2015-04-09 17:07:23 -0700
Commit: b5c51c8, github.com/apache/spark/pull/1977
[Spark-6693][MLlib]add tostring with max lines and width for matrix
Yuhao Yang <hhbyyh@gmail.com>
2015-04-09 15:37:45 -0700
Commit: 9c67049, github.com/apache/spark/pull/5344
[SPARK-6264] [MLLIB] Support FPGrowth algorithm in Python API
Yanbo Liang <ybliang8@gmail.com>
2015-04-09 15:10:10 -0700
Commit: a0411ae, github.com/apache/spark/pull/5213
[SPARK-6758]block the right jetty package in log
WangTaoTheTonic <wangtao111@huawei.com>
2015-04-09 17:44:08 -0400
Commit: 7d92db3, github.com/apache/spark/pull/5406
[minor] [examples] Avoid packaging duplicate classes.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-09 07:07:50 -0400
Commit: 470d745, github.com/apache/spark/pull/5379
SPARK-4924 addendum. Minor assembly directory fix in load-spark-env-sh
raschild <raschild@users.noreply.github.com>
2015-04-09 07:04:18 -0400
Commit: 53f6bb1, github.com/apache/spark/pull/5261
[SPARK-6343] Doc driver-worker network reqs
Peter Parente <pparent@us.ibm.com>
2015-04-09 06:37:20 -0400
Commit: b9c51c0, github.com/apache/spark/pull/5382
[SPARK-5654] Integrate SparkR
Shivaram Venkataraman <shivaram@cs.berkeley.edu>, Shivaram Venkataraman <shivaram.venkataraman@gmail.com>, Zongheng Yang <zongheng.y@gmail.com>, cafreeman <cfreeman@alteryx.com>, Shivaram Venkataraman <shivaram@eecs.berkeley.edu>, Davies Liu <davies@databricks.com>, Davies Liu <davies.liu@gmail.com>, hlin09 <hlin09pu@gmail.com>, Sun Rui <rui.sun@intel.com>, lythesia <iranaikimi@gmail.com>, oscaroboto <oscarjr@gmail.com>, Antonio Piccolboni <antonio@piccolboni.info>, root <edward>, edwardt <edwardt.tril@gmail.com>, hqzizania <qian.huang@intel.com>, dputler <dan.putler@gmail.com>, Todd Gao <todd.gao.2013@gmail.com>, Chris Freeman <cfreeman@alteryx.com>, Felix Cheung <fcheung@AVVOMAC-119.local>, Hossein <hossein@databricks.com>, Evert Lammerts <evert@apache.org>, Felix Cheung <fcheung@avvomac-119.t-mobile.com>, felixcheung <felixcheung_m@hotmail.com>, Ryan Hafen <rhafen@gmail.com>, Ashutosh Raina <ashutoshraina@users.noreply.github.com>, Oscar Olmedo <oscarjr@gmail.com>, Josh Rosen <rosenville@gmail.com>, Yi Lu <iranaikimi@gmail.com>, Harihar Nahak <hnahak87@users.noreply.github.com>
2015-04-08 22:45:40 -0700
Commit: 2fe0a1a, github.com/apache/spark/pull/5096
[SPARK-6765] Fix test code style for SQL
Reynold Xin <rxin@databricks.com>
2015-04-08 20:35:29 -0700
Commit: 1b2aab8, github.com/apache/spark/pull/5412
[SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySpark
Cheng Lian <lian@databricks.com>
2015-04-08 18:47:39 -0700
Commit: 891ada5, github.com/apache/spark/pull/5349
[SPARK-6451][SQL] supported code generation for CombineSum
Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-04-08 18:42:34 -0700
Commit: 7d7384c, github.com/apache/spark/pull/5138
[SQL][minor] remove duplicated resolveGetField and update comment
Wenchen Fan <cloud0fan@outlook.com>
2015-04-08 13:57:01 -0700
Commit: 9418280, github.com/apache/spark/pull/5304
[SPARK-4346][SPARK-3596][YARN] Commonize the monitor logic
unknown <l00251599@HGHY1L002515991.china.huawei.com>, Sephiroth-Lin <linwzhong@gmail.com>
2015-04-08 13:56:42 -0700
Commit: 55a92ef, github.com/apache/spark/pull/5305
[SPARK-5242]: Add --private-ips flag to EC2 script
Michelangelo D'Agostino <mdagostino@civisanalytics.com>
2015-04-08 16:48:45 -0400
Commit: 86403f5, github.com/apache/spark/pull/5244
[SPARK-6767][SQL] Fixed Query DSL error in spark sql Readme
Tijo Thomas <tijoparacka@gmail.com>
2015-04-08 13:42:29 -0700
Commit: 2f482d7, github.com/apache/spark/pull/5415
[SPARK-6781] [SQL] use sqlContext in python shell
Davies Liu <davies@databricks.com>
2015-04-08 13:31:45 -0700
Commit: 6ada4f6, github.com/apache/spark/pull/5425
[SPARK-6765] Fix test code style for mllib.
Reynold Xin <rxin@databricks.com>
2015-04-08 11:32:44 -0700
Commit: 66159c3, github.com/apache/spark/pull/5411
[SPARK-6765] Fix test code style for graphx.
Reynold Xin <rxin@databricks.com>
2015-04-08 11:31:48 -0700
Commit: 8d812f9, github.com/apache/spark/pull/5410
[SPARK-6753] Clone SparkConf in ShuffleSuite tests
Kay Ousterhout <kayousterhout@gmail.com>
2015-04-08 10:26:45 -0700
Commit: 9d44ddc, github.com/apache/spark/pull/5401
[SPARK-6506] [pyspark] Do not try to retrieve SPARK_HOME when not needed...
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-08 10:14:52 -0700
Commit: f7e21dd, github.com/apache/spark/pull/5405
[SPARK-6765] Fix test code style for streaming.
Reynold Xin <rxin@databricks.com>
2015-04-08 00:24:59 -0700
Commit: 15e0d2b, github.com/apache/spark/pull/5409
[SPARK-6754] Remove unnecessary TaskContextHelper
Kay Ousterhout <kayousterhout@gmail.com>
2015-04-07 22:40:42 -0700
Commit: 8d2a36c, github.com/apache/spark/pull/5402
[SPARK-6705][MLLIB] Add fit intercept api to ml logisticregression
Omede Firouz <ofirouz@palantir.com>
2015-04-07 23:36:31 -0400
Commit: d138aa8, github.com/apache/spark/pull/5301
[SPARK-6737] Fix memory leak in OutputCommitCoordinator
Josh Rosen <joshrosen@databricks.com>
2015-04-07 16:18:55 -0700
Commit: c83e039, github.com/apache/spark/pull/5397
[SPARK-6748] [SQL] Makes QueryPlan.schema a lazy val
Cheng Lian <lian@databricks.com>
2015-04-08 07:00:56 +0800
Commit: 77bcceb, github.com/apache/spark/pull/5398
[SPARK-6720][MLLIB] PySpark MultivariateStatisticalSummary unit test for normL1...
lewuathe <lewuathe@me.com>
2015-04-07 14:36:57 -0700
Commit: fc957dc, github.com/apache/spark/pull/5374
Revert "[SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path"
Xiangrui Meng <meng@databricks.com>
2015-04-07 14:34:15 -0700
Commit: e6f08fb
[SPARK-6568] spark-shell.cmd --jars option does not accept the jar that has space in its path
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-04-07 14:29:53 -0700
Commit: 596ba77, github.com/apache/spark/pull/5347
[SPARK-6750] Upgrade ScalaStyle to 0.7.
Reynold Xin <rxin@databricks.com>
2015-04-07 12:37:33 -0700
Commit: 1232215, github.com/apache/spark/pull/5399
Replace use of .size with .length for Arrays
sksamuel <sam@sksamuel.com>
2015-04-07 10:43:22 -0700
Commit: 2c32bef, github.com/apache/spark/pull/5376
[SPARK-6733][Scheduler] Added scala.language.existentials
Vinod K C <vinod.kc@huawei.com>
2015-04-07 10:42:08 -0700
Commit: 7162ecf, github.com/apache/spark/pull/5384
[SPARK-3591][YARN]fire and forget for YARN cluster mode
WangTaoTheTonic <wangtao111@huawei.com>
2015-04-07 08:36:25 -0500
Commit: b65bad6, github.com/apache/spark/pull/5297
[SPARK-6736][GraphX][Doc]Example of Graph#aggregateMessages has error
Sasaki Toru <sasakitoa@nttdata.co.jp>
2015-04-07 01:55:32 -0700
Commit: ae980eb, github.com/apache/spark/pull/5388
[SPARK-6636] Use public DNS hostname everywhere in spark_ec2.py
Matt Aasted <aasted@twitch.tv>
2015-04-06 23:50:48 -0700
Commit: 6f0d55d, github.com/apache/spark/pull/5302
[SPARK-6716] Change SparkContext.DRIVER_IDENTIFIER from <driver> to driver
Josh Rosen <joshrosen@databricks.com>
2015-04-06 23:33:16 -0700
Commit: a0846c4, github.com/apache/spark/pull/5372
[Minor] [SQL] [SPARK-6729] Minor fix for DriverQuirks get
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-04-06 18:00:51 -0700
Commit: e40ea87, github.com/apache/spark/pull/5378
[MLlib] [SPARK-6713] Iterators in columnSimilarities for mapPartitionsWithIndex
Reza Zadeh <reza@databricks.com>
2015-04-06 13:15:01 -0700
Commit: 30363ed, github.com/apache/spark/pull/5364
SPARK-6569 [STREAMING] Down-grade same-offset message in Kafka streaming to INFO
Sean Owen <sowen@cloudera.com>
2015-04-06 10:18:56 +0100
Commit: 9fe4125, github.com/apache/spark/pull/5366
[SPARK-6673] spark-shell.cmd can't start in Windows even when spark was built
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-04-06 10:11:20 +0100
Commit: 49f3882, github.com/apache/spark/pull/5328
[SPARK-6602][Core] Update MapOutputTrackerMasterActor to MapOutputTrackerMasterEndpoint
zsxwing <zsxwing@gmail.com>
2015-04-05 21:57:15 -0700
Commit: 0b5d028, github.com/apache/spark/pull/5371
[SPARK-6262][MLLIB]Implement missing methods for MultivariateStatisticalSummary
lewuathe <lewuathe@me.com>
2015-04-05 16:13:31 -0700
Commit: acffc43, github.com/apache/spark/pull/5359
[SPARK-6602][Core] Replace direct use of Akka with Spark RPC interface - part 1
zsxwing <zsxwing@gmail.com>
2015-04-04 11:52:05 -0700
Commit: f15806a, github.com/apache/spark/pull/5268
[SPARK-6607][SQL] Check invalid characters for Parquet schema and show error messages
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-05 00:20:43 +0800
Commit: 7bca62f, github.com/apache/spark/pull/5263
[SQL] Use path.makeQualified in newParquet.
Yin Huai <yhuai@databricks.com>
2015-04-04 23:26:10 +0800
Commit: da25c86, github.com/apache/spark/pull/5353
[SPARK-6700] disable flaky test
Davies Liu <davies@databricks.com>
2015-04-03 15:22:21 -0700
Commit: 9b40c17, github.com/apache/spark/pull/5356
[SPARK-6647][SQL] Make trait StringComparison as BinaryPredicate and fix unit tests of string data source Filter
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-03 12:35:00 -0700
Commit: 26b415e, github.com/apache/spark/pull/5309
[SPARK-6688] [core] Always use resolved URIs in EventLoggingListener.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-03 11:54:31 -0700
Commit: 14632b7, github.com/apache/spark/pull/5340
Closes #3158
Reynold Xin <rxin@databricks.com>
2015-04-03 11:53:07 -0700
Commit: ffe8cc9
[SPARK-6640][Core] Fix the race condition of creating HeartbeatReceiver and retrieving HeartbeatReceiver
zsxwing <zsxwing@gmail.com>
2015-04-03 11:44:27 -0700
Commit: 88504b7, github.com/apache/spark/pull/5306
[SPARK-6492][CORE] SparkContext.stop() can deadlock when DAGSchedulerEventProcessLoop dies
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-04-03 19:23:11 +0100
Commit: 2c43ea3, github.com/apache/spark/pull/5277
[SPARK-5203][SQL] fix union with different decimal type
guowei2 <guowei2@asiainfo.com>
2015-04-04 02:02:30 +0800
Commit: c23ba81, github.com/apache/spark/pull/4004
[Minor][SQL] Fix typo
Liang-Chi Hsieh <viirya@gmail.com>
2015-04-03 18:31:48 +0100
Commit: dc6dff2, github.com/apache/spark/pull/5352
[SPARK-6615][MLLIB] Python API for Word2Vec
lewuathe <lewuathe@me.com>
2015-04-03 09:49:50 -0700
Commit: 512a2f1, github.com/apache/spark/pull/5296
[MLLIB] Remove println in LogisticRegression.scala
Omede Firouz <ofirouz@palantir.com>
2015-04-03 10:26:43 +0100
Commit: b52c7f9, github.com/apache/spark/pull/5338
[SPARK-6560][CORE] Do not suppress exceptions from writer.write.
Stephen Haberman <stephen@exigencecorp.com>
2015-04-03 09:48:37 +0100
Commit: b0d884f, github.com/apache/spark/pull/5223
[SPARK-6428] Turn on explicit type checking for public methods.
Reynold Xin <rxin@databricks.com>
2015-04-03 01:25:02 -0700
Commit: 82701ee, github.com/apache/spark/pull/5342
[SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
Yin Huai <yhuai@databricks.com>
2015-04-03 14:40:36 +0800
Commit: c42c3fc, github.com/apache/spark/pull/5339
[SPARK-6621][Core] Fix the bug that calling EventLoop.stop in EventLoop.onReceive/onError/onStart doesn't call onStop
zsxwing <zsxwing@gmail.com>
2015-04-02 22:54:30 -0700
Commit: 440ea31, github.com/apache/spark/pull/5280
[SPARK-6345][STREAMING][MLLIB] Fix for training with prediction
freeman <the.freeman.lab@gmail.com>
2015-04-02 21:37:44 -0700
Commit: 6e1c1ec, github.com/apache/spark/pull/5037
[CORE] The description of jobHistory config should be spark.history.fs.logDirectory
KaiXinXiaoLei <huleilei1@huawei.com>
2015-04-02 20:24:31 -0700
Commit: 8a0aa81, github.com/apache/spark/pull/5332
[SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
Yin Huai <yhuai@databricks.com>
2015-04-02 20:23:08 -0700
Commit: 4b82bd7, github.com/apache/spark/pull/5339
[SPARK-6650] [core] Stop ExecutorAllocationManager when context stops.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-02 19:48:55 -0700
Commit: 45134ec, github.com/apache/spark/pull/5311
[SPARK-6686][SQL] Use resolved output instead of names for toDF rename
Michael Armbrust <michael@databricks.com>
2015-04-02 18:30:55 -0700
Commit: 052dee0, github.com/apache/spark/pull/5337
[SPARK-6243][SQL] The Operation of match did not consider the scenarios that order.dataType does not match NativeType
DoingDone9 <799203320@qq.com>
2015-04-02 17:23:51 -0700
Commit: 947802c, github.com/apache/spark/pull/4959
[SQL][Minor] Use analyzed logical instead of unresolved in HiveComparisonTest
Cheng Hao <hao.cheng@intel.com>
2015-04-02 17:20:31 -0700
Commit: dfd2982, github.com/apache/spark/pull/4946
[SPARK-6618][SPARK-6669][SQL] Lock Hive metastore client correctly.
Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
2015-04-02 16:46:50 -0700
Commit: 5db8912, github.com/apache/spark/pull/5333
[Minor] [SQL] Follow-up of PR #5210
Cheng Lian <lian@databricks.com>
2015-04-02 16:15:34 -0700
Commit: d3944b6, github.com/apache/spark/pull/5219
[SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property
Yin Huai <yhuai@databricks.com>
2015-04-02 16:02:31 -0700
Commit: 251698f, github.com/apache/spark/pull/5313
[SQL] Throw UnsupportedOperationException instead of NotImplementedError
Michael Armbrust <michael@databricks.com>
2015-04-02 16:01:03 -0700
Commit: 4214e50, github.com/apache/spark/pull/5315
SPARK-6414: Spark driver failed with NPE on job cancelation
Hung Lin <hung.lin@gmail.com>
2015-04-02 14:01:43 -0700
Commit: e3202aa, github.com/apache/spark/pull/5124
[SPARK-6667] [PySpark] remove setReuseAddress
Davies Liu <davies@databricks.com>
2015-04-02 12:18:33 -0700
Commit: 0cce545, github.com/apache/spark/pull/5324
[SPARK-6672][SQL] convert row to catalyst in createDataFrame(RDD[Row], ...)
Xiangrui Meng <meng@databricks.com>
2015-04-02 17:57:01 +0800
Commit: 424e987, github.com/apache/spark/pull/5329
[SPARK-6627] Some clean-up in shuffle code.
Patrick Wendell <patrick@databricks.com>
2015-04-01 23:42:09 -0700
Commit: 6562787, github.com/apache/spark/pull/5286
[SPARK-6663] [SQL] use Literal.create instead of constructor
Davies Liu <davies@databricks.com>
2015-04-01 23:11:38 -0700
Commit: 40df5d4, github.com/apache/spark/pull/5320
Revert "[SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock"
Cheng Lian <lian@databricks.com>
2015-04-02 12:56:34 +0800
Commit: 2bc7fe7
[SPARK-6658][SQL] Update DataFrame documentation to fix type references.
Chet Mancini <chetmancini@gmail.com>
2015-04-01 21:39:46 -0700
Commit: 191524e, github.com/apache/spark/pull/5316
[SPARK-6578] Small rewrite to make the logic more clear in MessageWithHeader.transferTo.
Reynold Xin <rxin@databricks.com>
2015-04-01 18:36:06 -0700
Commit: 899ebcb, github.com/apache/spark/pull/5319
[SPARK-6660][MLLIB] pythonToJava doesn't recognize object arrays
Xiangrui Meng <meng@databricks.com>
2015-04-01 18:17:07 -0700
Commit: 4815bc2, github.com/apache/spark/pull/5318
[SPARK-6553] [pyspark] Support functools.partial as UDF
ksonj <kson@siberie.de>
2015-04-01 17:23:57 -0700
Commit: 757b2e9, github.com/apache/spark/pull/5206
[SPARK-6580] [MLLIB] Optimize LogisticRegressionModel.predictPoint
Yanbo Liang <ybliang8@gmail.com>
2015-04-01 17:19:36 -0700
Commit: 86b4399, github.com/apache/spark/pull/5249
[SPARK-6576] [MLlib] [PySpark] DenseMatrix in PySpark should support indexing
MechCoder <manojkumarsivaraj334@gmail.com>
2015-04-01 17:03:39 -0700
Commit: 2fa3b47, github.com/apache/spark/pull/5232
[SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from NormalEquation
Xiangrui Meng <meng@databricks.com>
2015-04-01 16:47:18 -0700
Commit: ccafd75, github.com/apache/spark/pull/5314
[SPARK-6578] [core] Fix thread-safety issue in outbound path of network library.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-01 16:06:11 -0700
Commit: f084c5d, github.com/apache/spark/pull/5234
[SPARK-6657] [Python] [Docs] fixed python doc build warnings
Joseph K. Bradley <joseph@databricks.com>
2015-04-01 15:15:47 -0700
Commit: fb25e8c, github.com/apache/spark/pull/5317
[SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy array
Xiangrui Meng <meng@databricks.com>
2015-04-01 13:29:04 -0700
Commit: 2275acc, github.com/apache/spark/pull/5312
SPARK-6433 hive tests to import spark-sql test JAR for QueryTest access
Steve Loughran <stevel@hortonworks.com>
2015-04-01 16:26:54 +0100
Commit: ee11be2, github.com/apache/spark/pull/5119
[SPARK-6608] [SQL] Makes DataFrame.rdd a lazy val
Cheng Lian <lian@databricks.com>
2015-04-01 21:34:45 +0800
Commit: d36c5fc, github.com/apache/spark/pull/5265
SPARK-6626 [DOCS]: Corrected Scala:TwitterUtils parameters
jayson <jayson@ziprecruiter.com>
2015-04-01 11:12:55 +0100
Commit: 0358b08, github.com/apache/spark/pull/5295
[SPARK-6597][Minor] Replace `input:checkbox` with `input[type="checkbox"]` in additional-metrics.js
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-04-01 11:11:56 +0100
Commit: d824c11, github.com/apache/spark/pull/5254
[EC2] [SPARK-6600] Open ports in ec2/spark_ec2.py to allow HDFS NFS gateway
Florian Verhein <florian.verhein@gmail.com>
2015-04-01 11:10:43 +0100
Commit: 4122623, github.com/apache/spark/pull/5257
[SPARK-4655][Core] Split Stage into ShuffleMapStage and ResultStage subclasses
Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
2015-04-01 11:09:00 +0100
Commit: ff1915e, github.com/apache/spark/pull/4708
[Doc] Improve Python DataFrame documentation
Reynold Xin <rxin@databricks.com>
2015-03-31 18:31:36 -0700
Commit: 305abe1, github.com/apache/spark/pull/5287
[SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure
Josh Rosen <joshrosen@databricks.com>
2015-03-31 16:18:39 -0700
Commit: 3732607, github.com/apache/spark/pull/5276
[SPARK-5692] [MLlib] Word2Vec save/load
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-31 16:01:08 -0700
Commit: 0e00f12, github.com/apache/spark/pull/5291
[SPARK-6633][SQL] Should be "Contains" instead of "EndsWith" when constructing sources.StringContains
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-31 13:18:07 -0700
Commit: 2036bc5, github.com/apache/spark/pull/5299
[SPARK-5371][SQL] Propagate types after function conversion, before further resolution
Michael Armbrust <michael@databricks.com>
2015-03-31 11:34:29 -0700
Commit: beebb7f, github.com/apache/spark/pull/5278
[SPARK-6255] [MLLIB] Support multiclass classification in Python API
Yanbo Liang <ybliang8@gmail.com>
2015-03-31 11:32:14 -0700
Commit: b5bd75d, github.com/apache/spark/pull/5137
[SPARK-6598][MLLIB] Python API for IDFModel
lewuathe <lewuathe@me.com>
2015-03-31 11:25:21 -0700
Commit: 46de6c0, github.com/apache/spark/pull/5264
[SPARK-6145][SQL] fix ORDER BY on nested fields
Michael Armbrust <michael@databricks.com>
2015-03-31 11:23:18 -0700
Commit: cd48ca5, github.com/apache/spark/pull/5189
[SPARK-6575] [SQL] Adds configuration to disable schema merging while converting metastore Parquet tables
Cheng Lian <lian@databricks.com>
2015-03-31 11:21:15 -0700
Commit: 8102014, github.com/apache/spark/pull/5231
[SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation
Cheng Lian <lian@databricks.com>
2015-03-31 11:18:25 -0700
Commit: a7992ff, github.com/apache/spark/pull/5289
[SPARK-4894][mllib] Added Bernoulli option to NaiveBayes model in mllib
leahmcguire <lmcguire@salesforce.com>, Joseph K. Bradley <joseph@databricks.com>, Leah McGuire <lmcguire@salesforce.com>
2015-03-31 11:16:55 -0700
Commit: d01a6d8, github.com/apache/spark/pull/4087
[SPARK-6542][SQL] add CreateStruct
Xiangrui Meng <meng@databricks.com>
2015-03-31 17:05:23 +0800
Commit: a05835b, github.com/apache/spark/pull/5195
[SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock
Yin Huai <yhuai@databricks.com>
2015-03-31 16:28:40 +0800
Commit: 314afd0, github.com/apache/spark/pull/5281
[SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.
Reynold Xin <rxin@databricks.com>
2015-03-31 00:25:23 -0700
Commit: b80a030, github.com/apache/spark/pull/5284
[SPARK-6625][SQL] Add common string filters to data sources.
Reynold Xin <rxin@databricks.com>
2015-03-31 00:19:51 -0700
Commit: f07e714, github.com/apache/spark/pull/5285
[SPARK-5124][Core] Move StopCoordinator to the receive method since it does not require a reply
zsxwing <zsxwing@gmail.com>
2015-03-30 22:10:49 -0700
Commit: 5677557, github.com/apache/spark/pull/5283
[SPARK-6119][SQL] DataFrame support for missing data handling
Reynold Xin <rxin@databricks.com>
2015-03-30 20:47:10 -0700
Commit: b8ff2bc, github.com/apache/spark/pull/5274
[SPARK-6369] [SQL] Uses commit coordinator to help committing Hive and Parquet tables
Cheng Lian <lian@databricks.com>
2015-03-31 07:48:37 +0800
Commit: fde6945, github.com/apache/spark/pull/5139
[SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() and applySchema
Davies Liu <davies@databricks.com>
2015-03-30 15:47:00 -0700
Commit: f76d2e5, github.com/apache/spark/pull/5273
[HOTFIX][SPARK-4123]: Updated to fix bug where multiple dependencies added breaks Github output
Brennon York <brennon.york@capitalone.com>
2015-03-30 12:48:26 -0700
Commit: df35500, github.com/apache/spark/pull/5269
[SPARK-6592][SQL] fix filter for scaladoc to generate API doc for Row class under catalyst dir
CodingCat <zhunansjtu@gmail.com>
2015-03-30 11:54:44 -0700
Commit: 32259c6, github.com/apache/spark/pull/5252
[SPARK-6595][SQL] MetastoreRelation should be a MultiInstanceRelation
Michael Armbrust <michael@databricks.com>
2015-03-30 22:24:12 +0800
Commit: fe81f6c, github.com/apache/spark/pull/5251
[HOTFIX] Update start-slave.sh
Jose Manuel Gomez <jmgomez@stratio.com>
2015-03-30 14:59:08 +0100
Commit: 19d4c39, github.com/apache/spark/pull/5262
[SPARK-5750][SPARK-3441][SPARK-5836][CORE] Added documentation explaining shuffle
Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
2015-03-30 11:52:02 +0100
Commit: 4bdfb7b, github.com/apache/spark/pull/5074
[SPARK-6596] fix the instruction on building scaladoc
CodingCat <zhunansjtu@gmail.com>
2015-03-30 11:41:43 +0100
Commit: de67330, github.com/apache/spark/pull/5253
[spark-sql] a better exception message than "scala.MatchError" for unsupported types in Schema creation
Eran Medan <ehrann.mehdan@gmail.com>
2015-03-30 00:02:52 -0700
Commit: 17b13c5, github.com/apache/spark/pull/5235
Fix string interpolator error in HeartbeatReceiver
Li Zhihui <zhihui.li@intel.com>
2015-03-29 21:30:37 -0700
Commit: 01dc9f5, github.com/apache/spark/pull/5255
[SPARK-5124][Core] A standard RPC interface and an Akka implementation
zsxwing <zsxwing@gmail.com>
2015-03-29 21:25:09 -0700
Commit: a8d53af, github.com/apache/spark/pull/4588
[SPARK-6585][Tests]Fix FileServerSuite testcase in some Env.
June.He <jun.hejun@huawei.com>
2015-03-29 12:47:22 +0100
Commit: 0e2753f, github.com/apache/spark/pull/5239
[SPARK-6558] Utils.getCurrentUserName returns the full principal name instead of login name
Thomas Graves <tgraves@apache.org>
2015-03-29 12:43:30 +0100
Commit: 52ece26, github.com/apache/spark/pull/5229
[SPARK-6406] Launch Spark using assembly jar instead of a separate launcher jar
Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
2015-03-29 12:40:37 +0100
Commit: e3eb393, github.com/apache/spark/pull/5085
[SPARK-4123][Project Infra]: Show new dependencies added in pull requests
Brennon York <brennon.york@capitalone.com>
2015-03-29 12:37:53 +0100
Commit: 55153f5, github.com/apache/spark/pull/5093
[DOC] Improvements to Python docs.
Reynold Xin <rxin@databricks.com>
2015-03-28 23:59:27 -0700
Commit: 5eef00d, github.com/apache/spark/pull/5238
[SPARK-6571][MLLIB] use wrapper in MatrixFactorizationModel.load
Xiangrui Meng <meng@databricks.com>
2015-03-28 15:08:05 -0700
Commit: f75f633, github.com/apache/spark/pull/5243
[SPARK-6552][Deploy][Doc]expose start-slave.sh to user and update outdated doc
WangTaoTheTonic <wangtao111@huawei.com>
2015-03-28 12:32:35 +0000
Commit: 9963143, github.com/apache/spark/pull/5205
[SPARK-6538][SQL] Add missing nullable Metastore fields when merging a Parquet schema
Adam Budde <budde@amazon.com>
2015-03-28 09:14:09 +0800
Commit: 5909f09, github.com/apache/spark/pull/5214
[SPARK-6564][SQL] SQLContext.emptyDataFrame should contain 0 row, not 1 row
Reynold Xin <rxin@databricks.com>
2015-03-27 14:56:57 -0700
Commit: 3af7334, github.com/apache/spark/pull/5226
[SPARK-6526][ML] Add Normalizer transformer in ML package
Xusen Yin <yinxusen@gmail.com>
2015-03-27 13:29:10 -0700
Commit: d5497ab, github.com/apache/spark/pull/5181
[SPARK-6574] [PySpark] fix sql example
Davies Liu <davies@databricks.com>
2015-03-27 11:42:26 -0700
Commit: 887e1b7, github.com/apache/spark/pull/5230
[SPARK-6550][SQL] Use analyzed plan in DataFrame
Michael Armbrust <michael@databricks.com>
2015-03-27 11:40:00 -0700
Commit: 5d9c37c, github.com/apache/spark/pull/5217
[SPARK-6544][build] Increment Avro version from 1.7.6 to 1.7.7
Dean Chen <deanchen5@gmail.com>
2015-03-27 14:32:51 +0000
Commit: aa2b991, github.com/apache/spark/pull/5193
[SPARK-6556][Core] Fix wrong parsing logic of executorTimeoutMs and checkTimeoutIntervalMs in HeartbeatReceiver
zsxwing <zsxwing@gmail.com>
2015-03-27 12:31:06 +0000
Commit: da546b7, github.com/apache/spark/pull/5209
[SPARK-6341][mllib] Upgrade breeze from 0.11.1 to 0.11.2
Yu ISHIKAWA <yuu.ishikawa@gmail.com>
2015-03-27 00:15:02 -0700
Commit: f43a610, github.com/apache/spark/pull/5222
[SPARK-6405] Limiting the maximum Kryo buffer size to be 2GB.
mcheah <mcheah@palantir.com>
2015-03-26 22:48:42 -0700
Commit: 49d2ec6, github.com/apache/spark/pull/5218
[SPARK-6510][GraphX]: Add Graph#minus method to act as Set#difference
Brennon York <brennon.york@capitalone.com>
2015-03-26 19:08:09 -0700
Commit: 39fb579, github.com/apache/spark/pull/5175
[DOCS][SQL] Fix JDBC example
Michael Armbrust <michael@databricks.com>
2015-03-26 14:51:46 -0700
Commit: aad0032, github.com/apache/spark/pull/5192
[SPARK-6554] [SQL] Don't push down predicates which reference partition column(s)
Cheng Lian <lian@databricks.com>
2015-03-26 13:11:37 -0700
Commit: 71a0d40, github.com/apache/spark/pull/5210
[SPARK-6117] [SQL] Improvements to DataFrame.describe()
Reynold Xin <rxin@databricks.com>
2015-03-26 12:26:13 -0700
Commit: 784fcd5, github.com/apache/spark/pull/5201
SPARK-6532 [BUILD] LDAModel.scala fails scalastyle on Windows
Sean Owen <sowen@cloudera.com>
2015-03-26 10:52:31 -0700
Commit: c3a52a0, github.com/apache/spark/pull/5211
SPARK-6480 [CORE] histogram() bucket function is wrong in some simple edge cases
Sean Owen <sowen@cloudera.com>
2015-03-26 15:00:23 +0000
Commit: fe15ea9, github.com/apache/spark/pull/5148
[MLlib]remove unused import
Yuhao Yang <hhbyyh@gmail.com>
2015-03-26 13:27:05 +0000
Commit: 3ddb975, github.com/apache/spark/pull/5207
[SQL][SPARK-6471]: Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns
Yash Datta <Yash.Datta@guavus.com>
2015-03-26 21:13:38 +0800
Commit: 1c05027, github.com/apache/spark/pull/5141
[SPARK-6468][Block Manager] Fix the race condition of subDirs in DiskBlockManager
zsxwing <zsxwing@gmail.com>
2015-03-26 12:54:48 +0000
Commit: 0c88ce5, github.com/apache/spark/pull/5136
[SPARK-6465][SQL] Fix serialization of GenericRowWithSchema using kryo
Michael Armbrust <michael@databricks.com>
2015-03-26 18:46:57 +0800
Commit: f88f51b, github.com/apache/spark/pull/5191
[SPARK-6546][Build] Wrong code causes Spark compilation to fail
DoingDone9 <799203320@qq.com>
2015-03-26 17:04:19 +0800
Commit: 855cba8, github.com/apache/spark/pull/5198
[SPARK-6117] [SQL] add describe function to DataFrame for summary statis...
azagrebin <azagrebin@gmail.com>
2015-03-26 00:25:04 -0700
Commit: 5bbcd13, github.com/apache/spark/pull/5073
[SPARK-6536] [PySpark] Column.inSet() in Python
Davies Liu <davies@databricks.com>
2015-03-26 00:01:24 -0700
Commit: f535802, github.com/apache/spark/pull/5190
[SPARK-6463][SQL] AttributeSet.equal should compare size
sisihj <jun.hejun@huawei.com>, Michael Armbrust <michael@databricks.com>
2015-03-25 19:21:54 -0700
Commit: 276ef1c, github.com/apache/spark/pull/5194
The UT test of spark is failed. Because there is a test in SQLQuerySuite about creating table “test”
KaiXinXiaoLei <huleilei1@huawei.com>
2015-03-25 19:15:30 -0700
Commit: e87bf37, github.com/apache/spark/pull/5150
[SPARK-6202] [SQL] enable variable substitution on test framework
Daoyuan Wang <daoyuan.wang@intel.com>
2015-03-25 18:43:26 -0700
Commit: 5ab6e9f, github.com/apache/spark/pull/4930
[SPARK-6271][SQL] Sort these tokens in alphabetic order to avoid further duplicate in HiveQl
DoingDone9 <799203320@qq.com>
2015-03-25 18:41:59 -0700
Commit: 328daf6, github.com/apache/spark/pull/4973
[SPARK-6326][SQL] Improve castStruct to be faster
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-25 17:52:23 -0700
Commit: 73d5775, github.com/apache/spark/pull/5017
[SPARK-5498][SQL]fix query exception when partition schema does not match table schema
jeanlyn <jeanlyn92@gmail.com>
2015-03-25 17:47:45 -0700
Commit: e6d1406, github.com/apache/spark/pull/4289
[SPARK-6450] [SQL] Fixes metastore Parquet table conversion
Cheng Lian <lian@databricks.com>
2015-03-25 17:40:19 -0700
Commit: 8c3b005, github.com/apache/spark/pull/5183
[SPARK-6079] Use index to speed up StatusTracker.getJobIdsForGroup()
Josh Rosen <joshrosen@databricks.com>
2015-03-25 17:40:00 -0700
Commit: d44a336, github.com/apache/spark/pull/4830
[SPARK-5987] [MLlib] Save/load for GaussianMixtureModels
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-25 14:45:23 -0700
Commit: 4fc4d03, github.com/apache/spark/pull/4986
[SPARK-6256] [MLlib] MLlib Python API parity check for regression
Yanbo Liang <ybliang8@gmail.com>
2015-03-25 13:38:33 -0700
Commit: 4353373, github.com/apache/spark/pull/4997
[SPARK-5771] Master UI inconsistently displays application cores
Andrew Or <andrew@databricks.com>
2015-03-25 13:28:32 -0700
Commit: c1b74df, github.com/apache/spark/pull/5177
[SPARK-6537] UIWorkloadGenerator: The main thread should not stop SparkContext until all jobs finish
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-03-25 13:27:15 -0700
Commit: acef51d, github.com/apache/spark/pull/5187
[SPARK-6076][Block Manager] Fix a potential OOM issue when StorageLevel is MEMORY_AND_DISK_SER
zsxwing <zsxwing@gmail.com>
2015-03-25 12:09:30 -0700
Commit: 883b7e9, github.com/apache/spark/pull/4827
[SPARK-6409][SQL] It is not necessary to avoid the old interface of Hive, because doing so makes some UDAFs unable to work.
DoingDone9 <799203320@qq.com>
2015-03-25 11:11:52 -0700
Commit: 968408b, github.com/apache/spark/pull/5131
[ML][FEATURE] SPARK-5566: RegEx Tokenizer
Augustin Borsu <augustin@sagacify.com>, Augustin Borsu <a.borsu@gmail.com>, Augustin Borsu <aborsu@gmail.com>, Xiangrui Meng <meng@databricks.com>
2015-03-25 10:16:39 -0700
Commit: 982952f, github.com/apache/spark/pull/4504
[SPARK-6496] [MLLIB] GeneralizedLinearAlgorithm.run(input, initialWeights) should initialize numFeatures
Yanbo Liang <ybliang8@gmail.com>
2015-03-25 17:05:56 +0000
Commit: 10c7860, github.com/apache/spark/pull/5167
[SPARK-6483][SQL]Improve ScalaUdf called performance.
zzcclp <xm_zzc@sina.com>
2015-03-25 19:11:04 +0800
Commit: 64262ed, github.com/apache/spark/pull/5154
[DOCUMENTATION]Fixed Missing Type Import in Documentation
Bill Chambers <wchambers@ischool.berkeley.edu>, anabranch <wac.chambers@gmail.com>
2015-03-24 22:24:35 -0700
Commit: c5cc414, github.com/apache/spark/pull/5179
[SPARK-6515] update OpenHashSet impl
Xiangrui Meng <meng@databricks.com>
2015-03-24 18:58:27 -0700
Commit: c14ddd9, github.com/apache/spark/pull/5176
[SPARK-6428][Streaming] Added explicit types for all public methods.
Reynold Xin <rxin@databricks.com>
2015-03-24 17:08:25 -0700
Commit: 9459865, github.com/apache/spark/pull/5110
[SPARK-6512] add contains to OpenHashMap
Xiangrui Meng <meng@databricks.com>
2015-03-24 17:06:22 -0700
Commit: 6930e96, github.com/apache/spark/pull/5171
[SPARK-6469] Improving documentation on YARN local directories usage
Christophe Préaud <christophe.preaud@kelkoo.com>
2015-03-24 17:05:49 -0700
Commit: 05c2214, github.com/apache/spark/pull/5165
Revert "[SPARK-5771] Number of Cores in Completed Applications of Standalone Master Web Page always be 0 if sc.stop() is called"
Andrew Or <andrew@databricks.com>
2015-03-24 16:49:27 -0700
Commit: dd907d1
Revert "[SPARK-5771][UI][hotfix] Change Requested Cores into * if default cores is not set"
Andrew Or <andrew@databricks.com>
2015-03-24 16:41:31 -0700
Commit: f7c3668
[SPARK-3570] Include time to open files in shuffle write time.
Kay Ousterhout <kayousterhout@gmail.com>
2015-03-24 16:29:40 -0700
Commit: d8ccf65, github.com/apache/spark/pull/4550
[SPARK-6088] Correct how tasks that get remote results are shown in UI.
Kay Ousterhout <kayousterhout@gmail.com>
2015-03-24 16:26:43 -0700
Commit: 6948ab6, github.com/apache/spark/pull/4839
[SPARK-6428][SQL] Added explicit types for all public methods in catalyst
Reynold Xin <rxin@databricks.com>
2015-03-24 16:03:55 -0700
Commit: 7334801, github.com/apache/spark/pull/5162
[SPARK-6209] Clean up connections in ExecutorClassLoader after failing to load classes (master branch PR)
Josh Rosen <joshrosen@databricks.com>
2015-03-24 14:38:20 -0700
Commit: 7215aa74, github.com/apache/spark/pull/4944
[SPARK-6458][SQL] Better error messages for invalid data sources
Michael Armbrust <michael@databricks.com>
2015-03-24 14:10:56 -0700
Commit: a8f51b8, github.com/apache/spark/pull/5158
[SPARK-6376][SQL] Avoid eliminating subqueries until optimization
Michael Armbrust <michael@databricks.com>
2015-03-24 14:08:20 -0700
Commit: cbeaf9e, github.com/apache/spark/pull/5160
[SPARK-6375][SQL] Fix formatting of error messages.
Michael Armbrust <michael@databricks.com>
2015-03-24 13:22:46 -0700
Commit: 046c1e2, github.com/apache/spark/pull/5155
[SPARK-6054][SQL] Fix transformations of TreeNodes that hold StructTypes
Michael Armbrust <michael@databricks.com>
2015-03-24 12:28:01 -0700
Commit: 3fa3d12, github.com/apache/spark/pull/5157
[SPARK-6437][SQL] Use completion iterator to close external sorter
Michael Armbrust <michael@databricks.com>
2015-03-24 12:10:30 -0700
Commit: 26c6ce3, github.com/apache/spark/pull/5161
[SPARK-6459][SQL] Warn when constructing trivially true equals predicate
Michael Armbrust <michael@databricks.com>
2015-03-24 12:09:02 -0700
Commit: 32efadd, github.com/apache/spark/pull/5163
[SPARK-6361][SQL] support adding a column with metadata in DF
Xiangrui Meng <meng@databricks.com>
2015-03-24 12:08:19 -0700
Commit: 6bdddb6, github.com/apache/spark/pull/5151
[SPARK-6475][SQL] recognize array types when infer data types from JavaBeans
Xiangrui Meng <meng@databricks.com>
2015-03-24 10:11:27 -0700
Commit: a1d1529, github.com/apache/spark/pull/5146
[ML][docs][minor] Define LabeledDocument/Document classes in CV example
Peter Rudenko <petro.rudenko@gmail.com>
2015-03-24 16:33:38 +0000
Commit: 08d4528, github.com/apache/spark/pull/5135
[SPARK-5559] [Streaming] [Test] Remove opportunity we met flakiness when running FlumeStreamSuite
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-03-24 16:13:25 +0000
Commit: 85cf063, github.com/apache/spark/pull/4337
[SPARK-6473] [core] Do not try to figure out Scala version if not needed...
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-24 13:48:33 +0000
Commit: b293afc, github.com/apache/spark/pull/5143
Update the command to use IPython notebook
Cong Yue <yuecong1104@gmail.com>
2015-03-24 12:56:13 +0000
Commit: c12312f, github.com/apache/spark/pull/5111
[SPARK-6477][Build]: Run MIMA tests before the Spark test suite
Brennon York <brennon.york@capitalone.com>
2015-03-24 10:33:04 +0000
Commit: 37fac1d, github.com/apache/spark/pull/5145
[SPARK-6452] [SQL] Checks for missing attributes and unresolved operator for all types of operator
Cheng Lian <lian@databricks.com>
2015-03-24 01:12:11 -0700
Commit: 1afcf77, github.com/apache/spark/pull/5129
[SPARK-6428] Added explicit types for all public methods in core.
Reynold Xin <rxin@databricks.com>
2015-03-23 23:41:06 -0700
Commit: 4ce2782, github.com/apache/spark/pull/5125
[SPARK-6124] Support jdbc connection properties in OPTIONS part of the query
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-03-23 17:00:27 -0700
Commit: bfd3ee9, github.com/apache/spark/pull/4859
Revert "[SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1."
Patrick Wendell <patrick@databricks.com>
2015-03-23 15:08:39 -0700
Commit: 6cd7058
[SPARK-6308] [MLlib] [Sql] Override TypeName in VectorUDT and MatrixUDT
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-23 13:30:21 -0700
Commit: 474d132, github.com/apache/spark/pull/5118
[SPARK-6397][SQL] Check the missingInput simply
Yadong Qi <qiyadong2010@gmail.com>
2015-03-23 18:16:49 +0800
Commit: 9f3273b, github.com/apache/spark/pull/5132
Revert "[SPARK-6397][SQL] Check the missingInput simply"
Cheng Lian <lian@databricks.com>
2015-03-23 12:15:19 +0800
Commit: bf044de
[SPARK-6397][SQL] Check the missingInput simply
q00251598 <qiyadong@huawei.com>
2015-03-23 12:06:13 +0800
Commit: e566fe5, github.com/apache/spark/pull/5082
[SPARK-4985] [SQL] parquet support for date type
Daoyuan Wang <daoyuan.wang@intel.com>
2015-03-23 11:46:16 +0800
Commit: 4659468, github.com/apache/spark/pull/3822
[SPARK-6337][Documentation, SQL]Spark 1.3 doc fixes
vinodkc <vinod.kc.in@gmail.com>
2015-03-22 20:00:08 +0000
Commit: 2bf40c5, github.com/apache/spark/pull/5112
[HOTFIX] Build break due to https://github.com/apache/spark/pull/5128
Reynold Xin <rxin@databricks.com>
2015-03-22 12:08:15 -0700
Commit: 7a0da47
[SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1.
Calvin Jia <jia.calvin@gmail.com>
2015-03-22 11:11:29 -0700
Commit: a41b9c6, github.com/apache/spark/pull/4867
SPARK-6454 [DOCS] Fix links to pyspark api
Kamil Smuga <smugakamil@gmail.com>, stderr <smugakamil@gmail.com>
2015-03-22 15:56:25 +0000
Commit: 6ef4863, github.com/apache/spark/pull/5120
[SPARK-6453][Mesos] Some Mesos*Suite have a different package with their classes
Jongyoul Lee <jongyoul@gmail.com>
2015-03-22 15:53:18 +0000
Commit: adb2ff7, github.com/apache/spark/pull/5126
[SPARK-6455] [docs] Correct some mistakes and typos
Hangchen Yu <yuhc@gitcafe.com>
2015-03-22 15:51:10 +0000
Commit: ab4f516, github.com/apache/spark/pull/5128
[SPARK-6448] Make history server log parse exceptions
Ryan Williams <ryan.blake.williams@gmail.com>
2015-03-22 11:54:23 +0000
Commit: b9fe504, github.com/apache/spark/pull/5122
[SPARK-6408] [SQL] Fix JDBCRDD filtering string literals
ypcat <ypcat6@gmail.com>, Pei-Lun Lee <pllee@appier.com>
2015-03-22 15:49:13 +0800
Commit: 9b1e1f2, github.com/apache/spark/pull/5087
[SPARK-6428][SQL] Added explicit type for all public methods for Hive module
Reynold Xin <rxin@databricks.com>
2015-03-21 14:30:04 -0700
Commit: b6090f9, github.com/apache/spark/pull/5108
[SPARK-6250][SPARK-6146][SPARK-5911][SQL] Types are now reserved words in DDL parser.
Yin Huai <yhuai@databricks.com>
2015-03-21 13:27:53 -0700
Commit: 94a102a, github.com/apache/spark/pull/5078
[SPARK-5680][SQL] Sum function on all null values, should return zero
Venkata Ramana G <ramana.gollamudihuawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-03-21 13:24:24 -0700
Commit: ee569a0, github.com/apache/spark/pull/4466
[SPARK-5320][SQL]Add statistics method at NoRelation (override super).
x1- <viva008@gmail.com>
2015-03-21 13:22:34 -0700
Commit: 52dd4b2, github.com/apache/spark/pull/5105
[SPARK-5821] [SQL] JSON CTAS command should throw error message when delete path failure
Yanbo Liang <ybliang8@gmail.com>, Yanbo Liang <yanbohappy@gmail.com>
2015-03-21 11:23:28 +0800
Commit: e5d2c37, github.com/apache/spark/pull/4610
[SPARK-6315] [SQL] Also tries the case class string parser while reading Parquet schema
Cheng Lian <lian@databricks.com>
2015-03-21 11:18:45 +0800
Commit: 937c1e5, github.com/apache/spark/pull/5034
[SPARK-5821] [SQL] ParquetRelation2 CTAS should check if delete is successful
Yanbo Liang <ybliang8@gmail.com>
2015-03-21 10:53:04 +0800
Commit: bc37c97, github.com/apache/spark/pull/5107
[SPARK-6025] [MLlib] Add helper method evaluateEachIteration to extract learning curve
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-20 17:14:09 -0700
Commit: 25e271d, github.com/apache/spark/pull/4906
[SPARK-6428][SQL] Added explicit type for all public methods in sql/core
Reynold Xin <rxin@databricks.com>
2015-03-20 15:47:07 -0700
Commit: a95043b, github.com/apache/spark/pull/5104
[SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWeights correctly
lewuathe <lewuathe@me.com>
2015-03-20 17:18:18 -0400
Commit: 257cde7, github.com/apache/spark/pull/5101
[SPARK-6309] [SQL] [MLlib] Implement MatrixUDT
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-20 17:13:18 -0400
Commit: 11e0259, github.com/apache/spark/pull/5048
[SPARK-6423][Mesos] MemoryUtils should use memoryOverhead if it's set
Jongyoul Lee <jongyoul@gmail.com>
2015-03-20 19:14:35 +0000
Commit: 49a01c7, github.com/apache/spark/pull/5099
[SPARK-5955][MLLIB] add checkpointInterval to ALS
Xiangrui Meng <meng@databricks.com>
2015-03-20 15:02:57 -0400
Commit: 6b36470, github.com/apache/spark/pull/5076
[SPARK-6096][MLlib] Add Naive Bayes load save methods in Python
Xusen Yin <yinxusen@gmail.com>
2015-03-20 14:53:59 -0400
Commit: 25636d9, github.com/apache/spark/pull/5090
[MLlib] SPARK-5954: Top by key
Shuo Xiang <shuoxiangpub@gmail.com>
2015-03-20 14:45:44 -0400
Commit: 5e6ad24, github.com/apache/spark/pull/5075
[SPARK-6095] [MLLIB] Support model save/load in Python's linear models
Yanbo Liang <ybliang8@gmail.com>
2015-03-20 14:44:21 -0400
Commit: 48866f7, github.com/apache/spark/pull/5016
[SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-20 18:43:57 +0000
Commit: a745645, github.com/apache/spark/pull/5056
[SPARK-6426][Doc]User could also point the yarn cluster config directory via YARN_CONF_DI...
WangTaoTheTonic <wangtao111@huawei.com>
2015-03-20 18:42:18 +0000
Commit: 385b2ff, github.com/apache/spark/pull/5103
[SPARK-6370][core] Documentation: Improve all 3 docs for RDD.sample
mbonaci <mbonaci@gmail.com>
2015-03-20 18:30:45 +0000
Commit: 28bcb9e, github.com/apache/spark/pull/5097
[SPARK-6428][MLlib] Added explicit type for public methods and implemented hashCode when equals is defined.
Reynold Xin <rxin@databricks.com>
2015-03-20 14:13:02 -0400
Commit: db4d317, github.com/apache/spark/pull/5102
SPARK-6338 [CORE] Use standard temp dir mechanisms in tests to avoid orphaned temp files
Sean Owen <sowen@cloudera.com>
2015-03-20 14:16:21 +0000
Commit: 6f80c3e, github.com/apache/spark/pull/5029
SPARK-5134 [BUILD] Bump default Hadoop version to 2+
Sean Owen <sowen@cloudera.com>
2015-03-20 14:14:53 +0000
Commit: d08e3eb, github.com/apache/spark/pull/5027
[SPARK-6286][Mesos][minor] Handle missing Mesos case TASK_ERROR
Jongyoul Lee <jongyoul@gmail.com>
2015-03-20 12:24:34 +0000
Commit: 116c553, github.com/apache/spark/pull/5088
Tighten up field/method visibility in Executor and made some code more clear to read.
Reynold Xin <rxin@databricks.com>
2015-03-19 22:12:01 -0400
Commit: 0745a30, github.com/apache/spark/pull/4850
[SPARK-6219] [Build] Check that Python code compiles
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-03-19 12:46:10 -0700
Commit: f17d43b, github.com/apache/spark/pull/4941
[Core][minor] remove unused `visitedStages` in `DAGScheduler.stageDependsOn`
Wenchen Fan <cloud0fan@outlook.com>
2015-03-19 15:25:32 -0400
Commit: 3b5aaa6, github.com/apache/spark/pull/5086
[SPARK-5313][Project Infra]: Create simple framework for highlighting changes introduced in a PR
Brennon York <brennon.york@capitalone.com>
2015-03-19 11:18:24 -0400
Commit: 8cb23a1, github.com/apache/spark/pull/5072
[SPARK-6291] [MLLIB] GLM toString & toDebugString
Yanbo Liang <ybliang8@gmail.com>
2015-03-19 11:10:20 -0400
Commit: dda4ded, github.com/apache/spark/pull/5038
[SPARK-5843] [API] Allowing map-side combine to be specified in Java.
mcheah <mcheah@palantir.com>
2015-03-19 08:51:49 -0400
Commit: 3c4e486, github.com/apache/spark/pull/4634
[SPARK-6402][DOC] - Remove some references to shark in docs and ec2
Pierre Borckmans <pierre.borckmans@realimpactanalytics.com>
2015-03-19 08:02:06 -0400
Commit: 797f8a0, github.com/apache/spark/pull/5083
[SPARK-4012] stop SparkContext when the exception is thrown from an infinite loop
CodingCat <zhunansjtu@gmail.com>
2015-03-18 23:48:45 -0700
Commit: 2c3f83c, github.com/apache/spark/pull/5004
[SPARK-6222][Streaming] Don't delete checkpoint data when doing pre-batch-start checkpoint
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-19 02:15:50 -0400
Commit: 645cf3f, github.com/apache/spark/pull/5008
[SPARK-6394][Core] cleanup BlockManager companion object and improve the getCacheLocs method in DAGScheduler
Wenchen Fan <cloud0fan@outlook.com>
2015-03-18 19:43:04 -0700
Commit: 540b2a4, github.com/apache/spark/pull/5043
SPARK-6085 Part. 2 Increase default value for memory overhead
Jongyoul Lee <jongyoul@gmail.com>
2015-03-18 20:54:22 -0400
Commit: 3db1387, github.com/apache/spark/pull/5065
[SPARK-6374] [MLlib] add get for GeneralizedLinearAlgo
Yuhao Yang <hhbyyh@gmail.com>
2015-03-18 13:44:37 -0400
Commit: a95ee24, github.com/apache/spark/pull/5058
[SPARK-6325] [core,yarn] Do not change target executor count when killing executors.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-18 09:18:28 -0400
Commit: 981fbaf, github.com/apache/spark/pull/5018
[SPARK-6286][minor] Handle missing Mesos case TASK_ERROR.
Iulian Dragos <jaguarul@gmail.com>
2015-03-18 09:15:33 -0400
Commit: 9d112a9, github.com/apache/spark/pull/5000
SPARK-6389 YARN app diagnostics report doesn't report NPEs
Steve Loughran <stevel@hortonworks.com>
2015-03-18 09:09:32 -0400
Commit: e09c852, github.com/apache/spark/pull/5070
[SPARK-6372] [core] Propagate --conf to child processes.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-18 09:06:57 -0400
Commit: 6205a25, github.com/apache/spark/pull/5057
[SPARK-6247][SQL] Fix resolution of ambiguous joins caused by new aliases
Michael Armbrust <michael@databricks.com>
2015-03-17 19:47:51 -0700
Commit: 3579003, github.com/apache/spark/pull/5062
[SPARK-5651][SQL] Add input64 in blacklist and add test suite for create table within backticks
watermen <qiyadong2010@gmail.com>, q00251598 <qiyadong@huawei.com>
2015-03-17 19:35:18 -0700
Commit: a6ee2f7, github.com/apache/spark/pull/4427
[SPARK-5404] [SQL] Update the default statistic number
Cheng Hao <hao.cheng@intel.com>
2015-03-17 19:32:38 -0700
Commit: 78cb08a, github.com/apache/spark/pull/4914
[SPARK-5908][SQL] Resolve UdtfsAlias when only single Alias is used
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-17 18:58:52 -0700
Commit: 5c80643, github.com/apache/spark/pull/4692
[SPARK-6383][SQL]Fixed compiler and errors in Dataframe examples
Tijo Thomas <tijoparacka@gmail.com>
2015-03-17 18:50:19 -0700
Commit: a012e08, github.com/apache/spark/pull/5068
[SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTable should be "error" instead of "append".
Yin Huai <yhuai@databricks.com>
2015-03-18 09:41:06 +0800
Commit: dc9c919, github.com/apache/spark/pull/5053
[SPARK-6330] [SQL] Add a test case for SPARK-6330
Pei-Lun Lee <pllee@appier.com>
2015-03-18 08:34:46 +0800
Commit: 4633a87, github.com/apache/spark/pull/5039
[SPARK-6226][MLLIB] add save/load in PySpark's KMeansModel
Xiangrui Meng <meng@databricks.com>
2015-03-17 12:14:40 -0700
Commit: c94d062, github.com/apache/spark/pull/5049
[SPARK-6336] LBFGS should document what convergenceTol means
lewuathe <lewuathe@me.com>
2015-03-17 12:11:57 -0700
Commit: d9f3e01, github.com/apache/spark/pull/5033
[SPARK-6313] Add config option to disable file locks/fetchFile cache to ...
nemccarthy <nathan@nemccarthy.me>
2015-03-17 09:33:11 -0700
Commit: 4cca391, github.com/apache/spark/pull/5036
[SPARK-3266] Use intermediate abstract classes to fix type erasure issues in Java APIs
Josh Rosen <joshrosen@databricks.com>
2015-03-17 09:18:57 -0700
Commit: 0f673c2, github.com/apache/spark/pull/5050
[SPARK-6365] jetty-security also needed for SPARK_PREPEND_CLASSES to work
Imran Rashid <irashid@cloudera.com>
2015-03-17 09:41:06 -0500
Commit: e9f22c6, github.com/apache/spark/pull/5052
[SPARK-6331] Load new master URL if present when recovering streaming context from checkpoint
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-17 05:31:27 -0700
Commit: c928796, github.com/apache/spark/pull/5024
[docs] [SPARK-4820] Spark build encounters "File name too long" on some encrypted filesystems
Theodore Vasiloudis <tvas@sics.se>
2015-03-17 11:25:01 +0000
Commit: e26db9be, github.com/apache/spark/pull/5041
[SPARK-6269] [CORE] Use ScalaRunTime's array methods instead of java.lang.reflect.Array in size estimation
mcheah <mcheah@palantir.com>, Justin Uang <justin.uang@gmail.com>
2015-03-17 11:20:20 +0000
Commit: 005d1c5, github.com/apache/spark/pull/4972
[SPARK-4011] tighten the visibility of the members in Master/Worker class
CodingCat <zhunansjtu@gmail.com>
2015-03-17 11:18:27 +0000
Commit: 25f3580, github.com/apache/spark/pull/4844
SPARK-6044 [CORE] RDD.aggregate() should not use the closure serializer on the zero value
Sean Owen <sowen@cloudera.com>
2015-03-16 23:58:52 -0700
Commit: b2d8c02, github.com/apache/spark/pull/5028
[SPARK-6357][GraphX] Add unapply in EdgeContext
Takeshi YAMAMURO <linguin.m.s@gmail.com>
2015-03-16 23:54:54 -0700
Commit: b3e6eca, github.com/apache/spark/pull/5047
[SQL][docs][minor] Fixed sample code in SQLContext scaladoc
Lomig Mégard <lomig.megard@gmail.com>
2015-03-16 23:52:42 -0700
Commit: 6870722, github.com/apache/spark/pull/5051
[SPARK-6299][CORE] ClassNotFoundException in standalone mode when running groupByKey with class defined in REPL
Kevin (Sangwoo) Kim <sangwookim.me@gmail.com>
2015-03-16 23:49:23 -0700
Commit: f0edeae, github.com/apache/spark/pull/5046
[SPARK-5712] [SQL] fix comment with semicolon at end
Daoyuan Wang <daoyuan.wang@intel.com>
2015-03-17 12:29:15 +0800
Commit: 9667b9f, github.com/apache/spark/pull/4500
[SPARK-6327] [PySpark] fix launch spark-submit from python
Davies Liu <davies@databricks.com>
2015-03-16 16:26:55 -0700
Commit: e3f315a, github.com/apache/spark/pull/5019
[SPARK-6077] Remove streaming tab while stopping StreamingContext
lisurprise <zhichao.li@intel.com>
2015-03-16 13:10:32 -0700
Commit: f149b8b, github.com/apache/spark/pull/4828
[SPARK-6330] Fix filesystem bug in newParquet relation
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-03-16 12:13:18 -0700
Commit: d19efed, github.com/apache/spark/pull/5020
[SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveContext instance
Cheng Hao <hao.cheng@intel.com>
2015-03-17 01:09:27 +0800
Commit: 12a345a, github.com/apache/spark/pull/4885
[SPARK-6300][Spark Core] sc.addFile(path) does not support the relative path.
DoingDone9 <799203320@qq.com>
2015-03-16 12:27:15 +0000
Commit: 00e730b, github.com/apache/spark/pull/4993
[SPARK-5922][GraphX]: Add diff(other: RDD[VertexId, VD]) in VertexRDD
Brennon York <brennon.york@capitalone.com>
2015-03-16 01:06:26 -0700
Commit: 45f4c66, github.com/apache/spark/pull/4733
[SPARK-3619] Part 2. Upgrade to Mesos 0.21 to work around MESOS-1688
Jongyoul Lee <jongyoul@gmail.com>
2015-03-15 15:46:55 +0000
Commit: aa6536f, github.com/apache/spark/pull/4361
[SPARK-6285][SQL]Remove ParquetTestData in SparkBuild.scala and in README.md
OopsOutOfMemory <victorshengli@126.com>
2015-03-15 20:44:45 +0800
Commit: 62ede53, github.com/apache/spark/pull/5032
[SPARK-5790][GraphX]: VertexRDD's won't zip properly for `diff` capability (added tests)
Brennon York <brennon.york@capitalone.com>
2015-03-14 17:38:12 +0000
Commit: c49d156, github.com/apache/spark/pull/5023
[SPARK-6329][Docs]: Minor doc changes for Mesos and TOC
Brennon York <brennon.york@capitalone.com>
2015-03-14 17:28:13 +0000
Commit: 127268b, github.com/apache/spark/pull/5022
[SPARK-6195] [SQL] Adds in-memory column type for fixed-precision decimals
Cheng Lian <lian@databricks.com>
2015-03-14 19:53:54 +0800
Commit: 5be6b0e, github.com/apache/spark/pull/4938
[SQL]Delete some duplicate code in HiveThriftServer2
ArcherShao <ArcherShao@users.noreply.github.com>, ArcherShao <shaochuan@huawei.com>
2015-03-14 08:27:18 +0000
Commit: ee15404, github.com/apache/spark/pull/5007
[SPARK-6210] [SQL] use prettyString as column name in agg()
Davies Liu <davies@databricks.com>
2015-03-14 00:43:33 -0700
Commit: b38e073, github.com/apache/spark/pull/5006
[SPARK-6317][SQL]Fixed HIVE console startup issue
vinodkc <vinod.kc.in@gmail.com>, Vinod K C <vinod.kc@huawei.com>
2015-03-14 07:17:54 +0800
Commit: e360d5e, github.com/apache/spark/pull/5011
[SPARK-6285] [SQL] Removes unused ParquetTestData and duplicated TestGroupWriteSupport
Cheng Lian <lian@databricks.com>
2015-03-14 07:09:53 +0800
Commit: cdc34ed, github.com/apache/spark/pull/5010
[SPARK-4600][GraphX]: org.apache.spark.graphx.VertexRDD.diff does not work
Brennon York <brennon.york@capitalone.com>
2015-03-13 18:48:31 +0000
Commit: b943f5d, github.com/apache/spark/pull/5015
[SPARK-6278][MLLIB] Mention the change of objective in linear regression
Xiangrui Meng <meng@databricks.com>
2015-03-13 10:27:28 -0700
Commit: 7f13434, github.com/apache/spark/pull/4978
[SPARK-6252] [mllib] Added getLambda to Scala NaiveBayes
Joseph K. Bradley <joseph.kurata.bradley@gmail.com>, Joseph K. Bradley <joseph@databricks.com>
2015-03-13 10:26:09 -0700
Commit: dc4abd4, github.com/apache/spark/pull/4969
[CORE][minor] remove unnecessary ClassTag in `DAGScheduler`
Wenchen Fan <cloud0fan@outlook.com>
2015-03-13 14:08:56 +0000
Commit: ea3d2ee, github.com/apache/spark/pull/4992
[SPARK-6197][CORE] handle json exception when history file not finished writing
Zhang, Liye <liye.zhang@intel.com>
2015-03-13 13:59:54 +0000
Commit: 9048e81, github.com/apache/spark/pull/4927
[SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide
Cheng Lian <lian@databricks.com>
2015-03-13 21:34:50 +0800
Commit: 69ff8e8, github.com/apache/spark/pull/5001
[SPARK-5845][Shuffle] Time to cleanup spilled shuffle files not included in shuffle write time
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-03-13 13:21:04 +0000
Commit: 0af9ea7, github.com/apache/spark/pull/4965
HOTFIX: Changes to release script.
Patrick Wendell <patrick@databricks.com>
2015-03-12 18:36:17 -0700
Commit: 3980ebd
[mllib] [python] Add LassoModel to __all__ in regression.py
Joseph K. Bradley <joseph@databricks.com>
2015-03-12 16:46:29 -0700
Commit: 17c309c, github.com/apache/spark/pull/4970
[SPARK-4588] ML Attributes
Xiangrui Meng <meng@databricks.com>, Sean Owen <sowen@cloudera.com>
2015-03-12 16:34:56 -0700
Commit: a4b2716, github.com/apache/spark/pull/4925
[SPARK-6268][MLlib] KMeans parameter getter methods
Yuhao Yang <hhbyyh@gmail.com>
2015-03-12 15:17:46 -0700
Commit: fb4787c, github.com/apache/spark/pull/4974
[build] [hotfix] Fix make-distribution.sh for Scala 2.11.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-12 19:16:58 +0000
Commit: 8f1bc79, github.com/apache/spark/pull/5002
[SPARK-6275][Documentation]Miss toDF() function in docs/sql-programming-guide.md
zzcclp <xm_zzc@sina.com>
2015-03-12 15:07:15 +0000
Commit: 304366c, github.com/apache/spark/pull/4977
[docs] [SPARK-6306] Readme points to dead link
Theodore Vasiloudis <tvas@sics.se>
2015-03-12 15:01:33 +0000
Commit: 4e47d54, github.com/apache/spark/pull/4999
[SPARK-5814][MLLIB][GRAPHX] Remove JBLAS from runtime
Xiangrui Meng <meng@databricks.com>
2015-03-12 01:39:04 -0700
Commit: 0cba802, github.com/apache/spark/pull/4699
[SPARK-6294] fix hang when call take() in JVM on PythonRDD
Davies Liu <davies@databricks.com>
2015-03-12 01:34:38 -0700
Commit: 712679a, github.com/apache/spark/pull/4987
[SPARK-6296] [SQL] Added equals to Column
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-03-12 00:55:26 -0700
Commit: 25b71d8, github.com/apache/spark/pull/4988
BUILD: Adding more known contributor names
Patrick Wendell <patrick@databricks.com>
2015-03-11 22:24:08 -0700
Commit: e921a66
[SPARK-6128][Streaming][Documentation] Updates to Spark Streaming Programming Guide
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-11 18:48:21 -0700
Commit: cd3b68d, github.com/apache/spark/pull/4956
[SPARK-6274][Streaming][Examples] Added streaming + sql examples.
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-11 11:19:51 -0700
Commit: 51a79a7, github.com/apache/spark/pull/4975
SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception
Sean Owen <sowen@cloudera.com>
2015-03-11 14:09:09 +0000
Commit: 55c4831, github.com/apache/spark/pull/4971
SPARK-3642. Document the nuances of shared variables.
Sandy Ryza <sandy@cloudera.com>
2015-03-11 13:22:05 +0000
Commit: 2d87a41, github.com/apache/spark/pull/2490
[SPARK-4423] Improve foreach() documentation to avoid confusion between local- and cluster-mode behavior
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-03-11 13:20:15 +0000
Commit: 548643a, github.com/apache/spark/pull/4696
[SPARK-6228] [network] Move SASL classes from network/shuffle to network...
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-11 13:16:22 +0000
Commit: 5b335bd, github.com/apache/spark/pull/4953
SPARK-6225 [CORE] [SQL] [STREAMING] Resolve most build warnings, 1.3.0 edition
Sean Owen <sowen@cloudera.com>
2015-03-11 13:15:19 +0000
Commit: 6e94c4e, github.com/apache/spark/pull/4950
[SPARK-6279][Streaming]In KafkaRDD.scala, Miss expressions flag "s" at logging string
zzcclp <xm_zzc@sina.com>
2015-03-11 12:22:24 +0000
Commit: ec30c17, github.com/apache/spark/pull/4979
[SQL][Minor] fix typo in comments
Hongbo Liu <liuhb86@gmail.com>
2015-03-11 12:18:24 +0000
Commit: 40f4979, github.com/apache/spark/pull/4976
[MINOR] [DOCS] Fix map -> mapToPair in Streaming Java example
Sean Owen <sowen@cloudera.com>
2015-03-11 12:16:32 +0000
Commit: 35b2564, github.com/apache/spark/pull/4967
[SPARK-4924] Add a library for launching Spark jobs programmatically.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-11 01:03:01 -0700
Commit: 517975d, github.com/apache/spark/pull/3916
[SPARK-5986][MLLib] Add save/load for k-means
Xusen Yin <yinxusen@gmail.com>
2015-03-11 00:24:55 -0700
Commit: 2d4e00e, github.com/apache/spark/pull/4951
[SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide
Michael Armbrust <michael@databricks.com>
2015-03-10 18:13:09 -0700
Commit: 2672374, github.com/apache/spark/pull/4958
Minor doc: Remove the extra blank line in data types javadoc.
Reynold Xin <rxin@databricks.com>
2015-03-10 17:25:04 -0700
Commit: 74fb433, github.com/apache/spark/pull/4955
[SPARK-6186] [EC2] Make Tachyon version configurable in EC2 deployment script
cheng chang <myairia@gmail.com>
2015-03-10 11:02:12 +0000
Commit: 7c7d2d5, github.com/apache/spark/pull/4901
[SPARK-6191] [EC2] Generalize ability to download libs
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-03-10 10:58:31 +0000
Commit: d14df06, github.com/apache/spark/pull/4919
[SPARK-6087][CORE] Provide actionable exception if Kryo buffer is not large enough
Lev Khomich <levkhomich@gmail.com>
2015-03-10 10:55:42 +0000
Commit: c4c4b07, github.com/apache/spark/pull/4947
[SPARK-6177][MLlib]Add note in LDA example to remind possible coalesce
Yuhao Yang <hhbyyh@gmail.com>
2015-03-10 10:51:44 +0000
Commit: 9a0272f, github.com/apache/spark/pull/4899
[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
Davies Liu <davies@databricks.com>
2015-03-09 16:24:06 -0700
Commit: 8767565, github.com/apache/spark/pull/4923
[SPARK-5310][Doc] Update SQL Programming Guide to include DataFrames.
Reynold Xin <rxin@databricks.com>
2015-03-09 16:16:16 -0700
Commit: 3cac199, github.com/apache/spark/pull/4954
[Docs] Replace references to SchemaRDD with DataFrame
Reynold Xin <rxin@databricks.com>
2015-03-09 13:29:19 -0700
Commit: 70f8814, github.com/apache/spark/pull/4952
[EC2] [SPARK-6188] Instance types can be mislabeled when re-starting cluster with default arguments
Theodore Vasiloudis <thvasilo@users.noreply.github.com>, Theodore Vasiloudis <tvas@sics.se>
2015-03-09 14:16:07 +0000
Commit: f7c7992, github.com/apache/spark/pull/4916
[GraphX] Improve LiveJournalPageRank example
Jacky Li <jacky.likun@huawei.com>
2015-03-08 19:47:35 +0000
Commit: 55b1b32, github.com/apache/spark/pull/4917
SPARK-6205 [CORE] UISeleniumSuite fails for Hadoop 2.x test with NoClassDefFoundError
Sean Owen <sowen@cloudera.com>
2015-03-08 14:09:40 +0000
Commit: f16b7b0, github.com/apache/spark/pull/4933
[SPARK-6193] [EC2] Push group filter up to EC2
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-03-08 14:01:26 +0000
Commit: 52ed7da, github.com/apache/spark/pull/4922
[SPARK-5641] [EC2] Allow spark_ec2.py to copy arbitrary files to cluster
Florian Verhein <florian.verhein@gmail.com>
2015-03-07 12:56:59 +0000
Commit: 334c5bd, github.com/apache/spark/pull/4583
[Minor]fix the wrong description
WangTaoTheTonic <wangtao111@huawei.com>
2015-03-07 12:35:26 +0000
Commit: 729c05b, github.com/apache/spark/pull/4936
[EC2] Reorder print statements on termination
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-03-07 12:33:41 +0000
Commit: 2646794, github.com/apache/spark/pull/4932
Fix python typo (+ Scala, Java typos)
RobertZK <technoguyrob@gmail.com>, Robert Krzyzanowski <technoguyrob@gmail.com>
2015-03-07 00:16:50 +0000
Commit: 48a723c, github.com/apache/spark/pull/4840
[SPARK-6178][Shuffle] Removed unused imports
Vinod K C <vinod.kchuawei.com>, Vinod K C <vinod.kc@huawei.com>
2015-03-06 14:43:09 +0000
Commit: dba0b2e, github.com/apache/spark/pull/4900
[Minor] Resolve sbt warnings: postfix operator second should be enabled
GuoQiang Li <witgo@qq.com>
2015-03-06 13:20:20 +0000
Commit: 05cb6b3, github.com/apache/spark/pull/4908
[core] [minor] Don't pollute source directory when running UtilsSuite.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-06 09:43:24 +0000
Commit: cd7594c, github.com/apache/spark/pull/4921
[CORE, DEPLOY][minor] align arguments order with docs of worker
Zhang, Liye <liye.zhang@intel.com>
2015-03-06 09:34:07 +0000
Commit: d8b3da9, github.com/apache/spark/pull/4924
[SQL] Make Strategies a public developer API
Michael Armbrust <michael@databricks.com>
2015-03-05 14:50:25 -0800
Commit: eb48fd6, github.com/apache/spark/pull/4920
[SPARK-6163][SQL] jsonFile should be backed by the data source API
Yin Huai <yhuai@databricks.com>
2015-03-05 14:49:44 -0800
Commit: 1b4bb25, github.com/apache/spark/pull/4896
[SPARK-6145][SQL] fix ORDER BY on nested fields
Wenchen Fan <cloud0fan@outlook.com>, Michael Armbrust <michael@databricks.com>
2015-03-05 14:49:01 -0800
Commit: 5873c71, github.com/apache/spark/pull/4918
[SPARK-6175] Fix standalone executor log links when ephemeral ports or SPARK_PUBLIC_DNS are used
Josh Rosen <joshrosen@databricks.com>
2015-03-05 12:04:00 -0800
Commit: 424a86a, github.com/apache/spark/pull/4903
[SPARK-6090][MLLIB] add a basic BinaryClassificationMetrics to PySpark/MLlib
Xiangrui Meng <meng@databricks.com>
2015-03-05 11:50:09 -0800
Commit: 0bfacd5, github.com/apache/spark/pull/4863
SPARK-6182 [BUILD] spark-parent pom needs to be published for both 2.10 and 2.11
Sean Owen <sowen@cloudera.com>
2015-03-05 11:31:48 -0800
Commit: c9cfba0, github.com/apache/spark/pull/4912
[SPARK-6153] [SQL] promote guava dep for hive-thriftserver
Daoyuan Wang <daoyuan.wang@intel.com>
2015-03-05 16:35:17 +0800
Commit: e06c7df, github.com/apache/spark/pull/4884
SPARK-5143 [BUILD] [WIP] spark-network-yarn 2.11 depends on spark-network-shuffle 2.10
Sean Owen <sowen@cloudera.com>
2015-03-04 21:00:51 -0800
Commit: 7ac072f, github.com/apache/spark/pull/4876
[SPARK-6149] [SQL] [Build] Excludes Guava 15 referenced by jackson-module-scala_2.10
Cheng Lian <lian@databricks.com>
2015-03-04 20:52:58 -0800
Commit: 1aa90e3, github.com/apache/spark/pull/4890
[SPARK-6144] [core] Fix addFile when source files are on "hdfs:"
Marcelo Vanzin <vanzin@cloudera.com>, trystanleftwich <trystan@atscale.com>
2015-03-04 12:58:39 -0800
Commit: 3a35a0d, github.com/apache/spark/pull/4894
[SPARK-6107][CORE] Display inprogress application information for event log history for standalone mode
Zhang, Liye <liye.zhang@intel.com>
2015-03-04 12:28:27 +0000
Commit: f6773ed, github.com/apache/spark/pull/4848
[SPARK-6134][SQL] Fix wrong datatype for casting FloatType and default LongType value in defaultPrimitive
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-04 20:23:43 +0800
Commit: aef8a84, github.com/apache/spark/pull/4870
[SPARK-6136] [SQL] Removed JDBC integration tests which depends on docker-client
Cheng Lian <lian@databricks.com>
2015-03-04 19:39:02 +0800
Commit: 76b472f, github.com/apache/spark/pull/4872
[SPARK-3355][Core]: Allow running maven tests in run-tests
Brennon York <brennon.york@capitalone.com>
2015-03-04 11:02:33 +0000
Commit: 418f38d, github.com/apache/spark/pull/4734
SPARK-6085 Increase default value for memory overhead
tedyu <yuzhihong@gmail.com>
2015-03-04 11:00:52 +0000
Commit: 8d3e241, github.com/apache/spark/pull/4836
[SPARK-6141][MLlib] Upgrade Breeze from 0.10 to 0.11 to fix convergence bug
Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>, DB Tsai <dbtsai@dbtsai.com>
2015-03-03 23:52:02 -0800
Commit: 76e20a0, github.com/apache/spark/pull/4879
[SPARK-6132][HOTFIX] ContextCleaner InterruptedException should be quiet
Andrew Or <andrew@databricks.com>
2015-03-03 20:49:45 -0800
Commit: d334bfb, github.com/apache/spark/pull/4882
[SPARK-5949] HighlyCompressedMapStatus needs more classes registered w/ kryo
Imran Rashid <irashid@cloudera.com>
2015-03-03 15:33:19 -0800
Commit: 1f1fccc, github.com/apache/spark/pull/4877
[SPARK-6133] Make sc.stop() idempotent
Andrew Or <andrew@databricks.com>
2015-03-03 15:09:57 -0800
Commit: 6c20f35, github.com/apache/spark/pull/4871
[SPARK-6132] ContextCleaner race condition across SparkContexts
Andrew Or <andrew@databricks.com>
2015-03-03 13:44:05 -0800
Commit: fe63e82, github.com/apache/spark/pull/4869
SPARK-1911 [DOCS] Warn users if their assembly jars are not built with Java 6
Sean Owen <sowen@cloudera.com>
2015-03-03 13:40:11 -0800
Commit: e750a6b, github.com/apache/spark/pull/4874
Revert "[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file"
Andrew Or <andrew@databricks.com>
2015-03-03 13:03:52 -0800
Commit: 9af0017
[SPARK-6138][CORE][minor] enhance the `toArray` method in `SizeTrackingVector`
Wenchen Fan <cloud0fan@outlook.com>
2015-03-03 12:12:23 +0000
Commit: e359794, github.com/apache/spark/pull/4825
[SPARK-6118] making package name of deploy.worker.CommandUtils and deploy.CommandUtilsSuite consistent
CodingCat <zhunansjtu@gmail.com>
2015-03-03 10:32:57 +0000
Commit: 975643c, github.com/apache/spark/pull/4856
BUILD: Minor tweaks to internal build scripts
Patrick Wendell <patrick@databricks.com>
2015-03-03 00:38:12 -0800
Commit: 0c9a8ea
HOTFIX: Bump HBase version in MapR profiles.
Patrick Wendell <patrick@databricks.com>
2015-03-03 01:38:07 -0800
Commit: 165ff36
[SPARK-5537][MLlib][Docs] Add user guide for multinomial logistic regression
DB Tsai <dbtsai@alpinenow.com>
2015-03-02 22:37:12 -0800
Commit: b196056, github.com/apache/spark/pull/4866
[SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save
Joseph K. Bradley <joseph@databricks.com>
2015-03-02 22:33:51 -0800
Commit: c2fe3a6, github.com/apache/spark/pull/4864
[SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib
Xiangrui Meng <meng@databricks.com>
2015-03-02 22:27:01 -0800
Commit: 7e53a79, github.com/apache/spark/pull/4854
[SPARK-5310][SQL] Fixes to Docs and Datasources API
Reynold Xin <rxin@databricks.com>, Michael Armbrust <michael@databricks.com>
2015-03-02 22:14:08 -0800
Commit: 54d1968, github.com/apache/spark/pull/4868
[SPARK-5950][SQL]Insert array into a metastore table saved as parquet should work when using datasource api
Yin Huai <yhuai@databricks.com>
2015-03-02 19:31:55 -0800
Commit: 1259994, github.com/apache/spark/pull/4826
[SPARK-6127][Streaming][Docs] Add Kafka to Python api docs
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-02 18:40:46 -0800
Commit: 9eb22ec, github.com/apache/spark/pull/4860
[SPARK-5537] Add user guide for multinomial logistic regression
Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>
2015-03-02 18:10:50 -0800
Commit: 9d6c5ae, github.com/apache/spark/pull/4801
[SPARK-6121][SQL][MLLIB] simpleString for UDT
Xiangrui Meng <meng@databricks.com>
2015-03-02 17:14:34 -0800
Commit: 2db6a85, github.com/apache/spark/pull/4858
[SPARK-4777][CORE] Some block memory after unrollSafely not count into used memory(memoryStore.entrys or unrollMemory)
hushan[胡珊] <hushan@xiaomi.com>
2015-03-02 16:53:54 -0800
Commit: e3a88d1, github.com/apache/spark/pull/3629
[SPARK-6048] SparkConf should not translate deprecated configs on set
Andrew Or <andrew@databricks.com>
2015-03-02 16:36:42 -0800
Commit: 258d154, github.com/apache/spark/pull/4799
[SPARK-6066] Make event log format easier to parse
Andrew Or <andrew@databricks.com>
2015-03-02 16:34:32 -0800
Commit: 6776cb3, github.com/apache/spark/pull/4821
[SPARK-6082] [SQL] Provides better error message for malformed rows when caching tables
Cheng Lian <lian@databricks.com>
2015-03-02 16:18:00 -0800
Commit: 1a49496, github.com/apache/spark/pull/4842
[SPARK-6114][SQL] Avoid metastore conversions before plan is resolved
Michael Armbrust <michael@databricks.com>
2015-03-02 16:10:54 -0800
Commit: 8223ce6, github.com/apache/spark/pull/4855
[SPARK-5522] Accelerate the History Server start
guliangliang <guliangliang@qiyi.com>
2015-03-02 15:33:23 -0800
Commit: 26c1c56, github.com/apache/spark/pull/4525
[SPARK-6050] [yarn] Relax matching of vcore count in received containers.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-02 16:41:43 -0600
Commit: 6b348d9, github.com/apache/spark/pull/4818
[SPARK-6040][SQL] Fix the percent bug in tablesample
q00251598 <qiyadong@huawei.com>
2015-03-02 13:16:29 -0800
Commit: 582e5a2, github.com/apache/spark/pull/4789
[Minor] Fix doc typo for describing primitiveTerm effectiveness condition
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-02 13:11:17 -0800
Commit: 3f9def8, github.com/apache/spark/pull/4762
SPARK-5390 [DOCS] Encourage users to post on Stack Overflow in Community Docs
Sean Owen <sowen@cloudera.com>
2015-03-02 21:10:08 +0000
Commit: 0b472f6, github.com/apache/spark/pull/4843
[DOCS] Refactored Dataframe join comment to use correct parameter ordering
Paul Power <paul.power@peerside.com>
2015-03-02 13:08:47 -0800
Commit: d9a8bae, github.com/apache/spark/pull/4847
[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark
Yanbo Liang <ybliang8@gmail.com>
2015-03-02 10:17:24 -0800
Commit: af2effd, github.com/apache/spark/pull/4831
aggregateMessages example in graphX doc
DEBORAH SIEGEL <deborahsiegel@DEBORAHs-MacBook-Pro.local>
2015-03-02 10:15:32 -0800
Commit: e7d8ae4, github.com/apache/spark/pull/4853
[SPARK-5741][SQL] Support the path contains comma in HiveContext
q00251598 <qiyadong@huawei.com>
2015-03-02 10:13:11 -0800
Commit: 9ce12aa, github.com/apache/spark/pull/4532
[SPARK-6111] Fixed usage string in documentation.
Kenneth Myers <myerske@us.ibm.com>
2015-03-02 17:25:24 +0000
Commit: 95ac68b, github.com/apache/spark/pull/4852
[SPARK-6052][SQL]In JSON schema inference, we should always set containsNull of an ArrayType to true
Yin Huai <yhuai@databricks.com>
2015-03-02 23:18:07 +0800
Commit: 3efd8bb, github.com/apache/spark/pull/4806
[SPARK-6073][SQL] Need to refresh metastore cache after append data in CreateMetastoreDataSourceAsSelect
Yin Huai <yhuai@databricks.com>
2015-03-02 22:42:18 +0800
Commit: 39a54b4, github.com/apache/spark/pull/4824
[SPARK-6103][Graphx]remove unused class to import in EdgeRDDImpl
Lianhui Wang <lianhuiwang09@gmail.com>
2015-03-02 09:06:56 +0000
Commit: 49c7a8f, github.com/apache/spark/pull/4846
SPARK-3357 [CORE] Internal log messages should be set at DEBUG level instead of INFO
Sean Owen <sowen@cloudera.com>
2015-03-02 08:51:03 +0000
Commit: 948c239, github.com/apache/spark/pull/4838
[Streaming][Minor]Fix some error docs in streaming examples
Saisai Shao <saisai.shao@intel.com>
2015-03-02 08:49:19 +0000
Commit: d8fb40e, github.com/apache/spark/pull/4837
[SPARK-6083] [MLLib] [DOC] Make Python API example consistent in NaiveBayes
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-01 16:28:15 -0800
Commit: 3f00bb3, github.com/apache/spark/pull/4834
[SPARK-6053][MLLIB] support save/load in PySpark's ALS
Xiangrui Meng <meng@databricks.com>
2015-03-01 16:26:57 -0800
Commit: aedbbaa, github.com/apache/spark/pull/4811
[SPARK-6074] [sql] Package pyspark sql bindings.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-01 11:05:10 +0000
Commit: fd8d283, github.com/apache/spark/pull/4822
[SPARK-6075] Fix bug that caused lost accumulator updates: do not store WeakReferences in localAccums map
Josh Rosen <joshrosen@databricks.com>
2015-02-28 22:51:01 -0800
Commit: 2df5f1f, github.com/apache/spark/pull/4835
SPARK-5984: Fix TimSort bug that causes ArrayOutOfBoundsException
Evan Yu <ehotou@gmail.com>
2015-02-28 18:55:34 -0800
Commit: 643300a, github.com/apache/spark/pull/4804
SPARK-1965 [WEBUI] Spark UI throws NPE on trying to load the app page for non-existent app
Sean Owen <sowen@cloudera.com>
2015-02-28 15:34:08 +0000
Commit: 86fcdae, github.com/apache/spark/pull/4777
SPARK-5983 [WEBUI] Don't respond to HTTP TRACE in HTTP-based UIs
Sean Owen <sowen@cloudera.com>
2015-02-28 15:23:59 +0000
Commit: f91298e, github.com/apache/spark/pull/4765
SPARK-6063 MLlib doesn't pass mvn scalastyle check due to UTF chars in LDAModel.scala
Michael Griffiths <msjgriffiths@gmail.com>, Griffiths, Michael (NYC-RPM) <michael.griffiths@reprisemedia.com>
2015-02-28 14:47:39 +0000
Commit: b36b1bc, github.com/apache/spark/pull/4815
[SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table
Cheng Lian <lian@databricks.com>, Cheng Lian <liancheng@users.noreply.github.com>, Yin Huai <yhuai@databricks.com>
2015-02-28 21:15:43 +0800
Commit: e6003f0, github.com/apache/spark/pull/4792
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-02-27 23:10:09 -0800
Commit: 9168259, github.com/apache/spark/pull/1128
[SPARK-5979][SPARK-6032] Smaller safer --packages fix
Burak Yavuz <brkyvz@gmail.com>
2015-02-27 22:59:35 -0800
Commit: 6d8e5fb, github.com/apache/spark/pull/4802
[SPARK-6070] [yarn] Remove unneeded classes from shuffle service jar.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-27 22:44:11 -0800
Commit: dba08d1, github.com/apache/spark/pull/4820
[SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
Davies Liu <davies@databricks.com>
2015-02-27 20:07:17 -0800
Commit: e0e64ba, github.com/apache/spark/pull/4808
[SPARK-5751] [SQL] Sets SPARK_HOME as SPARK_PID_DIR when running Thrift server test suites
Cheng Lian <lian@databricks.com>
2015-02-28 08:41:49 +0800
Commit: 8c468a6, github.com/apache/spark/pull/4758
[Streaming][Minor] Remove useless type signature of Java Kafka direct stream API
Saisai Shao <saisai.shao@intel.com>
2015-02-27 13:01:42 -0800
Commit: 5f7f3b9, github.com/apache/spark/pull/4817
[SPARK-4587] [mllib] [docs] Fixed save,load calls in ML guide examples
Joseph K. Bradley <joseph@databricks.com>
2015-02-27 13:00:36 -0800
Commit: d17cb2b, github.com/apache/spark/pull/4816
[SPARK-6059][Yarn] Add volatile to ApplicationMaster's reporterThread and allocator
zsxwing <zsxwing@gmail.com>
2015-02-27 13:33:39 +0000
Commit: 57566d0, github.com/apache/spark/pull/4814
[SPARK-6058][Yarn] Log the user class exception in ApplicationMaster
zsxwing <zsxwing@gmail.com>
2015-02-27 13:31:46 +0000
Commit: e747e98, github.com/apache/spark/pull/4813
[SPARK-6036][CORE] avoid race condition between eventlogListener and akka actor system
Zhang, Liye <liye.zhang@intel.com>
2015-02-26 23:11:43 -0800
Commit: 8cd1692, github.com/apache/spark/pull/4785
fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode
许鹏 <peng.xu@fraudmetrix.cn>
2015-02-26 23:05:56 -0800
Commit: 0375a41, github.com/apache/spark/pull/4803
[SPARK-6046] Privatize SparkConf.translateConfKey
Andrew Or <andrew@databricks.com>
2015-02-26 22:39:46 -0800
Commit: 7c99a01, github.com/apache/spark/pull/4797
SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.
Lukasz Jastrzebski <lukasz.jastrzebski@gmail.com>
2015-02-26 22:38:06 -0800
Commit: 4a8a0a8, github.com/apache/spark/pull/4778
[SPARK-5495][UI] Add app and driver kill function in master web UI
jerryshao <saisai.shao@intel.com>
2015-02-26 22:36:48 -0800
Commit: 67595eb, github.com/apache/spark/pull/4288
[SPARK-5771][UI][hotfix] Change Requested Cores into * if default cores is not set
jerryshao <saisai.shao@intel.com>
2015-02-26 22:35:43 -0800
Commit: 12135e9, github.com/apache/spark/pull/4800
[SPARK-6024][SQL] When a data source table has too many columns, its schema cannot be stored in metastore.
Yin Huai <yhuai@databricks.com>
2015-02-26 20:46:05 -0800
Commit: 5e5ad65, github.com/apache/spark/pull/4795
[SPARK-6037][SQL] Avoiding duplicate Parquet schema merging
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-27 11:06:47 +0800
Commit: 4ad5153, github.com/apache/spark/pull/4786
[SPARK-5529][CORE]Add expireDeadHosts in HeartbeatReceiver
Hong Shen <hongshen@tencent.com>
2015-02-26 18:43:23 -0800
Commit: 18f2098, github.com/apache/spark/pull/4363
SPARK-4579 [WEBUI] Scheduling Delay appears negative
Sean Owen <sowen@cloudera.com>
2015-02-26 17:35:09 -0800
Commit: fbc4694, github.com/apache/spark/pull/4796
SPARK-6045 RecordWriter should be checked against null in PairRDDFunctio...
tedyu <yuzhihong@gmail.com>
2015-02-26 23:26:07 +0000
Commit: e60ad2f, github.com/apache/spark/pull/4794
[SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode
mohit.goyal <mohit.goyal@guavus.com>
2015-02-26 14:27:47 -0800
Commit: b38dec2, github.com/apache/spark/pull/4730
Add a note for context termination for History server on Yarn
moussa taifi <moutai10@gmail.com>
2015-02-26 14:19:43 -0800
Commit: c871e2d, github.com/apache/spark/pull/4721
SPARK-4300 [CORE] Race condition during SparkWorker shutdown
Sean Owen <sowen@cloudera.com>
2015-02-26 14:08:56 -0800
Commit: 3fb53c0, github.com/apache/spark/pull/4787
[SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM
Cheolsoo Park <cheolsoop@netflix.com>
2015-02-26 13:53:49 -0800
Commit: 5f3238b, github.com/apache/spark/pull/4773
[SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-26 13:46:07 -0800
Commit: aa63f63, github.com/apache/spark/pull/4779
[SPARK-3562]Periodic cleanup event logs
xukun 00228947 <xukun.xu@huawei.com>
2015-02-26 13:24:00 -0800
Commit: 8942b52, github.com/apache/spark/pull/4214
Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.
Li Zhihui <zhihui.li@intel.com>
2015-02-26 13:07:07 -0800
Commit: 10094a5, github.com/apache/spark/pull/4781
SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output
Sean Owen <sowen@cloudera.com>
2015-02-26 12:56:54 -0800
Commit: cd5c8d7, github.com/apache/spark/pull/4788
[SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe
Davies Liu <davies@databricks.com>
2015-02-26 11:54:17 -0800
Commit: 7fa960e, github.com/apache/spark/pull/4776
[SPARK-6004][MLlib] Pick the best model when training GradientBoostedTrees with validation
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-26 10:51:47 -0800
Commit: cfff397, github.com/apache/spark/pull/4763
[SPARK-6007][SQL] Add numRows param in DataFrame.show()
Jacky Li <jacky.likun@huawei.com>
2015-02-26 10:40:58 -0800
Commit: 2358657, github.com/apache/spark/pull/4767
[SPARK-5801] [core] Avoid creating nested directories.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-26 17:35:03 +0000
Commit: df3d559, github.com/apache/spark/pull/4747
[SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true
Yin Huai <yhuai@databricks.com>
2015-02-27 01:01:32 +0800
Commit: 192e42a, github.com/apache/spark/pull/4775
[SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2
Yin Huai <yhuai@databricks.com>
2015-02-26 22:39:49 +0800
Commit: f02394d, github.com/apache/spark/pull/4782
[SPARK-5914] to run spark-submit requiring only user perm on windows
Judy Nash <judynash@microsoft.com>
2015-02-26 11:14:37 +0000
Commit: 51a6f90, github.com/apache/spark/pull/4742
[SPARK-5976][MLLIB] Add partitioner to factors returned by ALS
Xiangrui Meng <meng@databricks.com>
2015-02-25 23:43:29 -0800
Commit: e43139f, github.com/apache/spark/pull/4748
[SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/load, Python GBT
Joseph K. Bradley <joseph@databricks.com>
2015-02-25 16:13:17 -0800
Commit: d20559b, github.com/apache/spark/pull/4750
[SPARK-1182][Docs] Sort the configuration parameters in configuration.md
Brennon York <brennon.york@capitalone.com>
2015-02-25 16:12:56 -0800
Commit: 46a044a, github.com/apache/spark/pull/3863
[SPARK-5926] [SQL] make DataFrame.explain leverage queryExecution.logical
Yanbo Liang <ybliang8@gmail.com>
2015-02-25 15:37:13 -0800
Commit: 41e2e5a, github.com/apache/spark/pull/4707
[SPARK-5999][SQL] Remove duplicate Literal matching block
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-25 15:22:33 -0800
Commit: 12dbf98, github.com/apache/spark/pull/4760
[SPARK-6010] [SQL] Merging compatible Parquet schemas before computing splits
Cheng Lian <lian@databricks.com>
2015-02-25 15:15:22 -0800
Commit: e0fdd46, github.com/apache/spark/pull/4768
[SPARK-5944] [PySpark] fix version in Python API docs
Davies Liu <davies@databricks.com>
2015-02-25 15:13:34 -0800
Commit: f3f4c87, github.com/apache/spark/pull/4731
[SPARK-5982] Remove incorrect Local Read Time Metric
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-25 14:55:24 -0800
Commit: 838a480, github.com/apache/spark/pull/4749
[SPARK-1955][GraphX]: VertexRDD can incorrectly assume index sharing
Brennon York <brennon.york@capitalone.com>
2015-02-25 14:11:12 -0800
Commit: 9f603fc, github.com/apache/spark/pull/4705
[SPARK-5970][core] Register directory created in getOrCreateLocalRootDirs for automatic deletion.
Milan Straka <fox@ucw.cz>
2015-02-25 21:33:34 +0000
Commit: a777c65, github.com/apache/spark/pull/4759
SPARK-5930 [DOCS] Documented default of spark.shuffle.io.retryWait is confusing
Sean Owen <sowen@cloudera.com>
2015-02-25 12:20:44 -0800
Commit: 7d8e6a2, github.com/apache/spark/pull/4769
[SPARK-5996][SQL] Fix specialized outbound conversions
Michael Armbrust <michael@databricks.com>
2015-02-25 10:13:40 -0800
Commit: f84c799, github.com/apache/spark/pull/4757
[SPARK-5771] Number of Cores in Completed Applications of Standalone Master Web Page always be 0 if sc.stop() is called
guliangliang <guliangliang@qiyi.com>
2015-02-25 14:48:02 +0000
Commit: dd077ab, github.com/apache/spark/pull/4567
[GraphX] fixing 3 typos in the graphx programming guide
Benedikt Linse <benedikt.linse@gmail.com>
2015-02-25 14:46:17 +0000
Commit: 5b8480e, github.com/apache/spark/pull/4766
[SPARK-5666][streaming][MQTT streaming] some trivial fixes
prabs <prabsmails@gmail.com>, Prabeesh K <prabsmails@gmail.com>
2015-02-25 14:37:35 +0000
Commit: d51ed26, github.com/apache/spark/pull/4178
[SPARK-5994] [SQL] Python DataFrame documentation fixes
Davies Liu <davies@databricks.com>
2015-02-24 20:51:55 -0800
Commit: d641fbb, github.com/apache/spark/pull/4756
[SPARK-5286][SQL] SPARK-5286 followup
Yin Huai <yhuai@databricks.com>
2015-02-24 19:51:36 -0800
Commit: 769e092, github.com/apache/spark/pull/4755
[SPARK-5993][Streaming][Build] Fix assembly jar location of kafka-assembly
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-24 19:10:37 -0800
Commit: 922b43b, github.com/apache/spark/pull/4753
[SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
Reynold Xin <rxin@databricks.com>
2015-02-24 18:59:23 -0800
Commit: fba11c2, github.com/apache/spark/pull/4752
[SPARK-5904][SQL] DataFrame Java API test suites.
Reynold Xin <rxin@databricks.com>
2015-02-24 18:51:41 -0800
Commit: 53a1ebf, github.com/apache/spark/pull/4751
[SPARK-5751] [SQL] [WIP] Revamped HiveThriftServer2Suite for robustness
Cheng Lian <lian@databricks.com>
2015-02-25 08:34:55 +0800
Commit: f816e73, github.com/apache/spark/pull/4720
[SPARK-5436] [MLlib] Validate GradientBoostedTrees using runWithValidation
MechCoder <manojkumarsivaraj334@gmail.com>
2015-02-24 15:13:22 -0800
Commit: 2a0fe34, github.com/apache/spark/pull/4677
[SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
Davies Liu <davies@databricks.com>
2015-02-24 14:50:00 -0800
Commit: da505e5, github.com/apache/spark/pull/4745
[SPARK-5952][SQL] Lock when using hive metastore client
Michael Armbrust <michael@databricks.com>
2015-02-24 13:39:29 -0800
Commit: a2b9137, github.com/apache/spark/pull/4746
[Spark-5708] Add Slf4jSink to Spark Metrics
Judy <judynash@microsoft.com>, judynash <judynash@microsoft.com>
2015-02-24 20:50:16 +0000
Commit: c5ba975, github.com/apache/spark/pull/4644
[MLLIB] Change x_i to y_i in Variance's user guide
Xiangrui Meng <meng@databricks.com>
2015-02-24 11:38:59 -0800
Commit: 105791e, github.com/apache/spark/pull/4740
[SPARK-5965] Standalone Worker UI displays {{USER_JAR}}
Andrew Or <andrew@databricks.com>
2015-02-24 11:08:07 -0800
Commit: 6d2caa5, github.com/apache/spark/pull/4739
[Spark-5967] [UI] Correctly clean JobProgressListener.stageIdToActiveJobIds
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-24 11:02:47 -0800
Commit: 64d2c01, github.com/apache/spark/pull/4741
[SPARK-5532][SQL] Repartition should not use external rdd representation
Michael Armbrust <michael@databricks.com>
2015-02-24 10:52:18 -0800
Commit: 2012366, github.com/apache/spark/pull/4738
[SPARK-5910][SQL] Support for as in selectExpr
Michael Armbrust <michael@databricks.com>
2015-02-24 10:49:51 -0800
Commit: 0a59e45, github.com/apache/spark/pull/4736
[SPARK-5968] [SQL] Suppresses ParquetOutputCommitter WARN logs
Cheng Lian <lian@databricks.com>
2015-02-24 10:45:38 -0800
Commit: 8403331, github.com/apache/spark/pull/4744
[SPARK-5958][MLLIB][DOC] update block matrix user guide
Xiangrui Meng <meng@databricks.com>
2015-02-23 22:08:44 -0800
Commit: cf2e416, github.com/apache/spark/pull/4737
[SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecution
Michael Armbrust <michael@databricks.com>
2015-02-23 17:34:54 -0800
Commit: 1ed5708, github.com/apache/spark/pull/4684
[SPARK-5935][SQL] Accept MapType in the schema provided to a JSON dataset.
Yin Huai <yhuai@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
2015-02-23 17:16:34 -0800
Commit: 48376bf, github.com/apache/spark/pull/4710
[SPARK-5912] [docs] [mllib] Small fixes to ChiSqSelector docs
Joseph K. Bradley <joseph@databricks.com>
2015-02-23 16:15:57 -0800
Commit: 59536cc, github.com/apache/spark/pull/4732
[MLLIB] SPARK-5912 Programming guide for feature selection
Alexander Ulanov <nashb@yandex.ru>
2015-02-23 12:09:40 -0800
Commit: 28ccf5e, github.com/apache/spark/pull/4709
[SPARK-5939][MLLib] make FPGrowth example app take parameters
Jacky Li <jacky.likun@huawei.com>
2015-02-23 08:47:28 -0800
Commit: 651a1c0, github.com/apache/spark/pull/4714
[SPARK-5724] fix the misconfiguration in AkkaUtils
CodingCat <zhunansjtu@gmail.com>
2015-02-23 11:29:25 +0000
Commit: 242d495, github.com/apache/spark/pull/4512
[SPARK-5943][Streaming] Update the test to use new API to reduce the warning
Saisai Shao <saisai.shao@intel.com>
2015-02-23 11:27:27 +0000
Commit: 757b14b, github.com/apache/spark/pull/4722
[EXAMPLES] fix typo.
Makoto Fukuhara <fukuo33@gmail.com>
2015-02-23 09:24:33 +0000
Commit: 9348767, github.com/apache/spark/pull/4724
[SPARK-3885] Provide mechanism to remove accumulators once they are no longer used
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-02-22 22:43:04 -0800
Commit: 95cd643, github.com/apache/spark/pull/4021
[SPARK-911] allow efficient queries for a range if RDD is partitioned wi...
Aaron Josephs <ajoseph4@binghamton.edu>
2015-02-22 22:09:06 -0800
Commit: e4f9d03, github.com/apache/spark/pull/1381
[DataFrame] [Typo] Fix the typo
Cheng Hao <hao.cheng@intel.com>
2015-02-22 08:56:30 +0000
Commit: 275b1be, github.com/apache/spark/pull/4717
[DOCS] Fix typo in API for custom InputFormats based on the “new” MapReduce API
Alexander <abezzubov@nflabs.com>
2015-02-22 08:53:05 +0000
Commit: a7f9039, github.com/apache/spark/pull/4718
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-02-21 23:07:30 -0800
Commit: 46462ff, github.com/apache/spark/pull/3490
[SPARK-5860][CORE] JdbcRDD: overflow on large range with high number of partitions
Evan Yu <ehotou@gmail.com>
2015-02-21 20:40:21 +0000
Commit: 7683982, github.com/apache/spark/pull/4701
[SPARK-5937][YARN] Fix ClientSuite to set YARN mode, so that the correct class is used in t...
Hari Shreedharan <hshreedharan@apache.org>
2015-02-21 10:01:01 -0800
Commit: 7138816, github.com/apache/spark/pull/4711
SPARK-5841 [CORE] [HOTFIX 2] Memory leak in DiskBlockManager
Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
2015-02-21 09:59:28 -0800
Commit: d3cbd38, github.com/apache/spark/pull/4690
[MLlib] fix typo
Jacky Li <jackylk@users.noreply.github.com>
2015-02-21 13:00:16 +0000
Commit: e155324, github.com/apache/spark/pull/4713
[SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pandas and tuple/list
Davies Liu <davies@databricks.com>
2015-02-20 15:35:05 -0800
Commit: 5b0a42c, github.com/apache/spark/pull/4679
[SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 release
Joseph K. Bradley <joseph@databricks.com>
2015-02-20 02:31:32 -0800
Commit: 4a17eed, github.com/apache/spark/pull/4675
SPARK-5744 [CORE] Take 2. RDD.isEmpty / take fails for (empty) RDD of Nothing
Sean Owen <sowen@cloudera.com>
2015-02-20 10:21:39 +0000
Commit: d3dfebe, github.com/apache/spark/pull/4698
[SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache manager
Yin Huai <yhuai@databricks.com>
2015-02-20 16:20:02 +0800
Commit: 70bfb5c, github.com/apache/spark/pull/4694
[SPARK-4808] Removing minimum number of elements read before spill check
mcheah <mcheah@palantir.com>
2015-02-19 18:09:22 -0800
Commit: 3be92cd, github.com/apache/spark/pull/4420
[SPARK-5900][MLLIB] make PIC and FPGrowth Java-friendly
Xiangrui Meng <meng@databricks.com>
2015-02-19 18:06:16 -0800
Commit: 0cfd2ce, github.com/apache/spark/pull/4695
SPARK-5570: No docs stating that `new SparkConf().set("spark.driver.memory", ...)` will not work
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-02-19 15:50:58 -0800
Commit: 6bddc40, github.com/apache/spark/pull/4665
SPARK-4682 [CORE] Consolidate various 'Clock' classes
Sean Owen <sowen@cloudera.com>
2015-02-19 15:35:23 -0800
Commit: 34b7c35, github.com/apache/spark/pull/4514
[Spark-5889] Remove pid file after stopping service.
Zhan Zhang <zhazhan@gmail.com>
2015-02-19 23:13:02 +0000
Commit: ad6b169, github.com/apache/spark/pull/4676
[SPARK-5902] [ml] Made PipelineStage.transformSchema public instead of private to ml
Joseph K. Bradley <joseph@databricks.com>
2015-02-19 12:46:27 -0800
Commit: a5fed34, github.com/apache/spark/pull/4682
[SPARK-5904][SQL] DataFrame API fixes.
Reynold Xin <rxin@databricks.com>
2015-02-19 12:09:44 -0800
Commit: 8ca3418, github.com/apache/spark/pull/4686
[SPARK-5825] [Spark Submit] Remove the double checking instance name when stopping the service
Cheng Hao <hao.cheng@intel.com>
2015-02-19 12:07:51 -0800
Commit: 94cdb05, github.com/apache/spark/pull/4611
[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file
zsxwing <zsxwing@gmail.com>
2015-02-19 18:37:31 +0000
Commit: 90095bf, github.com/apache/spark/pull/4219
[SPARK-5816] Add huge compatibility warning in DriverWrapper
Andrew Or <andrew@databricks.com>
2015-02-19 09:56:25 -0800
Commit: 38e624a, github.com/apache/spark/pull/4687
SPARK-5548: Fix for AkkaUtilsSuite failure - attempt 2
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-02-19 09:53:36 -0800
Commit: fb87f44, github.com/apache/spark/pull/4653
[SPARK-5846] Correctly set job description and pool for SQL jobs
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-19 09:49:34 +0800
Commit: e945aa6, github.com/apache/spark/pull/4630
[SPARK-5879][MLLIB] update PIC user guide and add a Java example
Xiangrui Meng <meng@databricks.com>
2015-02-18 16:29:32 -0800
Commit: d12d2ad, github.com/apache/spark/pull/4680
[SPARK-5722] [SQL] [PySpark] infer int as LongType
Davies Liu <davies@databricks.com>
2015-02-18 14:17:04 -0800
Commit: aa8f10e, github.com/apache/spark/pull/4666
[SPARK-5840][SQL] HiveContext cannot be serialized due to tuple extraction
Reynold Xin <rxin@databricks.com>
2015-02-18 14:02:32 -0800
Commit: f0e3b71, github.com/apache/spark/pull/4628
[SPARK-5507] Added documentation for BlockMatrix
Burak Yavuz <brkyvz@gmail.com>
2015-02-18 10:11:08 -0800
Commit: a8eb92d, github.com/apache/spark/pull/4664
[SPARK-5519][MLLIB] add user guide with example code for fp-growth
Xiangrui Meng <meng@databricks.com>
2015-02-18 10:09:56 -0800
Commit: 85e9d09, github.com/apache/spark/pull/4661
SPARK-5669 [BUILD] [HOTFIX] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-18 14:41:44 +0000
Commit: 5aecdcf, github.com/apache/spark/pull/4673
[SPARK-4949]shutdownCallback in SparkDeploySchedulerBackend should be enclosed by synchronized block.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-02-18 12:20:11 +0000
Commit: 82197ed, github.com/apache/spark/pull/3781
SPARK-4610 addendum: [Minor] [MLlib] Minor doc fix in GBT classification example
MechCoder <manojkumarsivaraj334@gmail.com>
2015-02-18 10:13:28 +0000
Commit: e79a7a6, github.com/apache/spark/pull/4672
[SPARK-5878] fix DataFrame.repartition() in Python
Davies Liu <davies@databricks.com>
2015-02-18 01:00:54 -0800
Commit: c1b6fa9, github.com/apache/spark/pull/4667
Avoid deprecation warnings in JDBCSuite.
Tor Myklebust <tmyklebu@gmail.com>
2015-02-18 01:00:13 -0800
Commit: de0dd6d, github.com/apache/spark/pull/4668
[Minor] [SQL] Cleans up DataFrame variable names and toDF() calls
Cheng Lian <lian@databricks.com>
2015-02-17 23:36:20 -0800
Commit: 61ab085, github.com/apache/spark/pull/4670
[SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-17 22:44:16 -0800
Commit: 3912d33, github.com/apache/spark/pull/4597
[SPARK-5723][SQL]Change the default file format to Parquet for CTAS statements.
Yin Huai <yhuai@databricks.com>
2015-02-17 18:14:33 -0800
Commit: e50934f, github.com/apache/spark/pull/4639
[SPARK-5875][SQL]logical.Project should not be resolved if it contains aggregates or generators
Yin Huai <yhuai@databricks.com>
2015-02-17 17:50:39 -0800
Commit: d5f12bf, github.com/apache/spark/pull/4663
[SPARK-4454] Revert getOrElse() cleanup in DAGScheduler.getCacheLocs()
Josh Rosen <joshrosen@databricks.com>
2015-02-17 17:45:16 -0800
Commit: a51fc7e
[SPARK-4454] Properly synchronize accesses to DAGScheduler cacheLocs map
Josh Rosen <joshrosen@databricks.com>
2015-02-17 17:39:58 -0800
Commit: d46d624, github.com/apache/spark/pull/4660
[SPARK-5811] Added documentation for maven coordinates and added Spark Packages support
Burak Yavuz <brkyvz@gmail.com>, Davies Liu <davies@databricks.com>
2015-02-17 17:15:43 -0800
Commit: ae6cfb3, github.com/apache/spark/pull/4662
[SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
Davies Liu <davies@databricks.com>
2015-02-17 16:54:57 -0800
Commit: c3d2b90, github.com/apache/spark/pull/4629
[SPARK-5852][SQL]Fail to convert a newly created empty metastore parquet table to a data source parquet table.
Yin Huai <yhuai@databricks.com>, Cheng Hao <hao.cheng@intel.com>
2015-02-17 15:47:59 -0800
Commit: 117121a, github.com/apache/spark/pull/4655
[SPARK-5872] [SQL] create a sqlCtx in pyspark shell
Davies Liu <davies@databricks.com>
2015-02-17 15:44:37 -0800
Commit: 4d4cc76, github.com/apache/spark/pull/4659
[SPARK-5871] output explain in Python
Davies Liu <davies@databricks.com>
2015-02-17 13:48:38 -0800
Commit: 3df85dc, github.com/apache/spark/pull/4658
[SPARK-4172] [PySpark] Progress API in Python
Davies Liu <davies@databricks.com>
2015-02-17 13:36:43 -0800
Commit: 445a755, github.com/apache/spark/pull/3027
[SPARK-5868][SQL] Fix python UDFs in HiveContext and checks in SQLContext
Michael Armbrust <michael@databricks.com>
2015-02-17 13:23:45 -0800
Commit: de4836f, github.com/apache/spark/pull/4657
[SQL] [Minor] Update the HiveContext Unittest
Cheng Hao <hao.cheng@intel.com>
2015-02-17 12:25:35 -0800
Commit: 9d281fa, github.com/apache/spark/pull/4584
[Minor][SQL] Use same function to check path parameter in JSONRelation
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-17 12:24:13 -0800
Commit: ac506b7, github.com/apache/spark/pull/4649
[SPARK-5862][SQL] Only transformUp the given plan once in HiveMetastoreCatalog
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-17 12:23:18 -0800
Commit: 4611de1, github.com/apache/spark/pull/4651
[Minor] fix typo in SQL document
CodingCat <zhunansjtu@gmail.com>
2015-02-17 12:16:52 -0800
Commit: 31efb39, github.com/apache/spark/pull/4656
[SPARK-5864] [PySpark] support .jar as python package
Davies Liu <davies@databricks.com>
2015-02-17 12:05:06 -0800
Commit: fc4eb95, github.com/apache/spark/pull/4652
SPARK-5841 [CORE] [HOTFIX] Memory leak in DiskBlockManager
Sean Owen <sowen@cloudera.com>
2015-02-17 19:40:06 +0000
Commit: 49c19fd, github.com/apache/spark/pull/4648
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-02-17 11:35:26 -0800
Commit: 24f358b, github.com/apache/spark/pull/3297
[SPARK-3381] [MLlib] Eliminate bins for unordered features in DecisionTrees
MechCoder <manojkumarsivaraj334@gmail.com>
2015-02-17 11:19:23 -0800
Commit: 9b746f3, github.com/apache/spark/pull/4231
[SPARK-5661]function hasShutdownDeleteTachyonDir should use shutdownDeleteTachyonPaths to determine whether contains file
xukun 00228947 <xukun.xu@huawei.com>, viper-kun <xukun.xu@huawei.com>
2015-02-17 18:59:41 +0000
Commit: b271c26, github.com/apache/spark/pull/4418
[SPARK-5778] throw if nonexistent metrics config file provided
Ryan Williams <ryan.blake.williams@gmail.com>
2015-02-17 10:57:16 -0800
Commit: d8f69cf, github.com/apache/spark/pull/4571
[SPARK-5859] [PySpark] [SQL] fix DataFrame Python API
Davies Liu <davies@databricks.com>
2015-02-17 10:22:48 -0800
Commit: d8adefe, github.com/apache/spark/pull/4645
[SPARK-5166][SPARK-5247][SPARK-5258][SQL] API Cleanup / Documentation
Michael Armbrust <michael@databricks.com>
2015-02-17 10:21:17 -0800
Commit: c74b07f, github.com/apache/spark/pull/4642
[SPARK-5858][MLLIB] Remove unnecessary first() call in GLM
Xiangrui Meng <meng@databricks.com>
2015-02-17 10:17:45 -0800
Commit: c76da36, github.com/apache/spark/pull/4647
SPARK-5856: In Maven build script, launch Zinc with more memory
Patrick Wendell <patrick@databricks.com>
2015-02-17 10:10:01 -0800
Commit: 3ce46e9, github.com/apache/spark/pull/4643
Revert "[SPARK-5363] [PySpark] check ending mark in non-block way"
Josh Rosen <joshrosen@databricks.com>
2015-02-17 07:48:27 -0800
Commit: ee6e3ef
[SPARK-5826][Streaming] Fix Configuration not serializable problem
jerryshao <saisai.shao@intel.com>
2015-02-17 10:45:18 +0000
Commit: a65766b, github.com/apache/spark/pull/4612
HOTFIX: Style issue causing build break
Patrick Wendell <patrick@databricks.com>
2015-02-16 22:10:39 -0800
Commit: c06e42f
[SPARK-5802][MLLIB] cache transformed data in glm
Xiangrui Meng <meng@databricks.com>
2015-02-16 22:09:04 -0800
Commit: fd84229, github.com/apache/spark/pull/4593
[SPARK-5853][SQL] Schema support in Row.
Reynold Xin <rxin@databricks.com>
2015-02-16 20:42:57 -0800
Commit: d380f32, github.com/apache/spark/pull/4640
SPARK-5850: Remove experimental label for Scala 2.11 and FlumePollingStream
Patrick Wendell <patrick@databricks.com>
2015-02-16 20:33:33 -0800
Commit: a51d51f, github.com/apache/spark/pull/4638
[SPARK-5363] [PySpark] check ending mark in non-block way
Davies Liu <davies@databricks.com>
2015-02-16 20:32:03 -0800
Commit: ac6fe67, github.com/apache/spark/pull/4601
[SQL] Various DataFrame doc changes.
Reynold Xin <rxin@databricks.com>
2015-02-16 19:00:30 -0800
Commit: 0e180bf, github.com/apache/spark/pull/4636
[SPARK-5849] Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction
Josh Rosen <joshrosen@databricks.com>
2015-02-16 18:08:02 -0800
Commit: 58a82a7, github.com/apache/spark/pull/4637
[SPARK-3340] Deprecate ADD_JARS and ADD_FILES
azagrebin <azagrebin@gmail.com>
2015-02-16 18:06:19 -0800
Commit: 1668765, github.com/apache/spark/pull/4616
[SPARK-5788] [PySpark] capture the exception in python write thread
Davies Liu <davies@databricks.com>
2015-02-16 17:57:14 -0800
Commit: b1bd1dd, github.com/apache/spark/pull/4577
SPARK-5848: tear down the ConsoleProgressBar timer
Matt Whelan <mwhelan@perka.com>
2015-02-17 00:59:49 +0000
Commit: 1294a6e, github.com/apache/spark/pull/4635
[SPARK-4865][SQL]Include temporary tables in SHOW TABLES
Yin Huai <yhuai@databricks.com>
2015-02-16 15:59:23 -0800
Commit: e189cbb, github.com/apache/spark/pull/4618
[SQL] Optimize arithmetic and predicate operators
kai <kaizeng@eecs.berkeley.edu>
2015-02-16 15:58:05 -0800
Commit: cb6c48c, github.com/apache/spark/pull/4472
[SPARK-5839][SQL]HiveMetastoreCatalog does not recognize table names and aliases of data source tables.
Yin Huai <yhuai@databricks.com>
2015-02-16 15:54:01 -0800
Commit: f3ff1eb, github.com/apache/spark/pull/4626
[SPARK-5746][SQL] Check invalid cases for the write path of data source API
Yin Huai <yhuai@databricks.com>
2015-02-16 15:51:59 -0800
Commit: 5b6cd65, github.com/apache/spark/pull/4617
HOTFIX: Break in Jekyll build from #4589
Patrick Wendell <patrick@databricks.com>
2015-02-16 15:43:56 -0800
Commit: 04b401d
[SPARK-2313] Use socket to communicate GatewayServer port back to Python driver
Josh Rosen <joshrosen@databricks.com>
2015-02-16 15:25:11 -0800
Commit: 0cfda84, github.com/apache/spark/pull/3424
SPARK-5357: Update commons-codec version to 1.10 (current)
Matt Whelan <mwhelan@perka.com>
2015-02-16 23:05:34 +0000
Commit: c01c4eb, github.com/apache/spark/pull/4153
SPARK-5841: remove DiskBlockManager shutdown hook on stop
Matt Whelan <mwhelan@perka.com>
2015-02-16 22:54:32 +0000
Commit: bb05982, github.com/apache/spark/pull/4627
[SPARK-5833] [SQL] Adds REFRESH TABLE command
Cheng Lian <lian@databricks.com>
2015-02-16 12:52:05 -0800
Commit: c51ab37, github.com/apache/spark/pull/4624
[SPARK-5296] [SQL] Add more filter types for data sources API
Cheng Lian <lian@databricks.com>
2015-02-16 12:48:55 -0800
Commit: 6f54dee, github.com/apache/spark/pull/4623
[SQL] Add fetched row count in SparkSQLCLIDriver
OopsOutOfMemory <victorshengli@126.com>
2015-02-16 12:34:09 -0800
Commit: b4d7c70, github.com/apache/spark/pull/4604
[SQL] Initial support for reporting location of error in sql string
Michael Armbrust <michael@databricks.com>
2015-02-16 12:32:56 -0800
Commit: 104b2c4, github.com/apache/spark/pull/4587
[SPARK-5824] [SQL] add null format in ctas and set default col comment to null
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-16 12:31:36 -0800
Commit: 275a0c0, github.com/apache/spark/pull/4609
[SQL] [Minor] Update the SpecificMutableRow.copy
Cheng Hao <hao.cheng@intel.com>
2015-02-16 12:21:08 -0800
Commit: cc552e0, github.com/apache/spark/pull/4619
SPARK-5795 [STREAMING] api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not be friendly to Java
Sean Owen <sowen@cloudera.com>
2015-02-16 19:32:31 +0000
Commit: 8e25373, github.com/apache/spark/pull/4608
Minor fixes for commit https://github.com/apache/spark/pull/4592.
Reynold Xin <rxin@databricks.com>
2015-02-16 10:09:55 -0800
Commit: 9baac56
[SPARK-5799][SQL] Compute aggregation function on specified numeric columns
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-16 10:06:11 -0800
Commit: 5c78be7, github.com/apache/spark/pull/4592
SPARK-5815 [MLLIB] Part 2. Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-16 17:04:30 +0000
Commit: a3afa4a, github.com/apache/spark/pull/4625
[SPARK-5831][Streaming]When checkpoint file size is bigger than 10, then delete the old ones
Xutingjun <1039320815@qq.com>
2015-02-16 14:54:23 +0000
Commit: 1115e8e, github.com/apache/spark/pull/4621
[SPARK-4553] [SPARK-5767] [SQL] Wires Parquet data source with the newly introduced write support for data source API
Cheng Lian <lian@databricks.com>
2015-02-16 01:38:31 -0800
Commit: 3ce58cf, github.com/apache/spark/pull/4563
[Minor] [SQL] Renames stringRddToDataFrame to stringRddToDataFrameHolder for consistency
Cheng Lian <lian@databricks.com>
2015-02-16 01:33:37 -0800
Commit: 199a9e8, github.com/apache/spark/pull/4613
[Ml] SPARK-5804 Explicitly manage cache in Crossvalidator k-fold loop
Peter Rudenko <petro.rudenko@gmail.com>
2015-02-16 00:07:23 -0800
Commit: d51d6ba, github.com/apache/spark/pull/4595
[Ml] SPARK-5796 Don't transform data on a last estimator in Pipeline
Peter Rudenko <petro.rudenko@gmail.com>
2015-02-15 20:51:32 -0800
Commit: c78a12c, github.com/apache/spark/pull/4590
SPARK-5815 [MLLIB] Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-15 20:41:27 -0800
Commit: acf2558, github.com/apache/spark/pull/4614
[SPARK-5769] Set params in constructors and in setParams in Python ML pipelines
Xiangrui Meng <meng@databricks.com>
2015-02-15 20:29:26 -0800
Commit: cd4a153, github.com/apache/spark/pull/4564
SPARK-5669 [BUILD] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-15 09:15:48 -0800
Commit: 836577b, github.com/apache/spark/pull/4453
[MLLIB][SPARK-5502] User guide for isotonic regression
martinzapletal <zapletal-martin@email.cz>
2015-02-15 09:10:03 -0800
Commit: 61eb126, github.com/apache/spark/pull/4536
[SPARK-5827][SQL] Add missing import in the example of SqlContext
Takeshi Yamamuro <linguin.m.s@gmail.com>
2015-02-15 14:42:20 +0000
Commit: c771e47, github.com/apache/spark/pull/4615
SPARK-5822 [BUILD] cannot import src/main/scala & src/test/scala into eclipse as source folder
gli <gli@redhat.com>
2015-02-14 20:43:27 +0000
Commit: ed5f4bb, github.com/apache/spark/pull/4531
Revise formatting of previous commit f80e2629bb74bc62960c61ff313f7e7802d61319
Sean Owen <sowen@cloudera.com>
2015-02-14 20:12:29 +0000
Commit: 15a2ab5
[SPARK-5800] Streaming Docs. Change linked files according the selected language
gasparms <gmunoz@stratio.com>
2015-02-14 20:10:29 +0000
Commit: f80e262, github.com/apache/spark/pull/4589
[SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames
Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
2015-02-13 23:03:22 -0800
Commit: e98dfe6, github.com/apache/spark/pull/4556
SPARK-3290 [GRAPHX] No unpersist calls in SVDPlusPlus
Sean Owen <sowen@cloudera.com>
2015-02-13 20:12:52 -0800
Commit: 0ce4e43, github.com/apache/spark/pull/4234
[SPARK-5227] [SPARK-5679] Disable FileSystem cache in WholeTextFileRecordReaderSuite
Josh Rosen <joshrosen@databricks.com>
2015-02-13 17:45:31 -0800
Commit: d06d5ee, github.com/apache/spark/pull/4599
[SPARK-5730][ML] add doc groups to spark.ml components
Xiangrui Meng <meng@databricks.com>
2015-02-13 16:45:59 -0800
Commit: 4f4c6d5, github.com/apache/spark/pull/4600
[SPARK-5803][MLLIB] use ArrayBuilder to build primitive arrays
Xiangrui Meng <meng@databricks.com>
2015-02-13 16:43:49 -0800
Commit: d50a91d, github.com/apache/spark/pull/4594
[SPARK-5806] re-organize sections in mllib-clustering.md
Xiangrui Meng <meng@databricks.com>
2015-02-13 15:09:27 -0800
Commit: cc56c87, github.com/apache/spark/pull/4598
[SPARK-5789][SQL]Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.
Yin Huai <yhuai@databricks.com>
2015-02-13 13:51:06 -0800
Commit: 2e0c084, github.com/apache/spark/pull/4582
[SPARK-5642] [SQL] Apply column pruning on unused aggregation fields
Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
2015-02-13 13:46:50 -0800
Commit: 2cbb3e4, github.com/apache/spark/pull/4415
[HOTFIX] Fix build break in MesosSchedulerBackendSuite
Andrew Or <andrew@databricks.com>
2015-02-13 13:10:29 -0800
Commit: 5d3cc6b
[HOTFIX] Ignore DirectKafkaStreamSuite.
Reynold Xin <rxin@databricks.com>
2015-02-13 12:43:53 -0800
Commit: 378c7eb
SPARK-5805 Fixed the type error in documentation.
Emre Sevinç <emre.sevinc@gmail.com>
2015-02-13 12:31:27 -0800
Commit: 9f31db0, github.com/apache/spark/pull/4596
[SPARK-5735] Replace uses of EasyMock with Mockito
Josh Rosen <joshrosen@databricks.com>
2015-02-13 09:53:57 -0800
Commit: 077eec2, github.com/apache/spark/pull/4578
[SPARK-5783] Better eventlog-parsing error messages
Ryan Williams <ryan.blake.williams@gmail.com>
2015-02-13 09:47:26 -0800
Commit: fc6d3e7, github.com/apache/spark/pull/4573
[SPARK-5503][MLLIB] Example code for Power Iteration Clustering
sboeschhuawei <stephen.boesch@huawei.com>
2015-02-13 09:45:57 -0800
Commit: e1a1ff8, github.com/apache/spark/pull/4495
[SPARK-5732][CORE]:Add an option to print the spark version in spark script.
uncleGen <hustyugm@gmail.com>, genmao.ygm <genmao.ygm@alibaba-inc.com>
2015-02-13 09:43:10 -0800
Commit: c0ccd25, github.com/apache/spark/pull/4522
[SPARK-4832][Deploy]some other processes might take the daemon pid
WangTaoTheTonic <barneystinson@aliyun.com>, WangTaoTheTonic <wangtao111@huawei.com>
2015-02-13 10:27:23 +0000
Commit: 1768bd5, github.com/apache/spark/pull/3683
[SPARK-3365][SQL]Wrong schema generated for List type
tianyi <tianyi.asiainfo@gmail.com>
2015-02-12 22:18:39 -0800
Commit: 1c8633f, github.com/apache/spark/pull/4581
[SQL] Fix docs of SQLContext.tables
Yin Huai <yhuai@databricks.com>
2015-02-12 20:37:55 -0800
Commit: 2aea892, github.com/apache/spark/pull/4579
[SPARK-3299][SQL]Public API in SQLContext to list tables
Yin Huai <yhuai@databricks.com>
2015-02-12 18:08:01 -0800
Commit: 1d0596a, github.com/apache/spark/pull/4547
[SQL] Move SaveMode to SQL package.
Yin Huai <yhuai@databricks.com>
2015-02-12 15:32:17 -0800
Commit: c025a46, github.com/apache/spark/pull/4542
[SPARK-5335] Fix deletion of security groups within a VPC
Vladimir Grigor <vladimir@kiosked.com>, Vladimir Grigor <vladimir@voukka.com>
2015-02-12 23:26:24 +0000
Commit: ada993e, github.com/apache/spark/pull/4122
[SPARK-5755] [SQL] remove unnecessary Add
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-12 15:22:07 -0800
Commit: d5fc514, github.com/apache/spark/pull/4551
[SPARK-5573][SQL] Add explode to dataframes
Michael Armbrust <michael@databricks.com>
2015-02-12 15:19:19 -0800
Commit: ee04a8b, github.com/apache/spark/pull/4546
[SPARK-5758][SQL] Use LongType as the default type for integers in JSON schema inference.
Yin Huai <yhuai@databricks.com>
2015-02-12 15:17:25 -0800
Commit: c352ffb, github.com/apache/spark/pull/4544
[SPARK-5780] [PySpark] Mute the logging during unit tests
Davies Liu <davies@databricks.com>
2015-02-12 14:54:38 -0800
Commit: 0bf0315, github.com/apache/spark/pull/4572
SPARK-5747: Fix wordsplitting bugs in make-distribution.sh
David Y. Ross <dyross@gmail.com>
2015-02-12 14:52:38 -0800
Commit: 26c816e, github.com/apache/spark/pull/4540
[SPARK-5759][Yarn]ExecutorRunnable should catch YarnException while NMClient start contain...
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-12 14:50:16 -0800
Commit: 947b8bd, github.com/apache/spark/pull/4554
[SPARK-5760][SPARK-5761] Fix standalone rest protocol corner cases + revamp tests
Andrew Or <andrew@databricks.com>
2015-02-12 14:47:52 -0800
Commit: 1d5663e, github.com/apache/spark/pull/4557
[SPARK-5762] Fix shuffle write time for sort-based shuffle
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-12 14:46:37 -0800
Commit: 47c73d4, github.com/apache/spark/pull/4559
[SPARK-5765][Examples]Fixed word split problem in run-example and compute-classpath
Venkata Ramana G <ramana.gollamudi@huawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-02-12 14:44:21 -0800
Commit: 629d014, github.com/apache/spark/pull/4561
[EC2] Update default Spark version to 1.2.1
Katsunori Kanda <potix2@gmail.com>
2015-02-12 14:38:42 -0800
Commit: 9c80765, github.com/apache/spark/pull/4566
[SPARK-5645] Added local read bytes/time to task metrics
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-12 14:35:44 -0800
Commit: 893d6fd, github.com/apache/spark/pull/4510
[SQL] Improve error messages
Michael Armbrust <michael@databricks.com>, wangfei <wangfei1@huawei.com>
2015-02-12 13:11:28 -0800
Commit: aa4ca8b, github.com/apache/spark/pull/4558
[SQL][DOCS] Update sql documentation
Antonio Navarro Perez <ajnavarro@users.noreply.github.com>
2015-02-12 12:46:17 -0800
Commit: 6a1be02, github.com/apache/spark/pull/4560
SPARK-5776 JIRA version not of form x.y.z breaks merge_spark_pr.py
Sean Owen <sowen@cloudera.com>
2015-02-12 20:14:45 +0000
Commit: bc57789, github.com/apache/spark/pull/4570
[SPARK-5757][MLLIB] replace SQL JSON usage in model import/export by json4s
Xiangrui Meng <meng@databricks.com>
2015-02-12 10:48:13 -0800
Commit: 99bd500, github.com/apache/spark/pull/4555
[SPARK-5655] Don't chmod700 application files if running in YARN
Andrew Rowson <github@growse.com>
2015-02-12 18:41:39 +0000
Commit: 466b1f6, github.com/apache/spark/pull/4509
ignore cache paths for RAT tests
Oren Mazor <oren.mazor@gmail.com>
2015-02-12 18:37:00 +0000
Commit: 9a6efbc, github.com/apache/spark/pull/4569
SPARK-5727 [BUILD] Remove Debian packaging
Sean Owen <sowen@cloudera.com>
2015-02-12 12:36:26 +0000
Commit: 9a3ea49, github.com/apache/spark/pull/4526
[SQL] Make dataframe more tolerant of being serialized
Michael Armbrust <michael@databricks.com>
2015-02-11 19:05:49 -0800
Commit: a38e23c, github.com/apache/spark/pull/4545
[SQL] Two DataFrame fixes.
Reynold Xin <rxin@databricks.com>
2015-02-11 18:32:48 -0800
Commit: d931b01, github.com/apache/spark/pull/4543
[SPARK-3688][SQL] More inline comments for LogicalPlan.
Reynold Xin <rxin@databricks.com>
2015-02-11 15:26:31 -0800
Commit: fa6bdc6, github.com/apache/spark/pull/4539
[SPARK-3688][SQL] LogicalPlan can't resolve column correctly
tianyi <tianyi.asiainfo@gmail.com>
2015-02-11 12:50:17 -0800
Commit: 44b2311, github.com/apache/spark/pull/4524
[SPARK-5454] More robust handling of self joins
Michael Armbrust <michael@databricks.com>
2015-02-11 12:31:56 -0800
Commit: a60d2b7, github.com/apache/spark/pull/4520
Remove outdated remark about take(n).
Daniel Darabos <darabos.daniel@gmail.com>
2015-02-11 20:24:17 +0000
Commit: 03bf704, github.com/apache/spark/pull/4533
[SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
Davies Liu <davies@databricks.com>
2015-02-11 12:13:16 -0800
Commit: b694eb9, github.com/apache/spark/pull/4528
[SPARK-5733] Error Link in Pagination of HistoryPage when showing Incomplete Applications
guliangliang <guliangliang@qiyi.com>
2015-02-11 15:55:49 +0000
Commit: 1ac099e, github.com/apache/spark/pull/4523
SPARK-5727 [BUILD] Deprecate Debian packaging
Sean Owen <sowen@cloudera.com>
2015-02-11 08:30:16 +0000
Commit: bd0d6e0, github.com/apache/spark/pull/4516
SPARK-5728 [STREAMING] MQTTStreamSuite leaves behind ActiveMQ database files
Sean Owen <sowen@cloudera.com>
2015-02-11 08:13:51 +0000
Commit: da89720, github.com/apache/spark/pull/4517
[SPARK-4964] [Streaming] refactor createRDD to take leaders via map instead of array
cody koeninger <cody@koeninger.org>
2015-02-11 00:13:27 -0800
Commit: 658687b, github.com/apache/spark/pull/4511
HOTFIX: Adding Junit to Hive tests for Maven build
Patrick Wendell <patrick@databricks.com>
2015-02-10 23:39:21 -0800
Commit: c2131c0
HOTFIX: Java 6 compilation error in Spark SQL
Patrick Wendell <patrick@databricks.com>
2015-02-10 22:43:32 -0800
Commit: 7e2f882
[SPARK-5714][Mllib] Refactor initial step of LDA to remove redundant operations
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-10 21:51:15 -0800
Commit: f86a89a, github.com/apache/spark/pull/4501
[SPARK-5702][SQL] Allow short names for built-in data sources.
Reynold Xin <rxin@databricks.com>
2015-02-10 20:40:21 -0800
Commit: b8f88d3, github.com/apache/spark/pull/4489
[SPARK-5729] Potential NPE in standalone REST API
Andrew Or <andrew@databricks.com>
2015-02-10 20:19:14 -0800
Commit: b969182, github.com/apache/spark/pull/4518
[SPARK-4879] Use driver to coordinate Hadoop output committing for speculative tasks
mcheah <mcheah@palantir.com>, Josh Rosen <joshrosen@databricks.com>
2015-02-10 20:12:18 -0800
Commit: 1cb3770, github.com/apache/spark/pull/4155
[SQL][DataFrame] Fix column computability bug.
Reynold Xin <rxin@databricks.com>
2015-02-10 19:50:44 -0800
Commit: 7e24249, github.com/apache/spark/pull/4519
[SPARK-5709] [SQL] Add EXPLAIN support in DataFrame API for debugging purpose
Cheng Hao <hao.cheng@intel.com>
2015-02-10 19:40:51 -0800
Commit: 45df77b, github.com/apache/spark/pull/4496
[SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns
Davies Liu <davies@databricks.com>
2015-02-10 19:40:12 -0800
Commit: ea60284, github.com/apache/spark/pull/4498
[SPARK-5683] [SQL] Avoid multiple json generator created
Cheng Hao <hao.cheng@intel.com>
2015-02-10 18:19:56 -0800
Commit: a60aea8, github.com/apache/spark/pull/4468
[SQL] Add an exception for analysis errors.
Michael Armbrust <michael@databricks.com>
2015-02-10 17:32:42 -0800
Commit: 6195e24, github.com/apache/spark/pull/4439
[SPARK-5658][SQL] Finalize DDL and write support APIs
Yin Huai <yhuai@databricks.com>
2015-02-10 17:29:52 -0800
Commit: aaf50d0, github.com/apache/spark/pull/4446
[SPARK-5493] [core] Add option to impersonate user.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-10 17:19:10 -0800
Commit: ed167e7, github.com/apache/spark/pull/4405
[SQL] Make Options in the data source API CREATE TABLE statements optional.
Yin Huai <yhuai@databricks.com>
2015-02-10 17:06:12 -0800
Commit: e28b6bd, github.com/apache/spark/pull/4515
[SPARK-5725] [SQL] Fixes ParquetRelation2.equals
Cheng Lian <lian@databricks.com>
2015-02-10 17:02:44 -0800
Commit: 2d50a01, github.com/apache/spark/pull/4513
[SQL][Minor] correct some comments
Sheng, Li <OopsOutOfMemory@users.noreply.github.com>, OopsOutOfMemory <victorshengli@126.com>
2015-02-11 00:59:46 +0000
Commit: 91e3512, github.com/apache/spark/pull/4508
[SPARK-5644] [Core] Delete tmp dir when sc is stopped
Sephiroth-Lin <linwzhong@gmail.com>
2015-02-10 23:23:35 +0000
Commit: 52983d7, github.com/apache/spark/pull/4412
[SPARK-5343][GraphX]: ShortestPaths traverses backwards
Brennon York <brennon.york@capitalone.com>
2015-02-10 14:57:00 -0800
Commit: 5820961, github.com/apache/spark/pull/4478
[SPARK-5021] [MLlib] Gaussian Mixture now supports Sparse Input
MechCoder <manojkumarsivaraj334@gmail.com>
2015-02-10 14:05:55 -0800
Commit: fd2c032, github.com/apache/spark/pull/4459
[SPARK-5686][SQL] Add show current roles command in HiveQl
OopsOutOfMemory <victorshengli@126.com>
2015-02-10 13:20:15 -0800
Commit: f98707c, github.com/apache/spark/pull/4471
[SQL] Add toString to DataFrame/Column
Michael Armbrust <michael@databricks.com>
2015-02-10 13:14:01 -0800
Commit: de80b1b, github.com/apache/spark/pull/4436
[SPARK-5668] Display region in spark_ec2.py get_existing_cluster()
Miguel Peralvo <miguel.peralvo@gmail.com>
2015-02-10 19:54:52 +0000
Commit: c49a404, github.com/apache/spark/pull/4457
[SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table
wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2015-02-10 11:54:30 -0800
Commit: 59272da, github.com/apache/spark/pull/4368
[HOTFIX][SPARK-4136] Fix compilation and tests
Andrew Or <andrew@databricks.com>
2015-02-10 11:18:01 -0800
Commit: b640c84
SPARK-4136. Under dynamic allocation, cancel outstanding executor requests when no longer needed
Sandy Ryza <sandy@cloudera.com>
2015-02-10 11:07:25 -0800
Commit: 69bc3bb, github.com/apache/spark/pull/4168
[SPARK-5716] [SQL] Support TOK_CHARSETLITERAL in HiveQl
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-10 11:08:21 -0800
Commit: c7ad80a, github.com/apache/spark/pull/4502
[SPARK-5717] [MLlib] add stop and reorganize import
JqueryFan <firing@126.com>, Yuhao Yang <hhbyyh@gmail.com>
2015-02-10 17:37:32 +0000
Commit: 6cc96cf, github.com/apache/spark/pull/4503
[SPARK-1805] [EC2] Validate instance types
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-10 15:45:38 +0000
Commit: 50820f1, github.com/apache/spark/pull/4455
[SPARK-5700] [SQL] [Build] Bumps jets3t to 0.9.3 for hadoop-2.3 and hadoop-2.4 profiles
Cheng Lian <lian@databricks.com>
2015-02-10 02:28:47 -0800
Commit: ba66793, github.com/apache/spark/pull/4499
SPARK-5239 [CORE] JdbcRDD throws "java.lang.AbstractMethodError: oracle.jdbc.driver.xxxxxx.isClosed()Z"
Sean Owen <sowen@cloudera.com>
2015-02-10 09:19:01 +0000
Commit: 2d1e916, github.com/apache/spark/pull/4470
[SPARK-4964][Streaming][Kafka] More updates to Exactly-once Kafka stream
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-09 22:45:48 -0800
Commit: c151346, github.com/apache/spark/pull/4384
[SPARK-5597][MLLIB] save/load for decision trees and ensembles
Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
2015-02-09 22:09:07 -0800
Commit: ef2f55b, github.com/apache/spark/pull/4444
[SQL] Remove the duplicated code
Cheng Hao <hao.cheng@intel.com>
2015-02-09 21:33:34 -0800
Commit: bd0b5ea, github.com/apache/spark/pull/4494
[SPARK-5701] Only set ShuffleReadMetrics when task has shuffle deps
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-09 21:22:09 -0800
Commit: a2d33d0, github.com/apache/spark/pull/4488
[SPARK-5703] AllJobsPage throws empty.max exception
Andrew Or <andrew@databricks.com>
2015-02-09 21:18:48 -0800
Commit: a95ed52, github.com/apache/spark/pull/4490
[SPARK-2996] Implement userClassPathFirst for driver, yarn.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-09 21:17:06 -0800
Commit: 20a6013, github.com/apache/spark/pull/3233
SPARK-4900 [MLLIB] MLlib SingularValueDecomposition ARPACK IllegalStateException
Sean Owen <sowen@cloudera.com>
2015-02-09 21:13:58 -0800
Commit: 36c4e1d, github.com/apache/spark/pull/4485
Add a config option to print DAG.
KaiXinXiaoLei <huleilei1@huawei.com>
2015-02-09 20:58:58 -0800
Commit: 31d435e, github.com/apache/spark/pull/4257
[SPARK-5469] restructure pyspark.sql into multiple files
Davies Liu <davies@databricks.com>
2015-02-09 20:49:22 -0800
Commit: 08488c1, github.com/apache/spark/pull/4479
[SPARK-5698] Do not let user request negative # of executors
Andrew Or <andrew@databricks.com>
2015-02-09 17:33:29 -0800
Commit: d302c48, github.com/apache/spark/pull/4483
[SPARK-5699] [SQL] [Tests] Runs hive-thriftserver tests whenever SQL code is modified
Cheng Lian <lian@databricks.com>
2015-02-09 16:52:05 -0800
Commit: 3ec3ad2, github.com/apache/spark/pull/4486
[SPARK-5648][SQL] support "alter ... unset tblproperties("key")"
DoingDone9 <799203320@qq.com>
2015-02-09 16:40:26 -0800
Commit: d08e7c2, github.com/apache/spark/pull/4424
[SPARK-2096][SQL] support dot notation on array of struct
Wenchen Fan <cloud0fan@outlook.com>
2015-02-09 16:39:34 -0800
Commit: 0ee53eb, github.com/apache/spark/pull/2405
[SPARK-5614][SQL] Predicate pushdown through Generate.
Lu Yan <luyan02@baidu.com>
2015-02-09 16:25:38 -0800
Commit: 2a36292, github.com/apache/spark/pull/4394
[SPARK-5696] [SQL] [HOTFIX] Asks HiveThriftServer2 to re-initialize log4j using Hive configurations
Cheng Lian <lian@databricks.com>
2015-02-09 16:23:12 -0800
Commit: b8080aa, github.com/apache/spark/pull/4484
[SQL] Code cleanup.
Yin Huai <yhuai@databricks.com>
2015-02-09 16:20:42 -0800
Commit: 5f0b30e, github.com/apache/spark/pull/4482
[SQL] Add some missing DataFrame functions.
Michael Armbrust <michael@databricks.com>
2015-02-09 16:02:56 -0800
Commit: 68b25cf, github.com/apache/spark/pull/4437
[SPARK-5611] [EC2] Allow spark-ec2 repo and branch to be set on CLI of spark_ec2.py
Florian Verhein <florian.verhein@gmail.com>
2015-02-09 23:47:07 +0000
Commit: b884daa, github.com/apache/spark/pull/4385
[SPARK-5675][SQL] XyzType companion object should subclass XyzType
Reynold Xin <rxin@databricks.com>
2015-02-09 14:51:46 -0800
Commit: f48199e, github.com/apache/spark/pull/4463
[SPARK-4905][STREAMING] FlumeStreamSuite fix.
Hari Shreedharan <hshreedharan@apache.org>
2015-02-09 14:17:14 -0800
Commit: 0765af9, github.com/apache/spark/pull/4371
[SPARK-5691] Fixing wrong data structure lookup for dupe app registratio...
mcheah <mcheah@palantir.com>
2015-02-09 13:20:14 -0800
Commit: 6fe70d8, github.com/apache/spark/pull/4477
[SPARK-5664][BUILD] Restore stty settings when exiting from SBT's spark-shell
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-09 11:45:12 -0800
Commit: dae2161, github.com/apache/spark/pull/4451
[SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
Davies Liu <davies@databricks.com>
2015-02-09 11:42:52 -0800
Commit: afb1316, github.com/apache/spark/pull/4476
SPARK-4267 [YARN] Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later
Sean Owen <sowen@cloudera.com>
2015-02-09 10:33:57 -0800
Commit: de78060, github.com/apache/spark/pull/4452
SPARK-2149. [MLLIB] Univariate kernel density estimation
Sandy Ryza <sandy@cloudera.com>
2015-02-09 10:12:12 +0000
Commit: 0793ee1, github.com/apache/spark/pull/1093
[SPARK-5473] [EC2] Expose SSH failures after status checks pass
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-09 09:44:53 +0000
Commit: 4dfe180, github.com/apache/spark/pull/4262
[SPARK-5539][MLLIB] LDA guide
Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
2015-02-08 23:40:36 -0800
Commit: 855d12a, github.com/apache/spark/pull/4465
[SPARK-5472][SQL] Fix Scala code style
Hung Lin <hung@zoomdata.com>
2015-02-08 22:36:42 -0800
Commit: 4575c56, github.com/apache/spark/pull/4464
SPARK-4405 [MLLIB] Matrices.* construction methods should check for rows x cols overflow
Sean Owen <sowen@cloudera.com>
2015-02-08 21:08:50 -0800
Commit: 4396dfb, github.com/apache/spark/pull/4461
[SPARK-5660][MLLIB] Make Matrix apply public
Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
2015-02-08 21:07:36 -0800
Commit: c171611, github.com/apache/spark/pull/4447
[SPARK-5643][SQL] Add a show method to print the content of a DataFrame in tabular format.
Reynold Xin <rxin@databricks.com>
2015-02-08 18:56:51 -0800
Commit: a052ed4, github.com/apache/spark/pull/4416
SPARK-5665 [DOCS] Update netlib-java documentation
Sam Halliday <sam.halliday@Gmail.com>, Sam Halliday <sam.halliday@gmail.com>
2015-02-08 16:34:26 -0800
Commit: 56aff4b, github.com/apache/spark/pull/4448
[SPARK-5598][MLLIB] model save/load for ALS
Xiangrui Meng <meng@databricks.com>
2015-02-08 16:26:20 -0800
Commit: 5c299c5, github.com/apache/spark/pull/4422
[SQL] Set sessionState in QueryExecution.
Yin Huai <yhuai@databricks.com>
2015-02-08 14:55:07 -0800
Commit: 804949d, github.com/apache/spark/pull/4445
[SPARK-3039] [BUILD] Spark assembly for new hadoop API (hadoop 2) contai...
medale <medale94@yahoo.com>
2015-02-08 10:35:29 +0000
Commit: 75fdccc, github.com/apache/spark/pull/4315
[SPARK-5672][Web UI] Don't return `ERROR 500` when args are missing
Kirill A. Korinskiy <catap@catap.ru>
2015-02-08 10:31:46 +0000
Commit: 23a99da, github.com/apache/spark/pull/4239
[SPARK-5656] Fail gracefully for large values of k and/or n that will ex...
mbittmann <mbittmann@gmail.com>, bittmannm <mark.bittmann@agilex.com>
2015-02-08 10:13:29 +0000
Commit: 4878313, github.com/apache/spark/pull/4433
[SPARK-5366][EC2] Check the mode of private key
liuchang0812 <liuchang0812@gmail.com>
2015-02-08 10:08:51 +0000
Commit: 6fb141e, github.com/apache/spark/pull/4162
[SPARK-5671] Upgrade jets3t to 0.9.2 in hadoop-2.3 and 2.4 profiles
Josh Rosen <joshrosen@databricks.com>
2015-02-07 17:19:08 -0800
Commit: 5de14cc, github.com/apache/spark/pull/4454
[SPARK-5108][BUILD] Jackson dependency management for Hadoop-2.6.0 support
Zhan Zhang <zhazhan@gmail.com>
2015-02-07 19:41:30 +0000
Commit: ecbbed2, github.com/apache/spark/pull/3938
SPARK-5408: Use -XX:MaxPermSize specified by user instead of default in ...
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-02-07 15:58:04 +0000
Commit: dd4cb33, github.com/apache/spark/pull/4203
[BUILD] Add the ability to launch spark-shell from SBT.
Michael Armbrust <michael@databricks.com>
2015-02-07 00:14:38 -0800
Commit: e9a4fe1, github.com/apache/spark/pull/4438
[SPARK-5388] Provide a stable application submission gateway for standalone cluster mode
Andrew Or <andrew@databricks.com>
2015-02-06 15:57:06 -0800
Commit: 1390e56, github.com/apache/spark/pull/4216
SPARK-5403: Ignore UserKnownHostsFile in SSH calls
Grzegorz Dubicki <grzegorz.dubicki@gmail.com>
2015-02-06 15:43:58 -0800
Commit: e772b4e, github.com/apache/spark/pull/4196
[SPARK-5601][MLLIB] make streaming linear algorithms Java-friendly
Xiangrui Meng <meng@databricks.com>
2015-02-06 15:42:59 -0800
Commit: 0e23ca9, github.com/apache/spark/pull/4432
[SQL] [Minor] HiveParquetSuite was disabled by mistake, re-enable it
Cheng Lian <lian@databricks.com>
2015-02-06 15:23:42 -0800
Commit: c402140, github.com/apache/spark/pull/4440
[SQL] Use TestSQLContext in Java tests
Michael Armbrust <michael@databricks.com>
2015-02-06 15:11:02 -0800
Commit: 76c4bf5, github.com/apache/spark/pull/4441
[SPARK-4994][network]Cleanup removed executors' ShuffleInfo in yarn shuffle service
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 14:47:52 -0800
Commit: 61073f8, github.com/apache/spark/pull/3828
[SPARK-5444][Network]Add a retry to deal with the conflict port in netty server.
huangzhaowei <carlmartinmax@gmail.com>
2015-02-06 14:35:29 -0800
Commit: 2bda1c1, github.com/apache/spark/pull/4240
[SPARK-4874] [CORE] Collect record count metrics
Kostas Sakellis <kostas@cloudera.com>
2015-02-06 14:31:20 -0800
Commit: dcd1e42, github.com/apache/spark/pull/4067
[HOTFIX] Fix the maven build after adding sqlContext to spark-shell
Michael Armbrust <michael@databricks.com>
2015-02-06 14:27:06 -0800
Commit: 5796156, github.com/apache/spark/pull/4443
[SPARK-5600] [core] Clean up FsHistoryProvider test, fix app sort order.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-06 14:23:09 -0800
Commit: 5687bab, github.com/apache/spark/pull/4370
SPARK-5613: Catch the ApplicationNotFoundException exception to avoid thread from getting killed on yarn restart.
Kashish Jain <kashish.jain@guavus.com>
2015-02-06 13:47:23 -0800
Commit: ca66159, github.com/apache/spark/pull/4392
SPARK-5633 pyspark saveAsTextFile support for compression codec
Vladimir Vladimirov <vladimir.vladimirov@magnetic.com>
2015-02-06 13:55:02 -0800
Commit: b3872e0, github.com/apache/spark/pull/4403
[HOTFIX][MLLIB] fix a compilation error with java 6
Xiangrui Meng <meng@databricks.com>
2015-02-06 13:52:35 -0800
Commit: 65181b7, github.com/apache/spark/pull/4442
[SPARK-4983] Insert waiting time before tagging EC2 instances
GenTang <gen.tang86@gmail.com>, Gen TANG <gen.tang86@gmail.com>
2015-02-06 13:27:34 -0800
Commit: 0f3a360, github.com/apache/spark/pull/3986
[SPARK-5586][Spark Shell][SQL] Make `sqlContext` available in spark shell
OopsOutOfMemory <victorshengli@126.com>
2015-02-06 13:20:10 -0800
Commit: 3d3ecd7, github.com/apache/spark/pull/4387
[SPARK-5278][SQL] Introduce UnresolvedGetField and complete the check of ambiguous reference to fields
Wenchen Fan <cloud0fan@outlook.com>
2015-02-06 13:08:09 -0800
Commit: 4793c84, github.com/apache/spark/pull/4068
[SQL][Minor] Remove cache keyword in SqlParser
wangfei <wangfei1@huawei.com>
2015-02-06 12:42:23 -0800
Commit: bc36356, github.com/apache/spark/pull/4393
[SQL][HiveConsole][DOC] HiveConsole `correct hiveconsole imports`
OopsOutOfMemory <victorshengli@126.com>
2015-02-06 12:41:28 -0800
Commit: b62c352, github.com/apache/spark/pull/4389
[SPARK-5595][SPARK-5603][SQL] Add a rule to do PreInsert type casting and field renaming and invalidating in memory cache after INSERT
Yin Huai <yhuai@databricks.com>
2015-02-06 12:38:07 -0800
Commit: 3eccf29, github.com/apache/spark/pull/4373
[SPARK-5324][SQL] Results of describe can't be queried
OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
2015-02-06 12:33:20 -0800
Commit: 0b7eb3f, github.com/apache/spark/pull/4249
[SPARK-5619][SQL] Support 'show roles' in HiveContext
q00251598 <qiyadong@huawei.com>
2015-02-06 12:29:26 -0800
Commit: a958d60, github.com/apache/spark/pull/4397
[SPARK-5640] Synchronize ScalaReflection where necessary
Tobias Schlatter <tobias@meisch.ch>
2015-02-06 12:15:02 -0800
Commit: 500dc2b, github.com/apache/spark/pull/4431
[SPARK-5650][SQL] Support optional 'FROM' clause
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-06 12:13:44 -0800
Commit: d433816, github.com/apache/spark/pull/4426
[SPARK-5628] Add version option to spark-ec2
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-06 12:08:22 -0800
Commit: 70e5b03, github.com/apache/spark/pull/4414
[SPARK-2945][YARN][Doc]add doc for spark.executor.instances
WangTaoTheTonic <wangtao111@huawei.com>
2015-02-06 11:57:02 -0800
Commit: d34f79c, github.com/apache/spark/pull/4350
[SPARK-4361][Doc] Add more docs for Hadoop Configuration
zsxwing <zsxwing@gmail.com>
2015-02-06 11:50:20 -0800
Commit: af2a2a2, github.com/apache/spark/pull/3225
[HOTFIX] Fix test build break in ExecutorAllocationManagerSuite.
Josh Rosen <joshrosen@databricks.com>
2015-02-06 11:47:32 -0800
Commit: fb6c0cb
[SPARK-5652][Mllib] Use broadcasted weights in LogisticRegressionModel
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-06 11:22:11 -0800
Commit: 80f3bcb, github.com/apache/spark/pull/4429
[SPARK-5555] Enable UISeleniumSuite tests
Josh Rosen <joshrosen@databricks.com>
2015-02-06 11:14:58 -0800
Commit: 0d74bd7, github.com/apache/spark/pull/4334
SPARK-2450 Adds executor log links to Web UI
Kostas Sakellis <kostas@cloudera.com>, Josh Rosen <joshrosen@databricks.com>
2015-02-06 11:13:00 -0800
Commit: 32e964c, github.com/apache/spark/pull/3486
[SPARK-5618][Spark Core][Minor] Optimise utility code.
Makoto Fukuhara <fukuo33@gmail.com>
2015-02-06 11:11:38 -0800
Commit: 4cdb26c, github.com/apache/spark/pull/4396
[SPARK-5593][Core]Replace BlockManagerListener with ExecutorListener in ExecutorAllocationListener
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 11:09:37 -0800
Commit: 6072fcc, github.com/apache/spark/pull/4369
[SPARK-4877] Allow user first classes to extend classes in the parent.
Stephen Haberman <stephen@exigencecorp.com>
2015-02-06 11:03:56 -0800
Commit: 9792bec, github.com/apache/spark/pull/3725
[SPARK-5396] Syntax error in spark scripts on windows.
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-02-06 10:58:26 -0800
Commit: c01b985, github.com/apache/spark/pull/4428
[SPARK-5636] Ramp up faster in dynamic allocation
Andrew Or <andrew@databricks.com>
2015-02-06 10:54:23 -0800
Commit: fe3740c, github.com/apache/spark/pull/4409
SPARK-4337. [YARN] Add ability to cancel pending requests
Sandy Ryza <sandy@cloudera.com>
2015-02-06 10:53:16 -0800
Commit: 1a88f20, github.com/apache/spark/pull/4141
[SPARK-5653][YARN] In ApplicationMaster rename isDriver to isClusterMode
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 10:48:31 -0800
Commit: cc6e531, github.com/apache/spark/pull/4430
[SPARK-5013] [MLlib] Added documentation and sample data file for GaussianMixture
Travis Galoppo <tjg2107@columbia.edu>
2015-02-06 10:26:51 -0800
Commit: 9ad56ad, github.com/apache/spark/pull/4401
[SPARK-5416] init Executor.threadPool before ExecutorSource
Ryan Williams <ryan.blake.williams@gmail.com>
2015-02-06 12:22:25 +0000
Commit: 37d35ab, github.com/apache/spark/pull/4212
[Build] Set all Debian package permissions to 755
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-06 11:38:39 +0000
Commit: cf6778e, github.com/apache/spark/pull/4277
Update ec2-scripts.md
Miguel Peralvo <miguel.peralvo@gmail.com>
2015-02-06 11:04:48 +0000
Commit: f827ef4, github.com/apache/spark/pull/4300
[SPARK-5470][Core]use defaultClassLoader to load classes in KryoSerializer
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 11:00:35 +0000
Commit: ed3aac7, github.com/apache/spark/pull/4258
[SPARK-5582] [history] Ignore empty log directories.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-06 10:07:20 +0000
Commit: 8569289, github.com/apache/spark/pull/4352
[SPARK-5157][YARN] Configure more JVM options properly when we use ConcMarkSweepGC for AM.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-02-06 09:39:12 +0000
Commit: 24dbc50, github.com/apache/spark/pull/3956
[Minor] Remove permission for execution from spark-shell.cmd
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-02-06 09:33:36 +0000
Commit: f6ba813, github.com/apache/spark/pull/3983
[SPARK-5380][GraphX] Solve an ArrayIndexOutOfBoundsException when build graph with a file format error
Leolh <leosandylh@gmail.com>
2015-02-06 09:01:53 +0000
Commit: 575d2df, github.com/apache/spark/pull/4176
[SPARK-4789] [SPARK-4942] [SPARK-5031] [mllib] Standardize ML Prediction APIs
Joseph K. Bradley <joseph@databricks.com>
2015-02-05 23:43:47 -0800
Commit: dc0c449, github.com/apache/spark/pull/3637
[SPARK-5604][MLLIB] remove checkpointDir from trees
Xiangrui Meng <meng@databricks.com>
2015-02-05 23:32:09 -0800
Commit: 6b88825, github.com/apache/spark/pull/4407
[SPARK-5639][SQL] Support DataFrame.renameColumn.
Reynold Xin <rxin@databricks.com>
2015-02-05 23:02:40 -0800
Commit: 7dc4965, github.com/apache/spark/pull/4410
Revert "SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2."
Patrick Wendell <patrick@databricks.com>
2015-02-05 18:36:48 -0800
Commit: 6d3b7cb
SPARK-5557: Explicitly include servlet API in dependencies.
Patrick Wendell <patrick@databricks.com>
2015-02-05 18:14:54 -0800
Commit: 793dbae, github.com/apache/spark/pull/4411
[HOTFIX] [SQL] Disables Metastore Parquet table conversion for "SQLQuerySuite.CTAS with serde"
Cheng Lian <lian@databricks.com>
2015-02-05 18:09:18 -0800
Commit: 7c0a648, github.com/apache/spark/pull/4413
[SPARK-5638][SQL] Add a config flag to disable eager analysis of DataFrames
Reynold Xin <rxin@databricks.com>
2015-02-05 18:07:10 -0800
Commit: e8a5d50, github.com/apache/spark/pull/4408
[SPARK-5620][DOC] group methods in generated unidoc
Xiangrui Meng <meng@databricks.com>
2015-02-05 16:26:51 -0800
Commit: 85ccee8, github.com/apache/spark/pull/4404
[SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source improvements
Cheng Lian <lian@databricks.com>
2015-02-05 15:29:56 -0800
Commit: a9ed511, github.com/apache/spark/pull/4308
[SPARK-5604][MLLIB] remove checkpointDir from LDA
Xiangrui Meng <meng@databricks.com>
2015-02-05 15:07:33 -0800
Commit: c19152c, github.com/apache/spark/pull/4390
[SPARK-5460][MLlib] Wrapped `Try` around `deleteAllCheckpoints` - RandomForest.
x1- <viva008@gmail.com>
2015-02-05 15:02:04 -0800
Commit: 62371ad, github.com/apache/spark/pull/4347
[SPARK-5135][SQL] Add support for describe table to DDL in SQLContext
OopsOutOfMemory <victorshengli@126.com>
2015-02-05 13:07:48 -0800
Commit: 4d8d070, github.com/apache/spark/pull/4227
[SPARK-5617][SQL] fix test failure of SQLQuerySuite
wangfei <wangfei1@huawei.com>
2015-02-05 12:44:12 -0800
Commit: a83936e, github.com/apache/spark/pull/4395
[Branch-1.3] [DOC] doc fix for date
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-05 12:42:27 -0800
Commit: 6fa4ac1, github.com/apache/spark/pull/4400
SPARK-5548: Fixed a race condition in AkkaUtilsSuite
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-02-05 12:00:04 -0800
Commit: 081ac69, github.com/apache/spark/pull/4343
[SPARK-5474][Build]curl should support URL redirection in build/mvn
GuoQiang Li <witgo@qq.com>
2015-02-05 12:03:13 -0800
Commit: 3414754, github.com/apache/spark/pull/4263
[SPARK-5608] Improve SEO of Spark documentation pages
Matei Zaharia <matei@databricks.com>
2015-02-05 11:12:50 -0800
Commit: 4d74f06, github.com/apache/spark/pull/4381
SPARK-4687. Add a recursive option to the addFile API
Sandy Ryza <sandy@cloudera.com>
2015-02-05 10:15:55 -0800
Commit: c4b1108, github.com/apache/spark/pull/3670
[HOTFIX] MLlib build break.
Reynold Xin <rxin@databricks.com>
2015-02-05 00:42:50 -0800
Commit: 6580929
[MLlib] Minor: UDF style update.
Reynold Xin <rxin@databricks.com>
2015-02-04 23:57:53 -0800
Commit: c3ba4d4, github.com/apache/spark/pull/4388
[SPARK-5612][SQL] Move DataFrame implicit functions into SQLContext.implicits.
Reynold Xin <rxin@databricks.com>
2015-02-04 23:44:34 -0800
Commit: 7d789e1, github.com/apache/spark/pull/4386
[SPARK-5606][SQL] Support plus sign in HiveContext
q00251598 <qiyadong@huawei.com>
2015-02-04 23:16:01 -0800
Commit: 9d3a75e, github.com/apache/spark/pull/4378
[SPARK-5599] Check MLlib public APIs for 1.3
Xiangrui Meng <meng@databricks.com>
2015-02-04 23:03:47 -0800
Commit: db34690, github.com/apache/spark/pull/4377
[SPARK-5596] [mllib] ML model import/export for GLMs, NaiveBayes
Joseph K. Bradley <joseph@databricks.com>
2015-02-04 22:46:48 -0800
Commit: 975bcef, github.com/apache/spark/pull/4233
SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2.
Patrick Wendell <patrick@databricks.com>
2015-02-04 22:39:44 -0800
Commit: c23ac03, github.com/apache/spark/pull/4383
[SPARK-5602][SQL] Better support for creating DataFrame from local data collection
Reynold Xin <rxin@databricks.com>
2015-02-04 19:53:57 -0800
Commit: 84acd08, github.com/apache/spark/pull/4372
[SPARK-5538][SQL] Fix flaky CachedTableSuite
Reynold Xin <rxin@databricks.com>
2015-02-04 19:52:41 -0800
Commit: 206f9bc, github.com/apache/spark/pull/4379
[SQL][DataFrame] Minor cleanup.
Reynold Xin <rxin@databricks.com>
2015-02-04 19:51:48 -0800
Commit: 6b4c7f0, github.com/apache/spark/pull/4374
[SPARK-4520] [SQL] This pr fixes the ArrayIndexOutOfBoundsException as r...
Sadhan Sood <sadhan@tellapart.com>
2015-02-04 19:18:06 -0800
Commit: dba98bf, github.com/apache/spark/pull/4148
[SPARK-5605][SQL][DF] Allow using String to specify column name in DSL aggregate functions
Reynold Xin <rxin@databricks.com>
2015-02-04 18:35:51 -0800
Commit: 1fbd124, github.com/apache/spark/pull/4376
[SPARK-5411] Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext
Josh Rosen <joshrosen@databricks.com>
2015-02-04 17:18:03 -0800
Commit: 9a7ce70, github.com/apache/spark/pull/4111
[SPARK-5577] Python udf for DataFrame
Davies Liu <davies@databricks.com>
2015-02-04 15:55:09 -0800
Commit: dc101b0, github.com/apache/spark/pull/4351
[SPARK-5118][SQL] Fix: create table test stored as parquet as select ..
guowei2 <guowei2@asiainfo.com>
2015-02-04 15:26:10 -0800
Commit: e0490e2, github.com/apache/spark/pull/3921
[SQL] Use HiveContext's sessionState in HiveMetastoreCatalog.hiveDefaultTableFilePath
Yin Huai <yhuai@databricks.com>
2015-02-04 15:22:40 -0800
Commit: 548c9c2, github.com/apache/spark/pull/4355
[SQL] Correct the default size of TimestampType and expose NumericType
Yin Huai <yhuai@databricks.com>
2015-02-04 15:14:49 -0800
Commit: 0d81645, github.com/apache/spark/pull/4314
[SQL][Hiveconsole] Bring hive console code up to date and update README.md
OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
2015-02-04 15:13:54 -0800
Commit: b73d5ff, github.com/apache/spark/pull/4330
[SPARK-5367][SQL] Support star expression in udfs
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
2015-02-04 15:12:07 -0800
Commit: 417d111, github.com/apache/spark/pull/4353
[SPARK-5426][SQL] Add SparkSQL Java API helper methods.
kul <kuldeep.bora@gmail.com>
2015-02-04 15:08:37 -0800
Commit: 424cb69, github.com/apache/spark/pull/4243
[SPARK-5587][SQL] Support change database owner
wangfei <wangfei1@huawei.com>
2015-02-04 14:35:12 -0800
Commit: b90dd39, github.com/apache/spark/pull/4357
[SPARK-5591][SQL] Fix NoSuchObjectException for CTAS
wangfei <wangfei1@huawei.com>
2015-02-04 14:33:07 -0800
Commit: a9f0db1, github.com/apache/spark/pull/4365
[SPARK-4939] move to next locality when no pending tasks
Davies Liu <davies@databricks.com>
2015-02-04 14:22:07 -0800
Commit: 0a89b15, github.com/apache/spark/pull/3779
[SPARK-4707][STREAMING] Reliable Kafka Receiver can lose data if the blo...
Hari Shreedharan <hshreedharan@apache.org>
2015-02-04 14:20:44 -0800
Commit: f0500f9, github.com/apache/spark/pull/3655
[SPARK-4964] [Streaming] Exactly-once semantics for Kafka
cody koeninger <cody@koeninger.org>
2015-02-04 12:06:34 -0800
Commit: b0c0021, github.com/apache/spark/pull/3798
[SPARK-5588] [SQL] support select/filter by SQL expression
Davies Liu <davies@databricks.com>
2015-02-04 11:34:46 -0800
Commit: ac0b2b7, github.com/apache/spark/pull/4359
[SPARK-5585] Flaky test in MLlib python
Davies Liu <davies@databricks.com>
2015-02-04 08:54:20 -0800
Commit: 38a416f, github.com/apache/spark/pull/4358
[SPARK-5574] use given name prefix in dir
Imran Rashid <irashid@cloudera.com>
2015-02-04 01:02:20 -0800
Commit: 5aa0f21, github.com/apache/spark/pull/4344
[Minor] Fix incorrect warning log
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-04 00:52:41 -0800
Commit: a74cbbf, github.com/apache/spark/pull/4360
[SPARK-5379][Streaming] Add awaitTerminationOrTimeout
zsxwing <zsxwing@gmail.com>
2015-02-04 00:40:28 -0800
Commit: 4cf4cba, github.com/apache/spark/pull/4171
[SPARK-5341] Use maven coordinates as dependencies in spark-shell and spark-submit
Burak Yavuz <brkyvz@gmail.com>
2015-02-03 22:39:17 -0800
Commit: 6aed719, github.com/apache/spark/pull/4215
[SPARK-4939] revive offers periodically in LocalBackend
Davies Liu <davies@databricks.com>
2015-02-03 22:30:23 -0800
Commit: 83de71c, github.com/apache/spark/pull/4147
[SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming
freeman <the.freeman.lab@gmail.com>
2015-02-03 22:24:30 -0800
Commit: 242b4f0, github.com/apache/spark/pull/3803
[SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
Reynold Xin <rxin@databricks.com>
2015-02-03 22:15:35 -0800
Commit: 40c4cb2, github.com/apache/spark/pull/4348
[FIX][MLLIB] fix seed handling in Python GMM
Xiangrui Meng <meng@databricks.com>
2015-02-03 20:39:11 -0800
Commit: eb15631, github.com/apache/spark/pull/4349
[SPARK-4795][Core] Redesign the "primitive type => Writable" implicit APIs to make them be activated automatically
zsxwing <zsxwing@gmail.com>
2015-02-03 20:17:12 -0800
Commit: d37978d, github.com/apache/spark/pull/3642
[SPARK-5578][SQL][DataFrame] Provide a convenient way for Scala users to use UDFs
Reynold Xin <rxin@databricks.com>
2015-02-03 20:07:46 -0800
Commit: 1077f2e, github.com/apache/spark/pull/4345
[SPARK-5520][MLlib] Make FP-Growth implementation take generic item types (WIP)
Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
2015-02-03 17:02:42 -0800
Commit: e380d2d, github.com/apache/spark/pull/4340
[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
Davies Liu <davies@databricks.com>
2015-02-03 16:01:56 -0800
Commit: 068c0e2, github.com/apache/spark/pull/4331
[STREAMING] SPARK-4986 Wait for receivers to deregister and receiver job to terminate
Jesper Lundgren <jesper.lundgren@vpon.com>
2015-02-03 14:53:39 -0800
Commit: 1e8b539, github.com/apache/spark/pull/4338
[SPARK-5153][Streaming][Test] Increased timeout to deal with flaky KafkaStreamSuite
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-03 13:46:02 -0800
Commit: 681f9df, github.com/apache/spark/pull/4342
[SPARK-4508] [SQL] build native date type to conform behavior to Hive
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-03 12:21:45 -0800
Commit: db821ed, github.com/apache/spark/pull/4325
[SPARK-5383][SQL] Support alias for udtfs
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2015-02-03 12:16:31 -0800
Commit: 5adbb39, github.com/apache/spark/pull/4186
[SPARK-5550] [SQL] Support the case insensitive for UDF
Cheng Hao <hao.cheng@intel.com>
2015-02-03 12:12:26 -0800
Commit: ca7a6cd, github.com/apache/spark/pull/4326
[SPARK-4987] [SQL] parquet timestamp type support
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-03 12:06:06 -0800
Commit: 0c20ce6, github.com/apache/spark/pull/3820
Release 1.3.1
[SQL] Use path.makeQualified in newParquet.
Yin Huai <yhuai@databricks.com>
2015-04-04 23:26:10 +0800
Commit: eb57d4f, github.com/apache/spark/pull/5353
[SPARK-6700] disable flaky test
Davies Liu <davies@databricks.com>
2015-04-03 15:22:21 -0700
Commit: 3366af6, github.com/apache/spark/pull/5356
[SPARK-6688] [core] Always use resolved URIs in EventLoggingListener.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-03 11:54:31 -0700
Commit: f17a2fe, github.com/apache/spark/pull/5340
[SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
Yin Huai <yhuai@databricks.com>
2015-04-03 14:40:36 +0800
Commit: 0c1b78b, github.com/apache/spark/pull/5339
[SPARK-6621][Core] Fix the bug that calling EventLoop.stop in EventLoop.onReceive/onError/onStart doesn't call onStop
zsxwing <zsxwing@gmail.com>
2015-04-02 22:54:30 -0700
Commit: ac705aa, github.com/apache/spark/pull/5280
[SPARK-6345][STREAMING][MLLIB] Fix for training with prediction
freeman <the.freeman.lab@gmail.com>
2015-04-02 21:37:44 -0700
Commit: d21f779, github.com/apache/spark/pull/5037
[CORE] The description of jobHistory config should be spark.history.fs.logDirectory
KaiXinXiaoLei <huleilei1@huawei.com>
2015-04-02 20:24:31 -0700
Commit: 17ab6b0, github.com/apache/spark/pull/5332
[SPARK-6575][SQL] Converted Parquet Metastore tables no longer cache metadata
Yin Huai <yhuai@databricks.com>
2015-04-02 20:23:08 -0700
Commit: 0c1c0fb, github.com/apache/spark/pull/5339
[SPARK-6650] [core] Stop ExecutorAllocationManager when context stops.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-02 19:48:55 -0700
Commit: 0ef46b2, github.com/apache/spark/pull/5311
[SPARK-6686][SQL] Use resolved output instead of names for toDF rename
Michael Armbrust <michael@databricks.com>
2015-04-02 18:30:55 -0700
Commit: 2927af1, github.com/apache/spark/pull/5337
[SPARK-6672][SQL] convert row to catalyst in createDataFrame(RDD[Row], ...)
Xiangrui Meng <meng@databricks.com>
2015-04-02 17:57:01 +0800
Commit: c2694bb, github.com/apache/spark/pull/5329
[SPARK-6618][SPARK-6669][SQL] Lock Hive metastore client correctly.
Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
2015-04-02 16:46:50 -0700
Commit: e6ee95c, github.com/apache/spark/pull/5333
[Minor] [SQL] Follow-up of PR #5210
Cheng Lian <lian@databricks.com>
2015-04-02 16:15:34 -0700
Commit: 4f1fe3f, github.com/apache/spark/pull/5219
[SPARK-6655][SQL] We need to read the schema of a data source table stored in spark.sql.sources.schema property
Yin Huai <yhuai@databricks.com>
2015-04-02 16:02:31 -0700
Commit: aecec07, github.com/apache/spark/pull/5313
[SQL] Throw UnsupportedOperationException instead of NotImplementedError
Michael Armbrust <michael@databricks.com>
2015-04-02 16:01:03 -0700
Commit: 78ba245, github.com/apache/spark/pull/5315
SPARK-6414: Spark driver failed with NPE on job cancelation
Hung Lin <hung.lin@gmail.com>
2015-04-02 14:01:43 -0700
Commit: 58e2b3f, github.com/apache/spark/pull/5124
[SPARK-6079] Use index to speed up StatusTracker.getJobIdsForGroup()
Josh Rosen <joshrosen@databricks.com>
2015-03-25 17:40:00 -0700
Commit: a6664dc, github.com/apache/spark/pull/4830
[SPARK-6667] [PySpark] remove setReuseAddress
Davies Liu <davies@databricks.com>
2015-04-02 12:18:33 -0700
Commit: ee2bd70, github.com/apache/spark/pull/5324
Revert "[SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock"
Cheng Lian <lian@databricks.com>
2015-04-02 12:59:38 +0800
Commit: 1160cc9
[SQL] SPARK-6658: Update DataFrame documentation to refer to correct types
Michael Armbrust <michael@databricks.com>
2015-04-01 18:00:07 -0400
Commit: 223dd3f
[SPARK-6578] Small rewrite to make the logic more clear in MessageWithHeader.transferTo.
Reynold Xin <rxin@databricks.com>
2015-04-01 18:36:06 -0700
Commit: d697b76, github.com/apache/spark/pull/5319
[SPARK-6660][MLLIB] pythonToJava doesn't recognize object arrays
Xiangrui Meng <meng@databricks.com>
2015-04-01 18:17:07 -0700
Commit: 0d1e476, github.com/apache/spark/pull/5318
[SPARK-6553] [pyspark] Support functools.partial as UDF
ksonj <kson@siberie.de>
2015-04-01 17:23:57 -0700
Commit: 98f72df, github.com/apache/spark/pull/5206
[SPARK-6642][MLLIB] use 1.2 lambda scaling and remove addImplicit from NormalEquation
Xiangrui Meng <meng@databricks.com>
2015-04-01 16:47:18 -0700
Commit: bc04fa2, github.com/apache/spark/pull/5314
[SPARK-6578] [core] Fix thread-safety issue in outbound path of network library.
Marcelo Vanzin <vanzin@cloudera.com>
2015-04-01 16:06:11 -0700
Commit: 1c31ebd, github.com/apache/spark/pull/5234
[SPARK-6657] [Python] [Docs] fixed python doc build warnings
Joseph K. Bradley <joseph@databricks.com>
2015-04-01 15:15:47 -0700
Commit: e347a7a, github.com/apache/spark/pull/5317
[SPARK-6651][MLLIB] delegate dense vector arithmetics to the underlying numpy array
Xiangrui Meng <meng@databricks.com>
2015-04-01 13:29:04 -0700
Commit: f50d95a, github.com/apache/spark/pull/5312
SPARK-6626 [DOCS]: Corrected Scala:TwitterUtils parameters
jayson <jayson@ziprecruiter.com>
2015-04-01 11:12:55 +0100
Commit: 7d029cb, github.com/apache/spark/pull/5295
[Doc] Improve Python DataFrame documentation
Reynold Xin <rxin@databricks.com>
2015-03-31 18:31:36 -0700
Commit: e527b35, github.com/apache/spark/pull/5287
[SPARK-6614] OutputCommitCoordinator should clear authorized committer only after authorized committer fails, not after any failure
Josh Rosen <joshrosen@databricks.com>
2015-03-31 16:18:39 -0700
Commit: c4c982a, github.com/apache/spark/pull/5276
[SPARK-6633][SQL] Should be "Contains" instead of "EndsWith" when constructing sources.StringContains
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-31 13:18:07 -0700
Commit: d851646, github.com/apache/spark/pull/5299
[SPARK-5371][SQL] Propagate types after function conversion, before further resolution
Michael Armbrust <michael@databricks.com>
2015-03-31 11:34:29 -0700
Commit: 5a957fe, github.com/apache/spark/pull/5278
[SPARK-6145][SQL] fix ORDER BY on nested fields
Michael Armbrust <michael@databricks.com>
2015-03-31 11:23:18 -0700
Commit: 045228f, github.com/apache/spark/pull/5189
[SPARK-6575] [SQL] Adds configuration to disable schema merging while converting metastore Parquet tables
Cheng Lian <lian@databricks.com>
2015-03-31 11:21:15 -0700
Commit: 778c876, github.com/apache/spark/pull/5231
[SPARK-6555] [SQL] Overrides equals() and hashCode() for MetastoreRelation
Cheng Lian <lian@databricks.com>
2015-03-31 11:18:25 -0700
Commit: 9ebefb1, github.com/apache/spark/pull/5289
[SPARK-6618][SQL] HiveMetastoreCatalog.lookupRelation should use fine-grained lock
Yin Huai <yhuai@databricks.com>
2015-03-31 16:28:40 +0800
Commit: fd600ce, github.com/apache/spark/pull/5281
[SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.
Reynold Xin <rxin@databricks.com>
2015-03-31 00:25:23 -0700
Commit: cf651a4, github.com/apache/spark/pull/5284
[SPARK-6625][SQL] Add common string filters to data sources.
Reynold Xin <rxin@databricks.com>
2015-03-31 00:19:51 -0700
Commit: a97d4e6, github.com/apache/spark/pull/5285
[SPARK-6119][SQL] DataFrame support for missing data handling
Reynold Xin <rxin@databricks.com>
2015-03-30 20:47:10 -0700
Commit: 67c885e, github.com/apache/spark/pull/5274
[SPARK-6369] [SQL] Uses commit coordinator to help committing Hive and Parquet tables
Cheng Lian <lian@databricks.com>
2015-03-31 07:48:37 +0800
Commit: fedbfc7, github.com/apache/spark/pull/5139
[SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() and applySchema
Davies Liu <davies@databricks.com>
2015-03-30 15:47:00 -0700
Commit: 30e7c63, github.com/apache/spark/pull/5273
[SPARK-6592][SQL] fix filter for scaladoc to generate API doc for Row class under catalyst dir
CodingCat <zhunansjtu@gmail.com>
2015-03-30 11:54:44 -0700
Commit: f9d4efa, github.com/apache/spark/pull/5252
[SPARK-6571][MLLIB] use wrapper in MatrixFactorizationModel.load
Xiangrui Meng <meng@databricks.com>
2015-03-28 15:08:05 -0700
Commit: 93a7166, github.com/apache/spark/pull/5243
[SPARK-6595][SQL] MetastoreRelation should be a MultiInstanceRelation
Michael Armbrust <michael@databricks.com>
2015-03-30 22:24:12 +0800
Commit: c411530, github.com/apache/spark/pull/5251
[SPARK-6558] Utils.getCurrentUserName returns the full principal name instead of login name
Thomas Graves <tgraves@apache.org>
2015-03-29 12:43:30 +0100
Commit: f8132de, github.com/apache/spark/pull/5229
[SPARK-5750][SPARK-3441][SPARK-5836][CORE] Added documentation explaining shuffle
Ilya Ganelin <ilya.ganelin@capitalone.com>, Ilya Ganelin <ilganeli@gmail.com>
2015-03-30 11:52:02 +0100
Commit: 1c59a4b, github.com/apache/spark/pull/5074
[spark-sql] a better exception message than "scala.MatchError" for unsupported types in Schema creation
Eran Medan <ehrann.mehdan@gmail.com>
2015-03-30 00:02:52 -0700
Commit: 4859c40, github.com/apache/spark/pull/5235
[HOTFIX] Build break due to NoRelation cherry-pick.
Reynold Xin <rxin@databricks.com>
2015-03-29 12:07:28 -0700
Commit: 6181366
[DOC] Improvements to Python docs.
Reynold Xin <rxin@databricks.com>
2015-03-28 23:59:27 -0700
Commit: 3db0844, github.com/apache/spark/pull/5238
[SPARK-6538][SQL] Add missing nullable Metastore fields when merging a Parquet schema
Adam Budde <budde@amazon.com>
2015-03-28 09:14:09 +0800
Commit: 5e04f45, github.com/apache/spark/pull/5214
[SPARK-6564][SQL] SQLContext.emptyDataFrame should contain 0 row, not 1 row
Reynold Xin <rxin@databricks.com>
2015-03-27 14:56:57 -0700
Commit: 7006858, github.com/apache/spark/pull/5226
[SPARK-6544][build] Increment Avro version from 1.7.6 to 1.7.7
Dean Chen <deanchen5@gmail.com>
2015-03-27 14:32:51 +0000
Commit: fefd49f, github.com/apache/spark/pull/5193
[SPARK-6574] [PySpark] fix sql example
Davies Liu <davies@databricks.com>
2015-03-27 11:42:26 -0700
Commit: b902a95, github.com/apache/spark/pull/5230
[SPARK-6550][SQL] Use analyzed plan in DataFrame
Michael Armbrust <michael@databricks.com>
2015-03-27 11:40:00 -0700
Commit: bc75189, github.com/apache/spark/pull/5217
[SPARK-6341][mllib] Upgrade breeze from 0.11.1 to 0.11.2
Yu ISHIKAWA <yuu.ishikawa@gmail.com>
2015-03-27 00:15:02 -0700
Commit: b318858, github.com/apache/spark/pull/5222
[DOCS][SQL] Fix JDBC example
Michael Armbrust <michael@databricks.com>
2015-03-26 14:51:46 -0700
Commit: 54d92b5, github.com/apache/spark/pull/5192
[SPARK-6554] [SQL] Don't push down predicates which reference partition column(s)
Cheng Lian <lian@databricks.com>
2015-03-26 13:11:37 -0700
Commit: 3d54578, github.com/apache/spark/pull/5210
[SPARK-6117] [SQL] Improvements to DataFrame.describe()
Reynold Xin <rxin@databricks.com>
2015-03-26 12:26:13 -0700
Commit: 28e3a1e, github.com/apache/spark/pull/5201
[SPARK-6117] [SQL] add describe function to DataFrame for summary statis...
azagrebin <azagrebin@gmail.com>
2015-03-26 00:25:04 -0700
Commit: 84735c3, github.com/apache/spark/pull/5073
SPARK-6480 [CORE] histogram() bucket function is wrong in some simple edge cases
Sean Owen <sowen@cloudera.com>
2015-03-26 15:00:23 +0000
Commit: aa2d157, github.com/apache/spark/pull/5148
[SPARK-6491] Spark will put the current working dir to the CLASSPATH
guliangliang <guliangliang@qiyi.com>
2015-03-26 13:28:56 +0000
Commit: 5b5f0e2, github.com/apache/spark/pull/5156
[SQL][SPARK-6471]: Metastore schema should only be a subset of parquet schema to support dropping of columns using replace columns
Yash Datta <Yash.Datta@guavus.com>
2015-03-26 21:13:38 +0800
Commit: 836c921, github.com/apache/spark/pull/5141
[SPARK-6465][SQL] Fix serialization of GenericRowWithSchema using kryo
Michael Armbrust <michael@databricks.com>
2015-03-26 18:46:57 +0800
Commit: 8254996, github.com/apache/spark/pull/5191
[SPARK-6536] [PySpark] Column.inSet() in Python
Davies Liu <davies@databricks.com>
2015-03-26 00:01:24 -0700
Commit: 0ba7599, github.com/apache/spark/pull/5190
[SPARK-6463][SQL] AttributeSet.equal should compare size
sisihj <jun.hejun@huawei.com>, Michael Armbrust <michael@databricks.com>
2015-03-25 19:21:54 -0700
Commit: 9edb34f, github.com/apache/spark/pull/5194
[SPARK-6450] [SQL] Fixes metastore Parquet table conversion
Cheng Lian <lian@databricks.com>
2015-03-25 17:40:19 -0700
Commit: 0cd4748, github.com/apache/spark/pull/5183
[SPARK-6409][SQL] It is not necessary that avoid old interface of hive, because this will make some UDAF can not work.
DoingDone9 <799203320@qq.com>
2015-03-25 11:11:52 -0700
Commit: 4efa6c5, github.com/apache/spark/pull/5131
SPARK-6063 MLlib doesn't pass mvn scalastyle check due to UTF chars in LDAModel.scala
Michael Griffiths <msjgriffiths@gmail.com>, Griffiths, Michael (NYC-RPM) <michael.griffiths@reprisemedia.com>
2015-02-28 14:47:39 +0000
Commit: 6791f42, github.com/apache/spark/pull/4815
[SPARK-6496] [MLLIB] GeneralizedLinearAlgorithm.run(input, initialWeights) should initialize numFeatures
Yanbo Liang <ybliang8@gmail.com>
2015-03-25 17:05:56 +0000
Commit: 2be4255, github.com/apache/spark/pull/5167
[DOCUMENTATION]Fixed Missing Type Import in Documentation
Bill Chambers <wchambers@ischool.berkeley.edu>, anabranch <wac.chambers@gmail.com>
2015-03-24 22:24:35 -0700
Commit: 8e4e2e3, github.com/apache/spark/pull/5179
[SPARK-6469] Improving documentation on YARN local directories usage
Christophe Préaud <christophe.preaud@kelkoo.com>
2015-03-24 17:05:49 -0700
Commit: 6af9408, github.com/apache/spark/pull/5165
[SPARK-3570] Include time to open files in shuffle write time.
Kay Ousterhout <kayousterhout@gmail.com>
2015-03-24 16:29:40 -0700
Commit: e4db5a3, github.com/apache/spark/pull/4550
[SPARK-6088] Correct how tasks that get remote results are shown in UI.
Kay Ousterhout <kayousterhout@gmail.com>
2015-03-24 16:26:43 -0700
Commit: de8b2d4, github.com/apache/spark/pull/4839
[SPARK-6428][SQL] Added explicit types for all public methods in catalyst
Reynold Xin <rxin@databricks.com>
2015-03-24 16:03:55 -0700
Commit: 586e0d9, github.com/apache/spark/pull/5162
[SPARK-6209] Clean up connections in ExecutorClassLoader after failing to load classes (master branch PR)
Josh Rosen <joshrosen@databricks.com>
2015-03-24 14:38:20 -0700
Commit: dcf56aa, github.com/apache/spark/pull/4944
[SPARK-6458][SQL] Better error messages for invalid data sources
Michael Armbrust <michael@databricks.com>
2015-03-24 14:10:56 -0700
Commit: f48c16d, github.com/apache/spark/pull/5158
[SPARK-6376][SQL] Avoid eliminating subqueries until optimization
Michael Armbrust <michael@databricks.com>
2015-03-24 14:08:20 -0700
Commit: df671bc, github.com/apache/spark/pull/5160
[SPARK-6375][SQL] Fix formatting of error messages.
Michael Armbrust <michael@databricks.com>
2015-03-24 13:22:46 -0700
Commit: 92bf888, github.com/apache/spark/pull/5155
Revert "[SPARK-5680][SQL] Sum function on all null values, should return zero"
Michael Armbrust <michael@databricks.com>
2015-03-24 12:32:25 -0700
Commit: 930b667
[SPARK-6054][SQL] Fix transformations of TreeNodes that hold StructTypes
Michael Armbrust <michael@databricks.com>
2015-03-24 12:28:01 -0700
Commit: c699e2b, github.com/apache/spark/pull/5157
[SPARK-6437][SQL] Use completion iterator to close external sorter
Michael Armbrust <michael@databricks.com>
2015-03-24 12:10:30 -0700
Commit: c0101d3, github.com/apache/spark/pull/5161
[SPARK-6459][SQL] Warn when constructing trivially true equals predicate
Michael Armbrust <michael@databricks.com>
2015-03-24 12:09:02 -0700
Commit: f0141ca, github.com/apache/spark/pull/5163
[SPARK-5955][MLLIB] add checkpointInterval to ALS
Xiangrui Meng <meng@databricks.com>
2015-03-20 15:02:57 -0400
Commit: bc92a2e, github.com/apache/spark/pull/5076
[ML][docs][minor] Define LabeledDocument/Document classes in CV example
Peter Rudenko <petro.rudenko@gmail.com>
2015-03-24 16:33:38 +0000
Commit: 4ff5771, github.com/apache/spark/pull/5135
[SPARK-5559] [Streaming] [Test] Remove opportunity we met flakiness when running FlumeStreamSuite
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-03-24 16:13:25 +0000
Commit: 8722369, github.com/apache/spark/pull/4337
Update the command to use IPython notebook
Cong Yue <yuecong1104@gmail.com>
2015-03-24 12:56:13 +0000
Commit: e545143, github.com/apache/spark/pull/5111
[SPARK-6452] [SQL] Checks for missing attributes and unresolved operator for all types of operator
Cheng Lian <lian@databricks.com>
2015-03-24 01:12:11 -0700
Commit: 6f10142, github.com/apache/spark/pull/5129
[SPARK-6124] Support jdbc connection properties in OPTIONS part of the query
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-03-23 17:00:27 -0700
Commit: 04b2078, github.com/apache/spark/pull/4859
[SPARK-6397][SQL] Check the missingInput simply
Yadong Qi <qiyadong2010@gmail.com>
2015-03-23 18:16:49 +0800
Commit: a29f493, github.com/apache/spark/pull/5132
[SPARK-4985] [SQL] parquet support for date type
Daoyuan Wang <daoyuan.wang@intel.com>
2015-03-23 11:46:16 +0800
Commit: 60b9b96, github.com/apache/spark/pull/3822
[SPARK-6337][Documentation, SQL]Spark 1.3 doc fixes
vinodkc <vinod.kc.in@gmail.com>
2015-03-22 20:00:08 +0000
Commit: 857e8a6, github.com/apache/spark/pull/5112
SPARK-6454 [DOCS] Fix links to pyspark api
Kamil Smuga <smugakamil@gmail.com>, stderr <smugakamil@gmail.com>
2015-03-22 15:56:25 +0000
Commit: 3ba295f, github.com/apache/spark/pull/5120
[SPARK-6408] [SQL] Fix JDBCRDD filtering string literals
ypcat <ypcat6@gmail.com>, Pei-Lun Lee <pllee@appier.com>
2015-03-22 15:49:13 +0800
Commit: e60fbf6, github.com/apache/spark/pull/5087
[SPARK-6428][SQL] Added explicit type for all public methods for Hive module
Reynold Xin <rxin@databricks.com>
2015-03-21 14:30:04 -0700
Commit: 0021d22, github.com/apache/spark/pull/5108
[SPARK-6428][SQL] Added explicit type for all public methods in sql/core
Reynold Xin <rxin@databricks.com>
2015-03-20 15:47:07 -0700
Commit: c964588, github.com/apache/spark/pull/5104
[SPARK-6250][SPARK-6146][SPARK-5911][SQL] Types are now reserved words in DDL parser.
Yin Huai <yhuai@databricks.com>
2015-03-21 13:27:53 -0700
Commit: 102daaf, github.com/apache/spark/pull/5078
[SPARK-5680][SQL] Sum function on all null values, should return zero
Venkata Ramana G <ramana.gollamudihuawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-03-21 13:24:24 -0700
Commit: 93975a3, github.com/apache/spark/pull/4466
[SPARK-5320][SQL]Add statistics method at NoRelation (override super).
x1- <viva008@gmail.com>
2015-03-21 13:22:34 -0700
Commit: cba6842, github.com/apache/spark/pull/5105
[SPARK-5821] [SQL] JSON CTAS command should throw error message when delete path failure
Yanbo Liang <ybliang8@gmail.com>, Yanbo Liang <yanbohappy@gmail.com>
2015-03-21 11:23:28 +0800
Commit: 8de90c7, github.com/apache/spark/pull/4610
[SPARK-6315] [SQL] Also tries the case class string parser while reading Parquet schema
Cheng Lian <lian@databricks.com>
2015-03-21 11:18:45 +0800
Commit: b75943f, github.com/apache/spark/pull/5034
[SPARK-5821] [SQL] ParquetRelation2 CTAS should check if delete is successful
Yanbo Liang <ybliang8@gmail.com>
2015-03-21 10:53:04 +0800
Commit: df83e21, github.com/apache/spark/pull/5107
[SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWeights correctly
lewuathe <lewuathe@me.com>
2015-03-20 17:18:18 -0400
Commit: aff9f8d, github.com/apache/spark/pull/5101
[SPARK-6286][Mesos][minor] Handle missing Mesos case TASK_ERROR
Jongyoul Lee <jongyoul@gmail.com>
2015-03-20 12:24:34 +0000
Commit: db812d9, github.com/apache/spark/pull/5088
[SPARK-6222][Streaming] Don't delete checkpoint data when doing pre-batch-start checkpoint
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-19 02:15:50 -0400
Commit: 03e263f, github.com/apache/spark/pull/5008
[SPARK-6325] [core,yarn] Do not change target executor count when killing executors.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-18 09:18:28 -0400
Commit: 1723f05, github.com/apache/spark/pull/5018
[SPARK-6286][minor] Handle missing Mesos case TASK_ERROR.
Iulian Dragos <jaguarul@gmail.com>
2015-03-18 09:15:33 -0400
Commit: ff0a7f4, github.com/apache/spark/pull/5000
[SPARK-6247][SQL] Fix resolution of ambiguous joins caused by new aliases
Michael Armbrust <michael@databricks.com>
2015-03-17 19:47:51 -0700
Commit: ba8352c, github.com/apache/spark/pull/5062
[SPARK-6383][SQL]Fixed compiler and errors in Dataframe examples
Tijo Thomas <tijoparacka@gmail.com>
2015-03-17 18:50:19 -0700
Commit: cee6d08, github.com/apache/spark/pull/5068
[SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTable should be "error" instead of "append".
Yin Huai <yhuai@databricks.com>
2015-03-18 09:41:06 +0800
Commit: 3ea38bc, github.com/apache/spark/pull/5053
[SPARK-6330] [SQL] Add a test case for SPARK-6330
Pei-Lun Lee <pllee@appier.com>
2015-03-18 08:34:46 +0800
Commit: 9d88f0c, github.com/apache/spark/pull/5039
[SPARK-6336] LBFGS should document what convergenceTol means
lewuathe <lewuathe@me.com>
2015-03-17 12:11:57 -0700
Commit: 476c4e1, github.com/apache/spark/pull/5033
[SPARK-6365] jetty-security also needed for SPARK_PREPEND_CLASSES to work
Imran Rashid <irashid@cloudera.com>
2015-03-17 12:03:54 -0500
Commit: ac0e7cc, github.com/apache/spark/pull/5071
[SPARK-6313] Add config option to disable file locks/fetchFile cache to ...
nemccarthy <nathan@nemccarthy.me>
2015-03-17 09:33:11 -0700
Commit: febb123, github.com/apache/spark/pull/5036
[SPARK-3266] Use intermediate abstract classes to fix type erasure issues in Java APIs
Josh Rosen <joshrosen@databricks.com>
2015-03-17 09:18:57 -0700
Commit: 29e39e1, github.com/apache/spark/pull/5050
[SPARK-6331] Load new master URL if present when recovering streaming context from checkpoint
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-17 05:31:27 -0700
Commit: 95f8d1c, github.com/apache/spark/pull/5024
[SQL][docs][minor] Fixed sample code in SQLContext scaladoc
Lomig Mégard <lomig.megard@gmail.com>
2015-03-16 23:52:42 -0700
Commit: 426816b, github.com/apache/spark/pull/5051
[SPARK-6299][CORE] ClassNotFoundException in standalone mode when running groupByKey with class defined in REPL
Kevin (Sangwoo) Kim <sangwookim.me@gmail.com>
2015-03-16 23:49:23 -0700
Commit: 5c16ced, github.com/apache/spark/pull/5046
[SPARK-6077] Remove streaming tab while stopping StreamingContext
lisurprise <zhichao.li@intel.com>
2015-03-16 13:10:32 -0700
Commit: 47cce98, github.com/apache/spark/pull/4828
[SPARK-6330] Fix filesystem bug in newParquet relation
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-03-16 12:13:18 -0700
Commit: 67fa6d1, github.com/apache/spark/pull/5020
SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception
Sean Owen <sowen@cloudera.com>
2015-03-11 14:09:09 +0000
Commit: 684ff24, github.com/apache/spark/pull/4971
[SPARK-6300][Spark Core] sc.addFile(path) does not support the relative path.
DoingDone9 <799203320@qq.com>
2015-03-16 12:27:15 +0000
Commit: 724aab4, github.com/apache/spark/pull/4993
[SPARK-3619] Part 2. Upgrade to Mesos 0.21 to work around MESOS-1688
Jongyoul Lee <jongyoul@gmail.com>
2015-03-15 15:46:55 +0000
Commit: 43fcab0, github.com/apache/spark/pull/4361
[SPARK-6210] [SQL] use prettyString as column name in agg()
Davies Liu <davies@databricks.com>
2015-03-14 00:43:33 -0700
Commit: ad47563, github.com/apache/spark/pull/5006
[SPARK-6275][Documentation]Miss toDF() function in docs/sql-programming-guide.md
zzcclp <xm_zzc@sina.com>
2015-03-12 15:07:15 +0000
Commit: 3012781, github.com/apache/spark/pull/4977
[SPARK-6133] Make sc.stop() idempotent
Andrew Or <andrew@databricks.com>
2015-03-03 15:09:57 -0800
Commit: a08588c, github.com/apache/spark/pull/4871
[SPARK-6132][HOTFIX] ContextCleaner InterruptedException should be quiet
Andrew Or <andrew@databricks.com>
2015-03-03 20:49:45 -0800
Commit: 338bea7, github.com/apache/spark/pull/4882
[SPARK-6132] ContextCleaner race condition across SparkContexts
Andrew Or <andrew@databricks.com>
2015-03-03 13:44:05 -0800
Commit: 3cdc8a3, github.com/apache/spark/pull/4869
[SPARK-6087][CORE] Provide actionable exception if Kryo buffer is not large enough
Lev Khomich <levkhomich@gmail.com>
2015-03-10 10:55:42 +0000
Commit: 9846790, github.com/apache/spark/pull/4947
[SPARK-6036][CORE] avoid race condition between eventlogListener and akka actor system
Zhang, Liye <liye.zhang@intel.com>
2015-02-26 23:11:43 -0800
Commit: f81611d, github.com/apache/spark/pull/4785
SPARK-4044 [CORE] Thriftserver fails to start when JAVA_HOME points to JRE instead of JDK
Sean Owen <sowen@cloudera.com>
2015-03-13 17:59:31 +0000
Commit: 4aa4132, github.com/apache/spark/pull/4981
SPARK-4300 [CORE] Race condition during SparkWorker shutdown
Sean Owen <sowen@cloudera.com>
2015-02-26 14:08:56 -0800
Commit: a3493eb, github.com/apache/spark/pull/4787
[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
Davies Liu <davies@databricks.com>
2015-03-09 16:24:06 -0700
Commit: 170af49, github.com/apache/spark/pull/4923
SPARK-4704 [CORE] SparkSubmitDriverBootstrap doesn't flush output
Sean Owen <sowen@cloudera.com>
2015-02-26 12:56:54 -0800
Commit: dbee7e1, github.com/apache/spark/pull/4788
[SPARK-6278][MLLIB] Mention the change of objective in linear regression
Xiangrui Meng <meng@databricks.com>
2015-03-13 10:27:28 -0700
Commit: 214f681, github.com/apache/spark/pull/4978
[SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide
Cheng Lian <lian@databricks.com>
2015-03-13 21:34:50 +0800
Commit: dc287f3, github.com/apache/spark/pull/5001
[mllib] [python] Add LassoModel to __all__ in regression.py
Joseph K. Bradley <joseph@databricks.com>
2015-03-12 16:46:29 -0700
Commit: 23069bd, github.com/apache/spark/pull/4970
[SPARK-6294] fix hang when call take() in JVM on PythonRDD
Davies Liu <davies@databricks.com>
2015-03-12 01:34:38 -0700
Commit: 850e694, github.com/apache/spark/pull/4987
[SPARK-6296] [SQL] Added equals to Column
Volodymyr Lyubinets <vlyubin@gmail.com>
2015-03-12 00:55:26 -0700
Commit: d9e141c, github.com/apache/spark/pull/4988
[SPARK-6128][Streaming][Documentation] Updates to Spark Streaming Programming Guide
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-11 18:48:21 -0700
Commit: bdc4682, github.com/apache/spark/pull/4956
[SPARK-6274][Streaming][Examples] Added examples streaming + sql examples.
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-11 11:19:51 -0700
Commit: ac61466, github.com/apache/spark/pull/4975
[SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide
Michael Armbrust <michael@databricks.com>
2015-03-10 18:13:09 -0700
Commit: edbcb6f, github.com/apache/spark/pull/4958
Minor doc: Remove the extra blank line in data types javadoc.
Reynold Xin <rxin@databricks.com>
2015-03-10 17:25:04 -0700
Commit: 7295192, github.com/apache/spark/pull/4955
[SPARK-5310][Doc] Update SQL Programming Guide to include DataFrames.
Reynold Xin <rxin@databricks.com>
2015-03-09 16:16:16 -0700
Commit: bc53d3d, github.com/apache/spark/pull/4954
[Docs] Replace references to SchemaRDD with DataFrame
Reynold Xin <rxin@databricks.com>
2015-03-09 13:29:19 -0700
Commit: 5e58f76, github.com/apache/spark/pull/4952
Preparing development version 1.3.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-03-05 23:02:08 +0000
Commit: c152f9a
Release 1.3.0
[SQL] Make Strategies a public developer API
Michael Armbrust <michael@databricks.com>
2015-03-05 14:50:25 -0800
Commit: 556e0de, github.com/apache/spark/pull/4920
[SPARK-6163][SQL] jsonFile should be backed by the data source API
Yin Huai <yhuai@databricks.com>
2015-03-05 14:49:44 -0800
Commit: 083fed5, github.com/apache/spark/pull/4896
[SPARK-6145][SQL] fix ORDER BY on nested fields
Wenchen Fan <cloud0fan@outlook.com>, Michael Armbrust <michael@databricks.com>
2015-03-05 14:49:01 -0800
Commit: e358f55, github.com/apache/spark/pull/4918
[SPARK-6175] Fix standalone executor log links when ephemeral ports or SPARK_PUBLIC_DNS are used
Josh Rosen <joshrosen@databricks.com>
2015-03-05 12:04:00 -0800
Commit: 988b498, github.com/apache/spark/pull/4903
SPARK-6182 [BUILD] spark-parent pom needs to be published for both 2.10 and 2.11
Sean Owen <sowen@cloudera.com>
2015-03-05 11:31:48 -0800
Commit: ae315d2, github.com/apache/spark/pull/4912
Revert "[SPARK-6153] [SQL] promote guava dep for hive-thriftserver"
Cheng Lian <lian@databricks.com>
2015-03-05 17:58:18 +0800
Commit: f8205d3
[SPARK-6153] [SQL] promote guava dep for hive-thriftserver
Daoyuan Wang <daoyuan.wang@intel.com>
2015-03-05 16:35:17 +0800
Commit: b92d925, github.com/apache/spark/pull/4884
Updating CHANGES file
Patrick Wendell <patrick@databricks.com>
2015-03-04 21:19:49 -0800
Commit: 87eac3c
SPARK-5143 [BUILD] [WIP] spark-network-yarn 2.11 depends on spark-network-shuffle 2.10
Sean Owen <sowen@cloudera.com>
2015-03-04 21:00:51 -0800
Commit: f509159, github.com/apache/spark/pull/4876
[SPARK-6149] [SQL] [Build] Excludes Guava 15 referenced by jackson-module-scala_2.10
Cheng Lian <lian@databricks.com>
2015-03-04 20:52:58 -0800
Commit: a0aa24a, github.com/apache/spark/pull/4890
[SPARK-6144] [core] Fix addFile when source files are on "hdfs:"
Marcelo Vanzin <vanzin@cloudera.com>, trystanleftwich <trystan@atscale.com>
2015-03-04 12:58:39 -0800
Commit: 3fc74f4, github.com/apache/spark/pull/4894
[SPARK-6134][SQL] Fix wrong datatype for casting FloatType and default LongType value in defaultPrimitive
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-04 20:23:43 +0800
Commit: bfa4e31, github.com/apache/spark/pull/4870
[SPARK-6136] [SQL] Removed JDBC integration tests which depends on docker-client
Cheng Lian <lian@databricks.com>
2015-03-04 19:39:02 +0800
Commit: 035243d, github.com/apache/spark/pull/4872
[SPARK-6141][MLlib] Upgrade Breeze from 0.10 to 0.11 to fix convergence bug
Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>, DB Tsai <dbtsai@dbtsai.com>
2015-03-03 23:52:02 -0800
Commit: 9f24977, github.com/apache/spark/pull/4879
[SPARK-5949] HighlyCompressedMapStatus needs more classes registered w/ kryo
Imran Rashid <irashid@cloudera.com>
2015-03-03 15:33:19 -0800
Commit: 9a0b75c, github.com/apache/spark/pull/4877
SPARK-1911 [DOCS] Warn users if their assembly jars are not built with Java 6
Sean Owen <sowen@cloudera.com>
2015-03-03 13:40:11 -0800
Commit: 8446ad0, github.com/apache/spark/pull/4874
Revert "[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file"
Andrew Or <andrew@databricks.com>
2015-03-03 13:04:15 -0800
Commit: ee4929d
Adding CHANGES.txt for Spark 1.3
Patrick Wendell <patrick@databricks.com>
2015-03-03 02:19:19 -0800
Commit: ce7158c
BUILD: Minor tweaks to internal build scripts
Patrick Wendell <patrick@databricks.com>
2015-03-03 00:38:12 -0800
Commit: ae60eb9
HOTFIX: Bump HBase version in MapR profiles.
Patrick Wendell <patrick@databricks.com>
2015-03-03 01:38:07 -0800
Commit: 1aa8461
[SPARK-5537][MLlib][Docs] Add user guide for multinomial logistic regression
DB Tsai <dbtsai@alpinenow.com>
2015-03-02 22:37:12 -0800
Commit: 841d2a2, github.com/apache/spark/pull/4866
[SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save
Joseph K. Bradley <joseph@databricks.com>
2015-03-02 22:33:51 -0800
Commit: 81648a7, github.com/apache/spark/pull/4864
[SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib
Xiangrui Meng <meng@databricks.com>
2015-03-02 22:27:01 -0800
Commit: 62c53be, github.com/apache/spark/pull/4854
[SPARK-5310][SQL] Fixes to Docs and Datasources API
Reynold Xin <rxin@databricks.com>, Michael Armbrust <michael@databricks.com>
2015-03-02 22:14:08 -0800
Commit: 4e6e008, github.com/apache/spark/pull/4868
[SPARK-5950][SQL]Insert array into a metastore table saved as parquet should work when using datasource api
Yin Huai <yhuai@databricks.com>
2015-03-02 19:31:55 -0800
Commit: 1b490e9, github.com/apache/spark/pull/4826
[SPARK-6127][Streaming][Docs] Add Kafka to Python api docs
Tathagata Das <tathagata.das1565@gmail.com>
2015-03-02 18:40:46 -0800
Commit: ffd0591, github.com/apache/spark/pull/4860
[SPARK-5537] Add user guide for multinomial logistic regression
Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>
2015-03-02 18:10:50 -0800
Commit: 11389f0, github.com/apache/spark/pull/4801
[SPARK-6121][SQL][MLLIB] simpleString for UDT
Xiangrui Meng <meng@databricks.com>
2015-03-02 17:14:34 -0800
Commit: 1b8ab57, github.com/apache/spark/pull/4858
[SPARK-6048] SparkConf should not translate deprecated configs on set
Andrew Or <andrew@databricks.com>
2015-03-02 16:36:42 -0800
Commit: ea69cf2, github.com/apache/spark/pull/4799
[SPARK-6066] Make event log format easier to parse
Andrew Or <andrew@databricks.com>
2015-03-02 16:34:32 -0800
Commit: 8100b79, github.com/apache/spark/pull/4821
[SPARK-6082] [SQL] Provides better error message for malformed rows when caching tables
Cheng Lian <lian@databricks.com>
2015-03-02 16:18:00 -0800
Commit: 866f281, github.com/apache/spark/pull/4842
[SPARK-6114][SQL] Avoid metastore conversions before plan is resolved
Michael Armbrust <michael@databricks.com>
2015-03-02 16:10:54 -0800
Commit: 3899c7c, github.com/apache/spark/pull/4855
[SPARK-6050] [yarn] Relax matching of vcore count in received containers.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-02 16:41:43 -0600
Commit: 650d1e7, github.com/apache/spark/pull/4818
[SPARK-6040][SQL] Fix the percent bug in tablesample
q00251598 <qiyadong@huawei.com>
2015-03-02 13:16:29 -0800
Commit: a83b9bb, github.com/apache/spark/pull/4789
[Minor] Fix doc typo for describing primitiveTerm effectiveness condition
Liang-Chi Hsieh <viirya@gmail.com>
2015-03-02 13:11:17 -0800
Commit: f92876a, github.com/apache/spark/pull/4762
SPARK-5390 [DOCS] Encourage users to post on Stack Overflow in Community Docs
Sean Owen <sowen@cloudera.com>
2015-03-02 21:10:08 +0000
Commit: 58e7198, github.com/apache/spark/pull/4843
[DOCS] Refactored Dataframe join comment to use correct parameter ordering
Paul Power <paul.power@peerside.com>
2015-03-02 13:08:47 -0800
Commit: 54ac243, github.com/apache/spark/pull/4847
[SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark
Yanbo Liang <ybliang8@gmail.com>
2015-03-02 10:17:24 -0800
Commit: 4ffaf85, github.com/apache/spark/pull/4831
[SPARK-5741][SQL] Support the path contains comma in HiveContext
q00251598 <qiyadong@huawei.com>
2015-03-02 10:13:11 -0800
Commit: f476108, github.com/apache/spark/pull/4532
[SPARK-6111] Fixed usage string in documentation.
Kenneth Myers <myerske@us.ibm.com>
2015-03-02 17:25:24 +0000
Commit: b2b7f01, github.com/apache/spark/pull/4852
[SPARK-6052][SQL]In JSON schema inference, we should always set containsNull of an ArrayType to true
Yin Huai <yhuai@databricks.com>
2015-03-02 23:18:07 +0800
Commit: a3fef2c, github.com/apache/spark/pull/4806
[SPARK-6073][SQL] Need to refresh metastore cache after append data in CreateMetastoreDataSourceAsSelect
Yin Huai <yhuai@databricks.com>
2015-03-02 22:42:18 +0800
Commit: c59871c, github.com/apache/spark/pull/4824
[Streaming][Minor]Fix some error docs in streaming examples
Saisai Shao <saisai.shao@intel.com>
2015-03-02 08:49:19 +0000
Commit: 1fe677a, github.com/apache/spark/pull/4837
[SPARK-6083] [MLLib] [DOC] Make Python API example consistent in NaiveBayes
MechCoder <manojkumarsivaraj334@gmail.com>
2015-03-01 16:28:15 -0800
Commit: 6a2fc85, github.com/apache/spark/pull/4834
[SPARK-6053][MLLIB] support save/load in PySpark's ALS
Xiangrui Meng <meng@databricks.com>
2015-03-01 16:26:57 -0800
Commit: b570d98, github.com/apache/spark/pull/4811
[SPARK-6074] [sql] Package pyspark sql bindings.
Marcelo Vanzin <vanzin@cloudera.com>
2015-03-01 11:05:10 +0000
Commit: bb16618, github.com/apache/spark/pull/4822
SPARK-5984: Fix TimSort bug that causes ArrayOutOfBoundsException
Evan Yu <ehotou@gmail.com>
2015-02-28 18:55:34 -0800
Commit: 317694c, github.com/apache/spark/pull/4804
[SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table
Cheng Lian <lian@databricks.com>, Cheng Lian <liancheng@users.noreply.github.com>, Yin Huai <yhuai@databricks.com>
2015-02-28 21:15:43 +0800
Commit: aa39460, github.com/apache/spark/pull/4792
[SPARK-5979][SPARK-6032] Smaller safer --packages fix
Burak Yavuz <brkyvz@gmail.com>
2015-02-27 22:59:35 -0800
Commit: 5a55c96, github.com/apache/spark/pull/4802
[SPARK-6070] [yarn] Remove unneeded classes from shuffle service jar.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-27 22:44:11 -0800
Commit: 1747e0a, github.com/apache/spark/pull/4820
[SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
Davies Liu <davies@databricks.com>
2015-02-27 20:07:17 -0800
Commit: 49f2187, github.com/apache/spark/pull/4808
[SPARK-5751] [SQL] Sets SPARK_HOME as SPARK_PID_DIR when running Thrift server test suites
Cheng Lian <lian@databricks.com>
2015-02-28 08:41:49 +0800
Commit: 5d19cf0, github.com/apache/spark/pull/4758
[Streaming][Minor] Remove useless type signature of Java Kafka direct stream API
Saisai Shao <saisai.shao@intel.com>
2015-02-27 13:01:42 -0800
Commit: ceebe3c, github.com/apache/spark/pull/4817
[SPARK-4587] [mllib] [docs] Fixed save,load calls in ML guide examples
Joseph K. Bradley <joseph@databricks.com>
2015-02-27 13:00:36 -0800
Commit: 117e10c, github.com/apache/spark/pull/4816
[SPARK-6058][Yarn] Log the user class exception in ApplicationMaster
zsxwing <zsxwing@gmail.com>
2015-02-27 13:31:46 +0000
Commit: bff8088, github.com/apache/spark/pull/4813
fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode
许鹏 <peng.xu@fraudmetrix.cn>
2015-02-26 23:05:56 -0800
Commit: b8db84c, github.com/apache/spark/pull/4803
SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.
Lukasz Jastrzebski <lukasz.jastrzebski@gmail.com>
2015-02-26 22:38:06 -0800
Commit: 485b919, github.com/apache/spark/pull/4778
[SPARK-6024][SQL] When a data source table has too many columns, its schema cannot be stored in metastore.
Yin Huai <yhuai@databricks.com>
2015-02-26 20:46:05 -0800
Commit: 6200f07, github.com/apache/spark/pull/4795
[SPARK-6037][SQL] Avoiding duplicate Parquet schema merging
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-27 11:06:47 +0800
Commit: 25a109e, github.com/apache/spark/pull/4786
SPARK-4579 [WEBUI] Scheduling Delay appears negative
Sean Owen <sowen@cloudera.com>
2015-02-26 17:35:09 -0800
Commit: b83a93e, github.com/apache/spark/pull/4796
[SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode
mohit.goyal <mohit.goyal@guavus.com>
2015-02-26 14:27:47 -0800
Commit: 5b426cb, github.com/apache/spark/pull/4730
Add a note for context termination for History server on Yarn
moussa taifi <moutai10@gmail.com>
2015-02-26 14:19:43 -0800
Commit: 297c3ef, github.com/apache/spark/pull/4721
[SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM
Cheolsoo Park <cheolsoop@netflix.com>
2015-02-26 13:53:49 -0800
Commit: fe79674, github.com/apache/spark/pull/4773
[SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-26 13:46:07 -0800
Commit: 731a997, github.com/apache/spark/pull/4779
Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.
Li Zhihui <zhihui.li@intel.com>
2015-02-26 13:07:07 -0800
Commit: 62652dc, github.com/apache/spark/pull/4781
[SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe
Davies Liu <davies@databricks.com>
2015-02-26 11:54:17 -0800
Commit: 5d309ad, github.com/apache/spark/pull/4776
[SPARK-6015] fix links to source code in Python API docs
Davies Liu <davies@databricks.com>
2015-02-26 10:45:29 -0800
Commit: dafb3d2, github.com/apache/spark/pull/4772
[SPARK-6007][SQL] Add numRows param in DataFrame.show()
Jacky Li <jacky.likun@huawei.com>
2015-02-26 10:40:58 -0800
Commit: 7c779d8, github.com/apache/spark/pull/4767
[SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true
Yin Huai <yhuai@databricks.com>
2015-02-27 01:01:32 +0800
Commit: b5c5e93, github.com/apache/spark/pull/4775
[SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2
Yin Huai <yhuai@databricks.com>
2015-02-26 22:39:49 +0800
Commit: e0f5fb0, github.com/apache/spark/pull/4782
[SPARK-5976][MLLIB] Add partitioner to factors returned by ALS
Xiangrui Meng <meng@databricks.com>
2015-02-25 23:43:29 -0800
Commit: a51d9db, github.com/apache/spark/pull/4748
[SPARK-1182][Docs] Sort the configuration parameters in configuration.md
Brennon York <brennon.york@capitalone.com>
2015-02-25 16:12:56 -0800
Commit: 56fa38a, github.com/apache/spark/pull/3863
[SPARK-5724] fix the misconfiguration in AkkaUtils
CodingCat <zhunansjtu@gmail.com>
2015-02-23 11:29:25 +0000
Commit: b32a653, github.com/apache/spark/pull/4512
[SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/load, Python GBT
Joseph K. Bradley <joseph@databricks.com>
2015-02-25 16:13:17 -0800
Commit: a1b4856, github.com/apache/spark/pull/4750
[SPARK-5926] [SQL] make DataFrame.explain leverage queryExecution.logical
Yanbo Liang <ybliang8@gmail.com>
2015-02-25 15:37:13 -0800
Commit: 5bd4b49, github.com/apache/spark/pull/4707
[SPARK-5999][SQL] Remove duplicate Literal matching block
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-25 15:22:33 -0800
Commit: 6fff9b8, github.com/apache/spark/pull/4760
[SPARK-6010] [SQL] Merging compatible Parquet schemas before computing splits
Cheng Lian <lian@databricks.com>
2015-02-25 15:15:22 -0800
Commit: 016f1f8, github.com/apache/spark/pull/4768
[SPARK-5944] [PySpark] fix version in Python API docs
Davies Liu <davies@databricks.com>
2015-02-25 15:13:34 -0800
Commit: 9aca3c6, github.com/apache/spark/pull/4731
[SPARK-5982] Remove incorrect Local Read Time Metric
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-25 14:55:24 -0800
Commit: 791df93, github.com/apache/spark/pull/4749
[SPARK-1955][GraphX]: VertexRDD can incorrectly assume index sharing
Brennon York <brennon.york@capitalone.com>
2015-02-25 14:11:12 -0800
Commit: 8073767, github.com/apache/spark/pull/4705
SPARK-5930 [DOCS] Documented default of spark.shuffle.io.retryWait is confusing
Sean Owen <sowen@cloudera.com>
2015-02-25 12:20:44 -0800
Commit: eaffc6e, github.com/apache/spark/pull/4769
[SPARK-5996][SQL] Fix specialized outbound conversions
Michael Armbrust <michael@databricks.com>
2015-02-25 10:13:40 -0800
Commit: fada683, github.com/apache/spark/pull/4757
[SPARK-5994] [SQL] Python DataFrame documentation fixes
Davies Liu <davies@databricks.com>
2015-02-24 20:51:55 -0800
Commit: 5c421e0, github.com/apache/spark/pull/4756
[SPARK-5286][SQL] SPARK-5286 followup
Yin Huai <yhuai@databricks.com>
2015-02-24 19:51:36 -0800
Commit: e7a748e, github.com/apache/spark/pull/4755
[SPARK-5993][Streaming][Build] Fix assembly jar location of kafka-assembly
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-24 19:10:37 -0800
Commit: 1e94894, github.com/apache/spark/pull/4753
[SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
Reynold Xin <rxin@databricks.com>
2015-02-24 18:59:23 -0800
Commit: 5e233b2, github.com/apache/spark/pull/4752
[SPARK-5904][SQL] DataFrame Java API test suites.
Reynold Xin <rxin@databricks.com>
2015-02-24 18:51:41 -0800
Commit: 78a1781, github.com/apache/spark/pull/4751
[SPARK-5751] [SQL] [WIP] Revamped HiveThriftServer2Suite for robustness
Cheng Lian <lian@databricks.com>
2015-02-25 08:34:55 +0800
Commit: 17ee246, github.com/apache/spark/pull/4720
[SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
Davies Liu <davies@databricks.com>
2015-02-24 14:50:00 -0800
Commit: 91bf0f8, github.com/apache/spark/pull/4745
[SPARK-5952][SQL] Lock when using hive metastore client
Michael Armbrust <michael@databricks.com>
2015-02-24 13:39:29 -0800
Commit: 641423d, github.com/apache/spark/pull/4746
[MLLIB] Change x_i to y_i in Variance's user guide
Xiangrui Meng <meng@databricks.com>
2015-02-24 11:38:59 -0800
Commit: a4ff445, github.com/apache/spark/pull/4740
[SPARK-5965] Standalone Worker UI displays {{USER_JAR}}
Andrew Or <andrew@databricks.com>
2015-02-24 11:08:07 -0800
Commit: eaf7bf9, github.com/apache/spark/pull/4739
[SPARK-5967] [UI] Correctly clean JobProgressListener.stageIdToActiveJobIds
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-24 11:02:47 -0800
Commit: 28dd53b, github.com/apache/spark/pull/4741
[SPARK-5532][SQL] Repartition should not use external rdd representation
Michael Armbrust <michael@databricks.com>
2015-02-24 10:52:18 -0800
Commit: e46096b, github.com/apache/spark/pull/4738
[SPARK-5910][SQL] Support for as in selectExpr
Michael Armbrust <michael@databricks.com>
2015-02-24 10:49:51 -0800
Commit: ba5d60d, github.com/apache/spark/pull/4736
[SPARK-5968] [SQL] Suppresses ParquetOutputCommitter WARN logs
Cheng Lian <lian@databricks.com>
2015-02-24 10:45:38 -0800
Commit: 2b562b0, github.com/apache/spark/pull/4744
[SPARK-5958][MLLIB][DOC] update block matrix user guide
Xiangrui Meng <meng@databricks.com>
2015-02-23 22:08:44 -0800
Commit: dd42558, github.com/apache/spark/pull/4737
[SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecution
Michael Armbrust <michael@databricks.com>
2015-02-23 17:34:54 -0800
Commit: 2d7786e, github.com/apache/spark/pull/4684
[SPARK-5935][SQL] Accept MapType in the schema provided to a JSON dataset.
Yin Huai <yhuai@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
2015-02-23 17:16:34 -0800
Commit: 33ccad2, github.com/apache/spark/pull/4710
[SPARK-5912] [docs] [mllib] Small fixes to ChiSqSelector docs
Joseph K. Bradley <joseph@databricks.com>
2015-02-23 16:15:57 -0800
Commit: ae97040, github.com/apache/spark/pull/4732
[MLLIB] SPARK-5912 Programming guide for feature selection
Alexander Ulanov <nashb@yandex.ru>
2015-02-23 12:09:40 -0800
Commit: 8355773, github.com/apache/spark/pull/4709
[SPARK-5939][MLLib] make FPGrowth example app take parameters
Jacky Li <jacky.likun@huawei.com>
2015-02-23 08:47:28 -0800
Commit: 33b9084, github.com/apache/spark/pull/4714
[SPARK-5943][Streaming] Update the test to use new API to reduce the warning
Saisai Shao <saisai.shao@intel.com>
2015-02-23 11:27:27 +0000
Commit: 67b7f79, github.com/apache/spark/pull/4722
[EXAMPLES] fix typo.
Makoto Fukuhara <fukuo33@gmail.com>
2015-02-23 09:24:33 +0000
Commit: f172387, github.com/apache/spark/pull/4724
Revert "[SPARK-4808] Removing minimum number of elements read before spill check"
Andrew Or <andrew@databricks.com>
2015-02-22 09:44:52 -0800
Commit: 4186dd3
SPARK-5669 [BUILD] Reverse exclusion of JBLAS libs for 1.3
Sean Owen <sowen@cloudera.com>
2015-02-22 09:09:06 +0000
Commit: eed7389, github.com/apache/spark/pull/4715
[DataFrame] [Typo] Fix the typo
Cheng Hao <hao.cheng@intel.com>
2015-02-22 08:56:30 +0000
Commit: 04d3b32, github.com/apache/spark/pull/4717
[DOCS] Fix typo in API for custom InputFormats based on the “new” MapReduce API
Alexander <abezzubov@nflabs.com>
2015-02-22 08:53:05 +0000
Commit: c5a5c6f, github.com/apache/spark/pull/4718
[SPARK-5937][YARN] Fix ClientSuite to set YARN mode, so that the correct class is used in t...
Hari Shreedharan <hshreedharan@apache.org>
2015-02-21 10:01:01 -0800
Commit: 76e3e65, github.com/apache/spark/pull/4711
SPARK-5841 [CORE] [HOTFIX 2] Memory leak in DiskBlockManager
Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
2015-02-21 09:59:28 -0800
Commit: 932338e, github.com/apache/spark/pull/4690
[SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache manager
Yin Huai <yhuai@databricks.com>
2015-02-20 16:20:02 +0800
Commit: b9a6c5c, github.com/apache/spark/pull/4694
[SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pandas and tuple/list
Davies Liu <davies@databricks.com>
2015-02-20 15:35:05 -0800
Commit: 913562a, github.com/apache/spark/pull/4679
[SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 release
Joseph K. Bradley <joseph@databricks.com>
2015-02-20 02:31:32 -0800
Commit: 8c12f31, github.com/apache/spark/pull/4675
[SPARK-4808] Removing minimum number of elements read before spill check
mcheah <mcheah@palantir.com>
2015-02-19 18:09:22 -0800
Commit: 0382dcc, github.com/apache/spark/pull/4420
[SPARK-5900][MLLIB] make PIC and FPGrowth Java-friendly
Xiangrui Meng <meng@databricks.com>
2015-02-19 18:06:16 -0800
Commit: ba941ce, github.com/apache/spark/pull/4695
SPARK-5570: No docs stating that `new SparkConf().set("spark.driver.memory", ...)` will not work
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-02-19 15:50:58 -0800
Commit: c5f3b9e, github.com/apache/spark/pull/4665
SPARK-4682 [CORE] Consolidate various 'Clock' classes
Sean Owen <sowen@cloudera.com>
2015-02-19 15:35:23 -0800
Commit: bd49e8b, github.com/apache/spark/pull/4514
[SPARK-5889] Remove pid file after stopping service.
Zhan Zhang <zhazhan@gmail.com>
2015-02-19 23:13:02 +0000
Commit: ff8976e, github.com/apache/spark/pull/4676
[SPARK-5902] [ml] Made PipelineStage.transformSchema public instead of private to ml
Joseph K. Bradley <joseph@databricks.com>
2015-02-19 12:46:27 -0800
Commit: 0c494cf, github.com/apache/spark/pull/4682
[SPARK-5904][SQL] DataFrame API fixes.
Reynold Xin <rxin@databricks.com>
2015-02-19 12:09:44 -0800
Commit: 55d91d9, github.com/apache/spark/pull/4686
[SPARK-5825] [Spark Submit] Remove the double checking instance name when stopping the service
Cheng Hao <hao.cheng@intel.com>
2015-02-19 12:07:51 -0800
Commit: fe00eb6, github.com/apache/spark/pull/4611
[SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file
zsxwing <zsxwing@gmail.com>
2015-02-19 18:37:31 +0000
Commit: 25fae8e, github.com/apache/spark/pull/4219
[SPARK-5816] Add huge compatibility warning in DriverWrapper
Andrew Or <andrew@databricks.com>
2015-02-19 09:56:25 -0800
Commit: f93d4d9, github.com/apache/spark/pull/4687
SPARK-5548: Fix for AkkaUtilsSuite failure - attempt 2
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-02-19 09:53:36 -0800
Commit: fbcb949, github.com/apache/spark/pull/4653
[SPARK-5846] Correctly set job description and pool for SQL jobs
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-19 09:49:34 +0800
Commit: 092b45f, github.com/apache/spark/pull/4630
[SPARK-5879][MLLIB] update PIC user guide and add a Java example
Xiangrui Meng <meng@databricks.com>
2015-02-18 16:29:32 -0800
Commit: a64f374, github.com/apache/spark/pull/4680
[SPARK-5722] [SQL] [PySpark] infer int as LongType
Davies Liu <davies@databricks.com>
2015-02-18 14:17:04 -0800
Commit: 470cba8, github.com/apache/spark/pull/4666
[SPARK-5840][SQL] HiveContext cannot be serialized due to tuple extraction
Reynold Xin <rxin@databricks.com>
2015-02-18 14:02:32 -0800
Commit: b86e44c, github.com/apache/spark/pull/4628
[SPARK-5507] Added documentation for BlockMatrix
Burak Yavuz <brkyvz@gmail.com>
2015-02-18 10:11:08 -0800
Commit: 56f8f29, github.com/apache/spark/pull/4664
[SPARK-5519][MLLIB] add user guide with example code for fp-growth
Xiangrui Meng <meng@databricks.com>
2015-02-18 10:09:56 -0800
Commit: 661fbd3, github.com/apache/spark/pull/4661
SPARK-5669 [BUILD] [HOTFIX] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-18 14:41:44 +0000
Commit: 9f256ce, github.com/apache/spark/pull/4673
SPARK-4610 addendum: [Minor] [MLlib] Minor doc fix in GBT classification example
MechCoder <manojkumarsivaraj334@gmail.com>
2015-02-18 10:13:28 +0000
Commit: 3997e74, github.com/apache/spark/pull/4672
[SPARK-5878] fix DataFrame.repartition() in Python
Davies Liu <davies@databricks.com>
2015-02-18 01:00:54 -0800
Commit: aca7991, github.com/apache/spark/pull/4667
Avoid deprecation warnings in JDBCSuite.
Tor Myklebust <tmyklebu@gmail.com>
2015-02-18 01:00:13 -0800
Commit: 9a565b8, github.com/apache/spark/pull/4668
[Minor] [SQL] Cleans up DataFrame variable names and toDF() calls
Cheng Lian <lian@databricks.com>
2015-02-17 23:36:20 -0800
Commit: 2bd33ce, github.com/apache/spark/pull/4670
[SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-17 22:44:16 -0800
Commit: f8f9a64, github.com/apache/spark/pull/4597
[SPARK-5723][SQL]Change the default file format to Parquet for CTAS statements.
Yin Huai <yhuai@databricks.com>
2015-02-17 18:14:33 -0800
Commit: 6e82c46, github.com/apache/spark/pull/4639
Preparing development version 1.3.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-02-18 01:52:06 +0000
Commit: 2ab0ba0
Preparing Spark release v1.3.0-rc1
Patrick Wendell <patrick@databricks.com>
2015-02-18 01:52:06 +0000
Commit: f97b0d4
[SPARK-5875][SQL]logical.Project should not be resolved if it contains aggregates or generators
Yin Huai <yhuai@databricks.com>
2015-02-17 17:50:39 -0800
Commit: e8284b2, github.com/apache/spark/pull/4663
Revert "Preparing Spark release v1.3.0-snapshot1"
Patrick Wendell <patrick@databricks.com>
2015-02-17 17:48:47 -0800
Commit: 7320605
Revert "Preparing development version 1.3.1-SNAPSHOT"
Patrick Wendell <patrick@databricks.com>
2015-02-17 17:48:43 -0800
Commit: 932ae4d
[SPARK-4454] Revert getOrElse() cleanup in DAGScheduler.getCacheLocs()
Josh Rosen <joshrosen@databricks.com>
2015-02-17 17:45:16 -0800
Commit: 7e5e4d8
[SPARK-4454] Properly synchronize accesses to DAGScheduler cacheLocs map
Josh Rosen <joshrosen@databricks.com>
2015-02-17 17:39:58 -0800
Commit: 07a401a, github.com/apache/spark/pull/4660
[SPARK-5811] Added documentation for maven coordinates and added Spark Packages support
Burak Yavuz <brkyvz@gmail.com>, Davies Liu <davies@databricks.com>
2015-02-17 17:15:43 -0800
Commit: cb90584, github.com/apache/spark/pull/4662
[SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
Davies Liu <davies@databricks.com>
2015-02-17 16:54:57 -0800
Commit: 8120235, github.com/apache/spark/pull/4629
[SPARK-5852][SQL]Fail to convert a newly created empty metastore parquet table to a data source parquet table.
Yin Huai <yhuai@databricks.com>, Cheng Hao <hao.cheng@intel.com>
2015-02-17 15:47:59 -0800
Commit: 07d8ef9, github.com/apache/spark/pull/4655
[SPARK-5872] [SQL] create a sqlCtx in pyspark shell
Davies Liu <davies@databricks.com>
2015-02-17 15:44:37 -0800
Commit: 0dba382, github.com/apache/spark/pull/4659
[SPARK-5871] output explain in Python
Davies Liu <davies@databricks.com>
2015-02-17 13:48:38 -0800
Commit: cb06160, github.com/apache/spark/pull/4658
[SPARK-4172] [PySpark] Progress API in Python
Davies Liu <davies@databricks.com>
2015-02-17 13:36:43 -0800
Commit: 35e23ff, github.com/apache/spark/pull/3027
[SPARK-5868][SQL] Fix python UDFs in HiveContext and checks in SQLContext
Michael Armbrust <michael@databricks.com>
2015-02-17 13:23:45 -0800
Commit: e65dc1f, github.com/apache/spark/pull/4657
[SQL] [Minor] Update the HiveContext Unittest
Cheng Hao <hao.cheng@intel.com>
2015-02-17 12:25:35 -0800
Commit: 0135651, github.com/apache/spark/pull/4584
[Minor][SQL] Use same function to check path parameter in JSONRelation
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-17 12:24:13 -0800
Commit: d74d5e8, github.com/apache/spark/pull/4649
[SPARK-5862][SQL] Only transformUp the given plan once in HiveMetastoreCatalog
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-17 12:23:18 -0800
Commit: 62063b7, github.com/apache/spark/pull/4651
[Minor] fix typo in SQL document
CodingCat <zhunansjtu@gmail.com>
2015-02-17 12:16:52 -0800
Commit: 5636c4a, github.com/apache/spark/pull/4656
[SPARK-5864] [PySpark] support .jar as python package
Davies Liu <davies@databricks.com>
2015-02-17 12:05:06 -0800
Commit: 71cf6e2, github.com/apache/spark/pull/4652
SPARK-5841 [CORE] [HOTFIX] Memory leak in DiskBlockManager
Sean Owen <sowen@cloudera.com>
2015-02-17 19:40:06 +0000
Commit: e64afcd, github.com/apache/spark/pull/4648
[SPARK-5661]function hasShutdownDeleteTachyonDir should use shutdownDeleteTachyonPaths to determine whether contains file
xukun 00228947 <xukun.xu@huawei.com>, viper-kun <xukun.xu@huawei.com>
2015-02-17 18:59:41 +0000
Commit: 420bc9b, github.com/apache/spark/pull/4418
[SPARK-5778] throw if nonexistent metrics config file provided
Ryan Williams <ryan.blake.williams@gmail.com>
2015-02-17 10:57:16 -0800
Commit: 2bf2b56, github.com/apache/spark/pull/4571
[SPARK-5859] [PySpark] [SQL] fix DataFrame Python API
Davies Liu <davies@databricks.com>
2015-02-17 10:22:48 -0800
Commit: 4a581aa, github.com/apache/spark/pull/4645
[SPARK-5166][SPARK-5247][SPARK-5258][SQL] API Cleanup / Documentation
Michael Armbrust <michael@databricks.com>
2015-02-17 10:21:17 -0800
Commit: cd3d415, github.com/apache/spark/pull/4642
[SPARK-5858][MLLIB] Remove unnecessary first() call in GLM
Xiangrui Meng <meng@databricks.com>
2015-02-17 10:17:45 -0800
Commit: 97cb568, github.com/apache/spark/pull/4647
SPARK-5856: In Maven build script, launch Zinc with more memory
Patrick Wendell <patrick@databricks.com>
2015-02-17 10:10:01 -0800
Commit: 8240629, github.com/apache/spark/pull/4643
Revert "[SPARK-5363] [PySpark] check ending mark in non-block way"
Josh Rosen <joshrosen@databricks.com>
2015-02-17 07:48:27 -0800
Commit: aeb85cd
[SPARK-5826][Streaming] Fix Configuration not serializable problem
jerryshao <saisai.shao@intel.com>
2015-02-17 10:45:18 +0000
Commit: b8da5c3, github.com/apache/spark/pull/4612
HOTFIX: Style issue causing build break
Patrick Wendell <patrick@databricks.com>
2015-02-16 22:10:39 -0800
Commit: e9241fa
[SPARK-5802][MLLIB] cache transformed data in glm
Xiangrui Meng <meng@databricks.com>
2015-02-16 22:09:04 -0800
Commit: dfe0fa0, github.com/apache/spark/pull/4593
[SPARK-5853][SQL] Schema support in Row.
Reynold Xin <rxin@databricks.com>
2015-02-16 20:42:57 -0800
Commit: d0701d9, github.com/apache/spark/pull/4640
SPARK-5850: Remove experimental label for Scala 2.11 and FlumePollingStream
Patrick Wendell <patrick@databricks.com>
2015-02-16 20:33:33 -0800
Commit: c6a7069, github.com/apache/spark/pull/4638
[SPARK-5363] [PySpark] check ending mark in non-block way
Davies Liu <davies@databricks.com>
2015-02-16 20:32:03 -0800
Commit: baad6b3, github.com/apache/spark/pull/4601
[SQL] Various DataFrame doc changes.
Reynold Xin <rxin@databricks.com>
2015-02-16 19:00:30 -0800
Commit: e355b54, github.com/apache/spark/pull/4636
[SPARK-5849] Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction
Josh Rosen <joshrosen@databricks.com>
2015-02-16 18:08:02 -0800
Commit: 385a339, github.com/apache/spark/pull/4637
[SPARK-3340] Deprecate ADD_JARS and ADD_FILES
azagrebin <azagrebin@gmail.com>
2015-02-16 18:06:19 -0800
Commit: d8c70fb, github.com/apache/spark/pull/4616
[SPARK-5788] [PySpark] capture the exception in python write thread
Davies Liu <davies@databricks.com>
2015-02-16 17:57:14 -0800
Commit: c2a9a61, github.com/apache/spark/pull/4577
SPARK-5848: tear down the ConsoleProgressBar timer
Matt Whelan <mwhelan@perka.com>
2015-02-17 00:59:49 +0000
Commit: 52994d8, github.com/apache/spark/pull/4635
[SPARK-4865][SQL]Include temporary tables in SHOW TABLES
Yin Huai <yhuai@databricks.com>
2015-02-16 15:59:23 -0800
Commit: 8a94bf7, github.com/apache/spark/pull/4618
[SQL] Optimize arithmetic and predicate operators
kai <kaizeng@eecs.berkeley.edu>
2015-02-16 15:58:05 -0800
Commit: 639a3c2, github.com/apache/spark/pull/4472
[SPARK-5839][SQL]HiveMetastoreCatalog does not recognize table names and aliases of data source tables.
Yin Huai <yhuai@databricks.com>
2015-02-16 15:54:01 -0800
Commit: a15a0a0, github.com/apache/spark/pull/4626
[SPARK-5746][SQL] Check invalid cases for the write path of data source API
Yin Huai <yhuai@databricks.com>
2015-02-16 15:51:59 -0800
Commit: 4198654, github.com/apache/spark/pull/4617
HOTFIX: Break in Jekyll build from #4589
Patrick Wendell <patrick@databricks.com>
2015-02-16 15:43:56 -0800
Commit: ad8fd4f
[SPARK-2313] Use socket to communicate GatewayServer port back to Python driver
Josh Rosen <joshrosen@databricks.com>
2015-02-16 15:25:11 -0800
Commit: b70b8ba, github.com/apache/spark/pull/3424
SPARK-5357: Update commons-codec version to 1.10 (current)
Matt Whelan <mwhelan@perka.com>
2015-02-16 23:05:34 +0000
Commit: 8c45619, github.com/apache/spark/pull/4153
SPARK-5841: remove DiskBlockManager shutdown hook on stop
Matt Whelan <mwhelan@perka.com>
2015-02-16 22:54:32 +0000
Commit: dd977df, github.com/apache/spark/pull/4627
[SPARK-5833] [SQL] Adds REFRESH TABLE command
Cheng Lian <lian@databricks.com>
2015-02-16 12:52:05 -0800
Commit: 864d77e, github.com/apache/spark/pull/4624
[SPARK-5296] [SQL] Add more filter types for data sources API
Cheng Lian <lian@databricks.com>
2015-02-16 12:48:55 -0800
Commit: 363a9a7, github.com/apache/spark/pull/4623
[SQL] Add fetched row count in SparkSQLCLIDriver
OopsOutOfMemory <victorshengli@126.com>
2015-02-16 12:34:09 -0800
Commit: 0368494, github.com/apache/spark/pull/4604
[SQL] Initial support for reporting location of error in sql string
Michael Armbrust <michael@databricks.com>
2015-02-16 12:32:56 -0800
Commit: 63fa123, github.com/apache/spark/pull/4587
[SPARK-5824] [SQL] add null format in ctas and set default col comment to null
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-16 12:31:36 -0800
Commit: c2eaaea, github.com/apache/spark/pull/4609
[SQL] [Minor] Update the SpecificMutableRow.copy
Cheng Hao <hao.cheng@intel.com>
2015-02-16 12:21:08 -0800
Commit: 1a88955, github.com/apache/spark/pull/4619
SPARK-5795 [STREAMING] api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not be friendly to Java
Sean Owen <sowen@cloudera.com>
2015-02-16 19:32:31 +0000
Commit: fef2267, github.com/apache/spark/pull/4608
[SPARK-5799][SQL] Compute aggregation function on specified numeric columns
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-16 10:06:11 -0800
Commit: 0165e9d, github.com/apache/spark/pull/4592
[SPARK-4553] [SPARK-5767] [SQL] Wires Parquet data source with the newly introduced write support for data source API
Cheng Lian <lian@databricks.com>
2015-02-16 01:38:31 -0800
Commit: 78f7edb, github.com/apache/spark/pull/4563
[Minor] [SQL] Renames stringRddToDataFrame to stringRddToDataFrameHolder for consistency
Cheng Lian <lian@databricks.com>
2015-02-16 01:33:37 -0800
Commit: 066301c, github.com/apache/spark/pull/4613
[Ml] SPARK-5804 Explicitly manage cache in Crossvalidator k-fold loop
Peter Rudenko <petro.rudenko@gmail.com>
2015-02-16 00:07:23 -0800
Commit: 0d93205, github.com/apache/spark/pull/4595
[Ml] SPARK-5796 Don't transform data on a last estimator in Pipeline
Peter Rudenko <petro.rudenko@gmail.com>
2015-02-15 20:51:32 -0800
Commit: 9cf7d70, github.com/apache/spark/pull/4590
SPARK-5815 [MLLIB] Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-15 20:41:27 -0800
Commit: db3c539, github.com/apache/spark/pull/4614
[SPARK-5769] Set params in constructors and in setParams in Python ML pipelines
Xiangrui Meng <meng@databricks.com>
2015-02-15 20:29:26 -0800
Commit: d710991, github.com/apache/spark/pull/4564
SPARK-5669 [BUILD] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
Sean Owen <sowen@cloudera.com>
2015-02-15 09:15:48 -0800
Commit: 4e099d7, github.com/apache/spark/pull/4453
[MLLIB][SPARK-5502] User guide for isotonic regression
martinzapletal <zapletal-martin@email.cz>
2015-02-15 09:10:03 -0800
Commit: d96e188, github.com/apache/spark/pull/4536
[HOTFIX] Ignore DirectKafkaStreamSuite.
Patrick Wendell <patrick@databricks.com>
2015-02-13 12:43:53 -0800
Commit: 70ebad4
[SPARK-5827][SQL] Add missing import in the example of SqlContext
Takeshi Yamamuro <linguin.m.s@gmail.com>
2015-02-15 14:42:20 +0000
Commit: 9c1c70d, github.com/apache/spark/pull/4615
SPARK-5822 [BUILD] cannot import src/main/scala & src/test/scala into eclipse as source folder
gli <gli@redhat.com>
2015-02-14 20:43:27 +0000
Commit: f87f3b7, github.com/apache/spark/pull/4531
Revise formatting of previous commit f80e2629bb74bc62960c61ff313f7e7802d61319
Sean Owen <sowen@cloudera.com>
2015-02-14 20:12:29 +0000
Commit: 1945fcf
[SPARK-5800] Streaming Docs. Change linked files according to the selected language
gasparms <gmunoz@stratio.com>
2015-02-14 20:10:29 +0000
Commit: e99e170, github.com/apache/spark/pull/4589
[SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames
Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
2015-02-13 23:03:22 -0800
Commit: ba91bf5, github.com/apache/spark/pull/4556
SPARK-3290 [GRAPHX] No unpersist calls in SVDPlusPlus
Sean Owen <sowen@cloudera.com>
2015-02-13 20:12:52 -0800
Commit: db57479, github.com/apache/spark/pull/4234
[SPARK-5227] [SPARK-5679] Disable FileSystem cache in WholeTextFileRecordReaderSuite
Josh Rosen <joshrosen@databricks.com>
2015-02-13 17:45:31 -0800
Commit: 152147f, github.com/apache/spark/pull/4599
[SPARK-5730][ML] add doc groups to spark.ml components
Xiangrui Meng <meng@databricks.com>
2015-02-13 16:45:59 -0800
Commit: fccd38d, github.com/apache/spark/pull/4600
[SPARK-5803][MLLIB] use ArrayBuilder to build primitive arrays
Xiangrui Meng <meng@databricks.com>
2015-02-13 16:43:49 -0800
Commit: 356b798, github.com/apache/spark/pull/4594
[SPARK-5806] re-organize sections in mllib-clustering.md
Xiangrui Meng <meng@databricks.com>
2015-02-13 15:09:27 -0800
Commit: 9658763, github.com/apache/spark/pull/4598
[SPARK-5789][SQL]Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.
Yin Huai <yhuai@databricks.com>
2015-02-13 13:51:06 -0800
Commit: d9d0250, github.com/apache/spark/pull/4582
[SPARK-5642] [SQL] Apply column pruning on unused aggregation fields
Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
2015-02-13 13:46:50 -0800
Commit: efffc2e, github.com/apache/spark/pull/4415
[HOTFIX] Fix build break in MesosSchedulerBackendSuite
Andrew Or <andrew@databricks.com>
2015-02-13 13:10:29 -0800
Commit: 4160371
SPARK-5805 Fixed the type error in documentation.
Emre Sevinç <emre.sevinc@gmail.com>
2015-02-13 12:31:27 -0800
Commit: ad73189, github.com/apache/spark/pull/4596
[SPARK-5735] Replace uses of EasyMock with Mockito
Josh Rosen <joshrosen@databricks.com>
2015-02-13 09:53:57 -0800
Commit: cc9eec1, github.com/apache/spark/pull/4578
[SPARK-5783] Better eventlog-parsing error messages
Ryan Williams <ryan.blake.williams@gmail.com>
2015-02-13 09:47:26 -0800
Commit: e5690a5, github.com/apache/spark/pull/4573
[SPARK-5503][MLLIB] Example code for Power Iteration Clustering
sboeschhuawei <stephen.boesch@huawei.com>
2015-02-13 09:45:57 -0800
Commit: 5e63942, github.com/apache/spark/pull/4495
[SPARK-5732][CORE]:Add an option to print the spark version in spark script.
uncleGen <hustyugm@gmail.com>, genmao.ygm <genmao.ygm@alibaba-inc.com>
2015-02-13 09:43:10 -0800
Commit: 5c883df, github.com/apache/spark/pull/4522
[SPARK-4832][Deploy]some other processes might take the daemon pid
WangTaoTheTonic <barneystinson@aliyun.com>, WangTaoTheTonic <wangtao111@huawei.com>
2015-02-13 10:27:23 +0000
Commit: 1255e83, github.com/apache/spark/pull/3683
[SQL] Fix docs of SQLContext.tables
Yin Huai <yhuai@databricks.com>
2015-02-12 20:37:55 -0800
Commit: a8f560c, github.com/apache/spark/pull/4579
[SPARK-3365][SQL]Wrong schema generated for List type
tianyi <tianyi.asiainfo@gmail.com>
2015-02-12 22:18:39 -0800
Commit: b9f332a, github.com/apache/spark/pull/4581
[SPARK-3299][SQL]Public API in SQLContext to list tables
Yin Huai <yhuai@databricks.com>
2015-02-12 18:08:01 -0800
Commit: edbac17, github.com/apache/spark/pull/4547
[SQL] Move SaveMode to SQL package.
Yin Huai <yhuai@databricks.com>
2015-02-12 15:32:17 -0800
Commit: 925fd84, github.com/apache/spark/pull/4542
[SPARK-5335] Fix deletion of security groups within a VPC
Vladimir Grigor <vladimir@kiosked.com>, Vladimir Grigor <vladimir@voukka.com>
2015-02-12 23:26:24 +0000
Commit: 5c9db4e, github.com/apache/spark/pull/4122
[SPARK-5755] [SQL] remove unnecessary Add
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-12 15:22:07 -0800
Commit: f7103b3, github.com/apache/spark/pull/4551
[SPARK-5573][SQL] Add explode to dataframes
Michael Armbrust <michael@databricks.com>
2015-02-12 15:19:19 -0800
Commit: c7eb9ee, github.com/apache/spark/pull/4546
[SPARK-5758][SQL] Use LongType as the default type for integers in JSON schema inference.
Yin Huai <yhuai@databricks.com>
2015-02-12 15:17:25 -0800
Commit: b0c79da, github.com/apache/spark/pull/4544
[SPARK-5780] [PySpark] Mute the logging during unit tests
Davies Liu <davies@databricks.com>
2015-02-12 14:54:38 -0800
Commit: bf0d15c, github.com/apache/spark/pull/4572
SPARK-5747: Fix wordsplitting bugs in make-distribution.sh
David Y. Ross <dyross@gmail.com>
2015-02-12 14:52:38 -0800
Commit: 11a0d5b, github.com/apache/spark/pull/4540
[SPARK-5759][Yarn]ExecutorRunnable should catch YarnException while NMClient start contain...
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-12 14:50:16 -0800
Commit: 02d5b32, github.com/apache/spark/pull/4554
[SPARK-5760][SPARK-5761] Fix standalone rest protocol corner cases + revamp tests
Andrew Or <andrew@databricks.com>
2015-02-12 14:47:52 -0800
Commit: 11d1080, github.com/apache/spark/pull/4557
[SPARK-5762] Fix shuffle write time for sort-based shuffle
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-12 14:46:37 -0800
Commit: 0040fc5, github.com/apache/spark/pull/4559
[SPARK-5765][Examples]Fixed word split problem in run-example and compute-classpath
Venkata Ramana G <ramana.gollamudihuawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-02-12 14:44:21 -0800
Commit: 9a1de4b, github.com/apache/spark/pull/4561
[SPARK-5645] Added local read bytes/time to task metrics
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-12 14:35:44 -0800
Commit: 74f34bb, github.com/apache/spark/pull/4510
[SQL] Improve error messages
Michael Armbrust <michael@databricks.com>, wangfei <wangfei1@huawei.com>
2015-02-12 13:11:28 -0800
Commit: e3a975d, github.com/apache/spark/pull/4558
[SQL][DOCS] Update sql documentation
Antonio Navarro Perez <ajnavarro@users.noreply.github.com>
2015-02-12 12:46:17 -0800
Commit: cbd659e, github.com/apache/spark/pull/4560
[SPARK-5757][MLLIB] replace SQL JSON usage in model import/export by json4s
Xiangrui Meng <meng@databricks.com>
2015-02-12 10:48:13 -0800
Commit: e26c149, github.com/apache/spark/pull/4555
[SPARK-5655] Don't chmod700 application files if running in YARN
Andrew Rowson <github@growse.com>
2015-02-12 18:41:39 +0000
Commit: e23c8f5, github.com/apache/spark/pull/4509
[SQL] Make dataframe more tolerant of being serialized
Michael Armbrust <michael@databricks.com>
2015-02-11 19:05:49 -0800
Commit: 3c1b9bf, github.com/apache/spark/pull/4545
[SQL] Two DataFrame fixes.
Reynold Xin <rxin@databricks.com>
2015-02-11 18:32:48 -0800
Commit: bcb1382, github.com/apache/spark/pull/4543
[SPARK-3688][SQL] More inline comments for LogicalPlan.
Reynold Xin <rxin@databricks.com>
2015-02-11 15:26:31 -0800
Commit: 08ab3d2, github.com/apache/spark/pull/4539
[SPARK-3688][SQL]LogicalPlan can't resolve column correctly
tianyi <tianyi.asiainfo@gmail.com>
2015-02-11 12:50:17 -0800
Commit: e136f47, github.com/apache/spark/pull/4524
[SPARK-5454] More robust handling of self joins
Michael Armbrust <michael@databricks.com>
2015-02-11 12:31:56 -0800
Commit: 1bb3631, github.com/apache/spark/pull/4520
Remove outdated remark about take(n).
Daniel Darabos <darabos.daniel@gmail.com>
2015-02-11 20:24:17 +0000
Commit: 72adfc5, github.com/apache/spark/pull/4533
[SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
Davies Liu <davies@databricks.com>
2015-02-11 12:13:16 -0800
Commit: d66aae2, github.com/apache/spark/pull/4528
[SPARK-5733] Error Link in Pagination of HistoryPage when showing Incomplete Applications
guliangliang <guliangliang@qiyi.com>
2015-02-11 15:55:49 +0000
Commit: 864dccd, github.com/apache/spark/pull/4523
SPARK-5727 [BUILD] Deprecate Debian packaging
Sean Owen <sowen@cloudera.com>
2015-02-11 08:30:16 +0000
Commit: 057ec4f, github.com/apache/spark/pull/4516
SPARK-5728 [STREAMING] MQTTStreamSuite leaves behind ActiveMQ database files
Sean Owen <sowen@cloudera.com>
2015-02-11 08:13:51 +0000
Commit: 476b6d7, github.com/apache/spark/pull/4517
[SPARK-4964] [Streaming] refactor createRDD to take leaders via map instead of array
cody koeninger <cody@koeninger.org>
2015-02-11 00:13:27 -0800
Commit: 811d179, github.com/apache/spark/pull/4511
Preparing development version 1.3.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-02-11 07:47:03 +0000
Commit: e57c81b
Preparing Spark release v1.3.0-snapshot1
Patrick Wendell <patrick@databricks.com>
2015-02-11 07:47:02 +0000
Commit: d97bfc6
Revert "Preparing Spark release v1.3.0-snapshot1"
Patrick Wendell <patrick@databricks.com>
2015-02-10 23:46:04 -0800
Commit: 6a91d59
Revert "Preparing development version 1.3.1-SNAPSHOT"
Patrick Wendell <patrick@databricks.com>
2015-02-10 23:46:02 -0800
Commit: 3a50383
HOTFIX: Adding Junit to Hive tests for Maven build
Patrick Wendell <patrick@databricks.com>
2015-02-10 23:39:21 -0800
Commit: 0386fc4
Preparing development version 1.3.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-02-11 06:45:03 +0000
Commit: ba12b79
Preparing Spark release v1.3.0-snapshot1
Patrick Wendell <patrick@databricks.com>
2015-02-11 06:45:03 +0000
Commit: 53068f5
HOTFIX: Java 6 compilation error in Spark SQL
Patrick Wendell <patrick@databricks.com>
2015-02-10 22:43:32 -0800
Commit: 15180bc
Revert "Preparing Spark release v1.3.0-snapshot1"
Patrick Wendell <patrick@databricks.com>
2015-02-10 22:44:10 -0800
Commit: 536dae9
Revert "Preparing development version 1.3.1-SNAPSHOT"
Patrick Wendell <patrick@databricks.com>
2015-02-10 22:44:07 -0800
Commit: 01b562e
Preparing development version 1.3.1-SNAPSHOT
Patrick Wendell <patrick@databricks.com>
2015-02-11 06:15:29 +0000
Commit: db80d0f
Preparing Spark release v1.3.0-snapshot1
Patrick Wendell <patrick@databricks.com>
2015-02-11 06:15:29 +0000
Commit: c2e4001
Updating versions for Spark 1.3
Patrick Wendell <patrick@databricks.com>
2015-02-10 21:54:55 -0800
Commit: 2f52489
[SPARK-5714][Mllib] Refactor initial step of LDA to remove redundant operations
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-10 21:51:15 -0800
Commit: ba3aa8f, github.com/apache/spark/pull/4501
[SPARK-5702][SQL] Allow short names for built-in data sources.
Reynold Xin <rxin@databricks.com>
2015-02-10 20:40:21 -0800
Commit: 63af90c, github.com/apache/spark/pull/4489
[SPARK-5729] Potential NPE in standalone REST API
Andrew Or <andrew@databricks.com>
2015-02-10 20:19:14 -0800
Commit: 1bc75b0, github.com/apache/spark/pull/4518
[SPARK-4879] Use driver to coordinate Hadoop output committing for speculative tasks
mcheah <mcheah@palantir.com>, Josh Rosen <joshrosen@databricks.com>
2015-02-10 20:12:18 -0800
Commit: 79cd59c, github.com/apache/spark/pull/4155
[SQL][DataFrame] Fix column computability bug.
Reynold Xin <rxin@databricks.com>
2015-02-10 19:50:44 -0800
Commit: e477e91, github.com/apache/spark/pull/4519
[SPARK-5709] [SQL] Add EXPLAIN support in DataFrame API for debugging purpose
Cheng Hao <hao.cheng@intel.com>
2015-02-10 19:40:51 -0800
Commit: 7fa0d5f, github.com/apache/spark/pull/4496
[SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns
Davies Liu <davies@databricks.com>
2015-02-10 19:40:12 -0800
Commit: 1056c5b, github.com/apache/spark/pull/4498
[SPARK-5683] [SQL] Avoid multiple json generator created
Cheng Hao <hao.cheng@intel.com>
2015-02-10 18:19:56 -0800
Commit: fc0446f, github.com/apache/spark/pull/4468
[SQL] Add an exception for analysis errors.
Michael Armbrust <michael@databricks.com>
2015-02-10 17:32:42 -0800
Commit: 748cdc1, github.com/apache/spark/pull/4439
[SPARK-5658][SQL] Finalize DDL and write support APIs
Yin Huai <yhuai@databricks.com>
2015-02-10 17:29:52 -0800
Commit: a21090e, github.com/apache/spark/pull/4446
[SPARK-5493] [core] Add option to impersonate user.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-10 17:19:10 -0800
Commit: 8e75b0e, github.com/apache/spark/pull/4405
[SQL] Make Options in the data source API CREATE TABLE statements optional.
Yin Huai <yhuai@databricks.com>
2015-02-10 17:06:12 -0800
Commit: 445dbc7, github.com/apache/spark/pull/4515
[SPARK-5725] [SQL] Fixes ParquetRelation2.equals
Cheng Lian <lian@databricks.com>
2015-02-10 17:02:44 -0800
Commit: f43bc3d, github.com/apache/spark/pull/4513
[SPARK-5343][GraphX]: ShortestPaths traverses backwards
Brennon York <brennon.york@capitalone.com>
2015-02-10 14:57:00 -0800
Commit: 5be8902, github.com/apache/spark/pull/4478
[SPARK-5021] [MLlib] Gaussian Mixture now supports Sparse Input
MechCoder <manojkumarsivaraj334@gmail.com>
2015-02-10 14:05:55 -0800
Commit: bba0953, github.com/apache/spark/pull/4459
[HOTFIX][SPARK-4136] Fix compilation and tests
Andrew Or <andrew@databricks.com>
2015-02-10 11:18:01 -0800
Commit: 4e3aa68
[SPARK-5686][SQL] Add show current roles command in HiveQl
OopsOutOfMemory <victorshengli@126.com>
2015-02-10 13:20:15 -0800
Commit: 8b7587a, github.com/apache/spark/pull/4471
[SQL] Add toString to DataFrame/Column
Michael Armbrust <michael@databricks.com>
2015-02-10 13:14:01 -0800
Commit: ef739d9, github.com/apache/spark/pull/4436
SPARK-5613: Catch the ApplicationNotFoundException exception to avoid thread from getting killed on yarn restart.
Kashish Jain <kashish.jain@guavus.com>
2015-02-06 13:47:23 -0800
Commit: c294216, github.com/apache/spark/pull/4392
[SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table
wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2015-02-10 11:54:30 -0800
Commit: dbfce30, github.com/apache/spark/pull/4368
SPARK-4136. Under dynamic allocation, cancel outstanding executor requests when no longer needed
Sandy Ryza <sandy@cloudera.com>
2015-02-10 11:07:25 -0800
Commit: e53da21, github.com/apache/spark/pull/4168
[SPARK-5716] [SQL] Support TOK_CHARSETLITERAL in HiveQl
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-10 11:08:21 -0800
Commit: e508237, github.com/apache/spark/pull/4502
[SPARK-5717] [MLlib] add stop and reorganize import
JqueryFan <firing@126.com>, Yuhao Yang <hhbyyh@gmail.com>
2015-02-10 17:37:32 +0000
Commit: b32f553, github.com/apache/spark/pull/4503
[SPARK-5700] [SQL] [Build] Bumps jets3t to 0.9.3 for hadoop-2.3 and hadoop-2.4 profiles
Cheng Lian <lian@databricks.com>
2015-02-10 02:28:47 -0800
Commit: d6f31e0, github.com/apache/spark/pull/4499
SPARK-5239 [CORE] JdbcRDD throws "java.lang.AbstractMethodError: oracle.jdbc.driver.xxxxxx.isClosed()Z"
Sean Owen <sowen@cloudera.com>
2015-02-10 09:19:01 +0000
Commit: 4cfc025, github.com/apache/spark/pull/4470
[SPARK-4964][Streaming][Kafka] More updates to Exactly-once Kafka stream
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-09 22:45:48 -0800
Commit: 281614d, github.com/apache/spark/pull/4384
[SPARK-5597][MLLIB] save/load for decision trees and ensembles
Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
2015-02-09 22:09:07 -0800
Commit: 01905c4, github.com/apache/spark/pull/4444
[SQL] Remove the duplicated code
Cheng Hao <hao.cheng@intel.com>
2015-02-09 21:33:34 -0800
Commit: 663d34e, github.com/apache/spark/pull/4494
[SPARK-5701] Only set ShuffleReadMetrics when task has shuffle deps
Kay Ousterhout <kayousterhout@gmail.com>
2015-02-09 21:22:09 -0800
Commit: 6ddbca4, github.com/apache/spark/pull/4488
[SPARK-5703] AllJobsPage throws empty.max exception
Andrew Or <andrew@databricks.com>
2015-02-09 21:18:48 -0800
Commit: 8326255, github.com/apache/spark/pull/4490
[SPARK-2996] Implement userClassPathFirst for driver, yarn.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-09 21:17:06 -0800
Commit: 6a1e0f9, github.com/apache/spark/pull/3233
SPARK-4900 [MLLIB] MLlib SingularValueDecomposition ARPACK IllegalStateException
Sean Owen <sowen@cloudera.com>
2015-02-09 21:13:58 -0800
Commit: ebf1df0, github.com/apache/spark/pull/4485
Add a config option to print DAG.
KaiXinXiaoLei <huleilei1@huawei.com>
2015-02-09 20:58:58 -0800
Commit: dad05e0, github.com/apache/spark/pull/4257
[SPARK-5469] restructure pyspark.sql into multiple files
Davies Liu <davies@databricks.com>
2015-02-09 20:49:22 -0800
Commit: f0562b4, github.com/apache/spark/pull/4479
[SPARK-5698] Do not let user request negative # of executors
Andrew Or <andrew@databricks.com>
2015-02-09 17:33:29 -0800
Commit: 62b1e1f, github.com/apache/spark/pull/4483
[SPARK-5699] [SQL] [Tests] Runs hive-thriftserver tests whenever SQL code is modified
Cheng Lian <lian@databricks.com>
2015-02-09 16:52:05 -0800
Commit: 71f0f51, github.com/apache/spark/pull/4486
[SPARK-5648][SQL] support "alter ... unset tblproperties("key")"
DoingDone9 <799203320@qq.com>
2015-02-09 16:40:26 -0800
Commit: e2bf59a, github.com/apache/spark/pull/4424
[SPARK-2096][SQL] support dot notation on array of struct
Wenchen Fan <cloud0fan@outlook.com>
2015-02-09 16:39:34 -0800
Commit: 15f557f, github.com/apache/spark/pull/2405
[SPARK-5614][SQL] Predicate pushdown through Generate.
Lu Yan <luyan02@baidu.com>
2015-02-09 16:25:38 -0800
Commit: ce2c89c, github.com/apache/spark/pull/4394
[SPARK-5696] [SQL] [HOTFIX] Asks HiveThriftServer2 to re-initialize log4j using Hive configurations
Cheng Lian <lian@databricks.com>
2015-02-09 16:23:12 -0800
Commit: 379233c, github.com/apache/spark/pull/4484
[SQL] Code cleanup.
Yin Huai <yhuai@databricks.com>
2015-02-09 16:20:42 -0800
Commit: e241601, github.com/apache/spark/pull/4482
[SQL] Add some missing DataFrame functions.
Michael Armbrust <michael@databricks.com>
2015-02-09 16:02:56 -0800
Commit: a70dca0, github.com/apache/spark/pull/4437
[SPARK-5675][SQL] XyzType companion object should subclass XyzType
Reynold Xin <rxin@databricks.com>
2015-02-09 14:51:46 -0800
Commit: 1e2fab2, github.com/apache/spark/pull/4463
[SPARK-4905][STREAMING] FlumeStreamSuite fix.
Hari Shreedharan <hshreedharan@apache.org>
2015-02-09 14:17:14 -0800
Commit: 18c5a99, github.com/apache/spark/pull/4371
[SPARK-5691] Fixing wrong data structure lookup for dupe app registratio...
mcheah <mcheah@palantir.com>
2015-02-09 13:20:14 -0800
Commit: 6a0144c, github.com/apache/spark/pull/4477
[SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
Davies Liu <davies@databricks.com>
2015-02-09 11:42:52 -0800
Commit: 43972b5, github.com/apache/spark/pull/4476
[SPARK-5664][BUILD] Restore stty settings when exiting from SBT's spark-shell
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-09 11:45:12 -0800
Commit: fa67877, github.com/apache/spark/pull/4451
SPARK-4267 [YARN] Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later
Sean Owen <sowen@cloudera.com>
2015-02-09 10:33:57 -0800
Commit: c88d4ab, github.com/apache/spark/pull/4452
[SPARK-5473] [EC2] Expose SSH failures after status checks pass
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-09 09:44:53 +0000
Commit: f2aa7b7, github.com/apache/spark/pull/4262
[SPARK-5539][MLLIB] LDA guide
Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
2015-02-08 23:40:36 -0800
Commit: 5782ee2, github.com/apache/spark/pull/4465
[SPARK-5472][SQL] Fix Scala code style
Hung Lin <hung@zoomdata.com>
2015-02-08 22:36:42 -0800
Commit: 955f286, github.com/apache/spark/pull/4464
SPARK-4405 [MLLIB] Matrices.* construction methods should check for rows x cols overflow
Sean Owen <sowen@cloudera.com>
2015-02-08 21:08:50 -0800
Commit: fa8ea48, github.com/apache/spark/pull/4461
[SPARK-5660][MLLIB] Make Matrix apply public
Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
2015-02-08 21:07:36 -0800
Commit: df9b105, github.com/apache/spark/pull/4447
[SPARK-5643][SQL] Add a show method to print the content of a DataFrame in tabular format.
Reynold Xin <rxin@databricks.com>
2015-02-08 18:56:51 -0800
Commit: e1996aa, github.com/apache/spark/pull/4416
SPARK-5665 [DOCS] Update netlib-java documentation
Sam Halliday <sam.halliday@Gmail.com>, Sam Halliday <sam.halliday@gmail.com>
2015-02-08 16:34:26 -0800
Commit: c515634, github.com/apache/spark/pull/4448
[SPARK-5598][MLLIB] model save/load for ALS
Xiangrui Meng <meng@databricks.com>
2015-02-08 16:26:20 -0800
Commit: 9e4d58f, github.com/apache/spark/pull/4422
[SQL] Set sessionState in QueryExecution.
Yin Huai <yhuai@databricks.com>
2015-02-08 14:55:07 -0800
Commit: 42c56b6, github.com/apache/spark/pull/4445
[SPARK-3039] [BUILD] Spark assembly for new hadoop API (hadoop 2) contai...
medale <medale94@yahoo.com>
2015-02-08 10:35:29 +0000
Commit: bc55e20, github.com/apache/spark/pull/4315
[SPARK-5672][Web UI] Don't return `ERROR 500` when have missing args
Kirill A. Korinskiy <catap@catap.ru>
2015-02-08 10:31:46 +0000
Commit: 96010fa, github.com/apache/spark/pull/4239
[SPARK-5671] Upgrade jets3t to 0.9.2 in hadoop-2.3 and 2.4 profiles
Josh Rosen <joshrosen@databricks.com>
2015-02-07 17:19:08 -0800
Commit: 0f9d765, github.com/apache/spark/pull/4454
[SPARK-5108][BUILD] Jackson dependency management for Hadoop-2.6.0 support
Zhan Zhang <zhazhan@gmail.com>
2015-02-07 19:41:30 +0000
Commit: 51fbca4, github.com/apache/spark/pull/3938
[BUILD] Add the ability to launch spark-shell from SBT.
Michael Armbrust <michael@databricks.com>
2015-02-07 00:14:38 -0800
Commit: 6bda169, github.com/apache/spark/pull/4438
[SPARK-5388] Provide a stable application submission gateway for standalone cluster mode
Andrew Or <andrew@databricks.com>
2015-02-06 15:57:06 -0800
Commit: 6ec0cdc, github.com/apache/spark/pull/4216
SPARK-5403: Ignore UserKnownHostsFile in SSH calls
Grzegorz Dubicki <grzegorz.dubicki@gmail.com>
2015-02-06 15:43:58 -0800
Commit: 3d99741, github.com/apache/spark/pull/4196
[SPARK-5601][MLLIB] make streaming linear algorithms Java-friendly
Xiangrui Meng <meng@databricks.com>
2015-02-06 15:42:59 -0800
Commit: 11b28b9, github.com/apache/spark/pull/4432
[SQL] [Minor] HiveParquetSuite was disabled by mistake, re-enable them
Cheng Lian <lian@databricks.com>
2015-02-06 15:23:42 -0800
Commit: 4005802, github.com/apache/spark/pull/4440
[SQL] Use TestSQLContext in Java tests
Michael Armbrust <michael@databricks.com>
2015-02-06 15:11:02 -0800
Commit: c950058, github.com/apache/spark/pull/4441
[SPARK-4994][network]Cleanup removed executors' ShuffleInfo in yarn shuffle service
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 14:47:52 -0800
Commit: af6ddf8, github.com/apache/spark/pull/3828
[SPARK-5444][Network]Add a retry to deal with the conflict port in netty server.
huangzhaowei <carlmartinmax@gmail.com>
2015-02-06 14:35:29 -0800
Commit: caca15a, github.com/apache/spark/pull/4240
[SPARK-4874] [CORE] Collect record count metrics
Kostas Sakellis <kostas@cloudera.com>
2015-02-06 14:31:20 -0800
Commit: 9fa29a6, github.com/apache/spark/pull/4067
[HOTFIX] Fix the maven build after adding sqlContext to spark-shell
Michael Armbrust <michael@databricks.com>
2015-02-06 14:27:06 -0800
Commit: 11dbf71, github.com/apache/spark/pull/4443
[SPARK-5600] [core] Clean up FsHistoryProvider test, fix app sort order.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-06 14:23:09 -0800
Commit: 09feecc, github.com/apache/spark/pull/4370
SPARK-5633 pyspark saveAsTextFile support for compression codec
Vladimir Vladimirov <vladimir.vladimirov@magnetic.com>
2015-02-06 13:55:02 -0800
Commit: 1d32341, github.com/apache/spark/pull/4403
[HOTFIX][MLLIB] fix a compilation error with java 6
Xiangrui Meng <meng@databricks.com>
2015-02-06 13:52:35 -0800
Commit: 87e0f0d, github.com/apache/spark/pull/4442
[SPARK-4983] Insert waiting time before tagging EC2 instances
GenTang <gen.tang86@gmail.com>, Gen TANG <gen.tang86@gmail.com>
2015-02-06 13:27:34 -0800
Commit: 2872d83, github.com/apache/spark/pull/3986
[SPARK-5586][Spark Shell][SQL] Make `sqlContext` available in spark shell
OopsOutOfMemory <victorshengli@126.com>
2015-02-06 13:20:10 -0800
Commit: 2ef9853, github.com/apache/spark/pull/4387
[SPARK-5278][SQL] Introduce UnresolvedGetField and complete the check of ambiguous reference to fields
Wenchen Fan <cloud0fan@outlook.com>
2015-02-06 13:08:09 -0800
Commit: 1b148ad, github.com/apache/spark/pull/4068
[SQL][Minor] Remove cache keyword in SqlParser
wangfei <wangfei1@huawei.com>
2015-02-06 12:42:23 -0800
Commit: d822606, github.com/apache/spark/pull/4393
[SQL][HiveConsole][DOC] HiveConsole `correct hiveconsole imports`
OopsOutOfMemory <victorshengli@126.com>
2015-02-06 12:41:28 -0800
Commit: 2abaa6e, github.com/apache/spark/pull/4389
[SPARK-5595][SPARK-5603][SQL] Add a rule to do PreInsert type casting and field renaming and invalidating in memory cache after INSERT
Yin Huai <yhuai@databricks.com>
2015-02-06 12:38:07 -0800
Commit: 3c34d62, github.com/apache/spark/pull/4373
[SPARK-5324][SQL] Results of describe can't be queried
OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
2015-02-06 12:33:20 -0800
Commit: 0fc35da, github.com/apache/spark/pull/4249
[SPARK-5619][SQL] Support 'show roles' in HiveContext
q00251598 <qiyadong@huawei.com>
2015-02-06 12:29:26 -0800
Commit: cc66a3c, github.com/apache/spark/pull/4397
[SPARK-5640] Synchronize ScalaReflection where necessary
Tobias Schlatter <tobias@meisch.ch>
2015-02-06 12:15:02 -0800
Commit: 779e28b, github.com/apache/spark/pull/4431
[SPARK-5650][SQL] Support optional 'FROM' clause
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-06 12:13:44 -0800
Commit: 921121d, github.com/apache/spark/pull/4426
[SPARK-5628] Add version option to spark-ec2
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-06 12:08:22 -0800
Commit: ab0ffde, github.com/apache/spark/pull/4414
[SPARK-2945][YARN][Doc]add doc for spark.executor.instances
WangTaoTheTonic <wangtao111@huawei.com>
2015-02-06 11:57:02 -0800
Commit: 540f474, github.com/apache/spark/pull/4350
[SPARK-4361][Doc] Add more docs for Hadoop Configuration
zsxwing <zsxwing@gmail.com>
2015-02-06 11:50:20 -0800
Commit: 528dd34, github.com/apache/spark/pull/3225
[HOTFIX] Fix test build break in ExecutorAllocationManagerSuite.
Josh Rosen <joshrosen@databricks.com>
2015-02-06 11:47:32 -0800
Commit: 9e828f4
[SPARK-5652][Mllib] Use broadcasted weights in LogisticRegressionModel
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-06 11:22:11 -0800
Commit: 6fda4c1, github.com/apache/spark/pull/4429
[SPARK-5555] Enable UISeleniumSuite tests
Josh Rosen <joshrosen@databricks.com>
2015-02-06 11:14:58 -0800
Commit: 93fee7b, github.com/apache/spark/pull/4334
SPARK-2450 Adds executor log links to Web UI
Kostas Sakellis <kostas@cloudera.com>, Josh Rosen <joshrosen@databricks.com>
2015-02-06 11:13:00 -0800
Commit: e74dd04, github.com/apache/spark/pull/3486
[SPARK-5618][Spark Core][Minor] Optimise utility code.
Makoto Fukuhara <fukuo33@gmail.com>
2015-02-06 11:11:38 -0800
Commit: 3feb798, github.com/apache/spark/pull/4396
[SPARK-5593][Core]Replace BlockManagerListener with ExecutorListener in ExecutorAllocationListener
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 11:09:37 -0800
Commit: 9387dc1, github.com/apache/spark/pull/4369
[SPARK-4877] Allow user first classes to extend classes in the parent.
Stephen Haberman <stephen@exigencecorp.com>
2015-02-06 11:03:56 -0800
Commit: 52386cf, github.com/apache/spark/pull/3725
[SPARK-5396] Syntax error in spark scripts on windows.
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-02-06 10:58:26 -0800
Commit: 2dc94cd, github.com/apache/spark/pull/4428
[SPARK-5636] Ramp up faster in dynamic allocation
Andrew Or <andrew@databricks.com>
2015-02-06 10:54:23 -0800
Commit: 0a90305, github.com/apache/spark/pull/4409
SPARK-4337. [YARN] Add ability to cancel pending requests
Sandy Ryza <sandy@cloudera.com>
2015-02-06 10:53:16 -0800
Commit: 1568391, github.com/apache/spark/pull/4141
[SPARK-5416] init Executor.threadPool before ExecutorSource
Ryan Williams <ryan.blake.williams@gmail.com>
2015-02-06 12:22:25 +0000
Commit: f9bc4ef, github.com/apache/spark/pull/4212
[Build] Set all Debian package permissions to 755
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-06 11:38:39 +0000
Commit: 3638216, github.com/apache/spark/pull/4277
Update ec2-scripts.md
Miguel Peralvo <miguel.peralvo@gmail.com>
2015-02-06 11:04:48 +0000
Commit: f6613fc, github.com/apache/spark/pull/4300
[SPARK-5470][Core]use defaultClassLoader to load classes in KryoSerializer
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 11:00:35 +0000
Commit: 8007a4f, github.com/apache/spark/pull/4258
[SPARK-5653][YARN] In ApplicationMaster rename isDriver to isClusterMode
lianhuiwang <lianhuiwang09@gmail.com>
2015-02-06 10:48:31 -0800
Commit: 4ff8855, github.com/apache/spark/pull/4430
[SPARK-5582] [history] Ignore empty log directories.
Marcelo Vanzin <vanzin@cloudera.com>
2015-02-06 10:07:20 +0000
Commit: faccdcb, github.com/apache/spark/pull/4352
[SPARK-5157][YARN] Configure more JVM options properly when we use ConcMarkSweepGC for AM.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-02-06 09:39:12 +0000
Commit: 25d8044, github.com/apache/spark/pull/3956
[Minor] Remove permission for execution from spark-shell.cmd
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-02-06 09:33:36 +0000
Commit: 7c54681, github.com/apache/spark/pull/3983
[SPARK-5380][GraphX] Solve an ArrayIndexOutOfBoundsException when build graph with a file format error
Leolh <leosandylh@gmail.com>
2015-02-06 09:01:53 +0000
Commit: ffdb2e9, github.com/apache/spark/pull/4176
[SPARK-5013] [MLlib] Added documentation and sample data file for GaussianMixture
Travis Galoppo <tjg2107@columbia.edu>
2015-02-06 10:26:51 -0800
Commit: f408db6, github.com/apache/spark/pull/4401
[SPARK-4789] [SPARK-4942] [SPARK-5031] [mllib] Standardize ML Prediction APIs
Joseph K. Bradley <joseph@databricks.com>
2015-02-05 23:43:47 -0800
Commit: 45b95e7, github.com/apache/spark/pull/3637
[SPARK-5604][MLLIB] remove checkpointDir from trees
Xiangrui Meng <meng@databricks.com>
2015-02-05 23:32:09 -0800
Commit: c35a11e, github.com/apache/spark/pull/4407
[SPARK-5639][SQL] Support DataFrame.renameColumn.
Reynold Xin <rxin@databricks.com>
2015-02-05 23:02:40 -0800
Commit: 0639d3e, github.com/apache/spark/pull/4410
Revert "SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2."
Patrick Wendell <patrick@databricks.com>
2015-02-05 18:37:55 -0800
Commit: 6d31531
SPARK-5557: Explicitly include servlet API in dependencies.
Patrick Wendell <patrick@databricks.com>
2015-02-05 18:14:54 -0800
Commit: 34131fd, github.com/apache/spark/pull/4411
[HOTFIX] [SQL] Disables Metastore Parquet table conversion for "SQLQuerySuite.CTAS with serde"
Cheng Lian <lian@databricks.com>
2015-02-05 18:09:18 -0800
Commit: ce6d8bb, github.com/apache/spark/pull/4413
[SPARK-5638][SQL] Add a config flag to disable eager analysis of DataFrames
Reynold Xin <rxin@databricks.com>
2015-02-05 18:07:10 -0800
Commit: 4fd67e4, github.com/apache/spark/pull/4408
[SPARK-5620][DOC] group methods in generated unidoc
Xiangrui Meng <meng@databricks.com>
2015-02-05 16:26:51 -0800
Commit: e2be79d, github.com/apache/spark/pull/4404
[SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source improvements
Cheng Lian <lian@databricks.com>
2015-02-05 15:29:56 -0800
Commit: 50c48eb, github.com/apache/spark/pull/4308
[SPARK-5604][MLLIB] remove checkpointDir from LDA
Xiangrui Meng <meng@databricks.com>
2015-02-05 15:07:33 -0800
Commit: 59798cb, github.com/apache/spark/pull/4390
[SPARK-5460][MLlib] Wrapped `Try` around `deleteAllCheckpoints` - RandomForest.
x1- <viva008@gmail.com>
2015-02-05 15:02:04 -0800
Commit: 44768f5, github.com/apache/spark/pull/4347
[SPARK-5135][SQL] Add support for describe table to DDL in SQLContext
OopsOutOfMemory <victorshengli@126.com>
2015-02-05 13:07:48 -0800
Commit: 55cebcf, github.com/apache/spark/pull/4227
[SPARK-5617][SQL] fix test failure of SQLQuerySuite
wangfei <wangfei1@huawei.com>
2015-02-05 12:44:12 -0800
Commit: 785a2e3, github.com/apache/spark/pull/4395
[Branch-1.3] [DOC] doc fix for date
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-05 12:42:27 -0800
Commit: 17ef7f9, github.com/apache/spark/pull/4400
[SPARK-5474][Build]curl should support URL redirection in build/mvn
GuoQiang Li <witgo@qq.com>
2015-02-05 12:03:13 -0800
Commit: d1066e9, github.com/apache/spark/pull/4263
[HOTFIX] MLlib build break.
Reynold Xin <rxin@databricks.com>
2015-02-05 00:42:50 -0800
Commit: c83d118
SPARK-5548: Fixed a race condition in AkkaUtilsSuite
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-02-05 12:00:04 -0800
Commit: fba2dc6, github.com/apache/spark/pull/4343
[SPARK-5608] Improve SEO of Spark documentation pages
Matei Zaharia <matei@databricks.com>
2015-02-05 11:12:50 -0800
Commit: de112a2, github.com/apache/spark/pull/4381
SPARK-4687. Add a recursive option to the addFile API
Sandy Ryza <sandy@cloudera.com>
2015-02-05 10:15:55 -0800
Commit: c22ccc0, github.com/apache/spark/pull/3670
[MLlib] Minor: UDF style update.
Reynold Xin <rxin@databricks.com>
2015-02-04 23:57:53 -0800
Commit: 4074674, github.com/apache/spark/pull/4388
[SPARK-5612][SQL] Move DataFrame implicit functions into SQLContext.implicits.
Reynold Xin <rxin@databricks.com>
2015-02-04 23:44:34 -0800
Commit: 0040b61, github.com/apache/spark/pull/4386
[SPARK-5606][SQL] Support plus sign in HiveContext
q00251598 <qiyadong@huawei.com>
2015-02-04 23:16:01 -0800
Commit: bf43781, github.com/apache/spark/pull/4378
[SPARK-5599] Check MLlib public APIs for 1.3
Xiangrui Meng <meng@databricks.com>
2015-02-04 23:03:47 -0800
Commit: abc184e, github.com/apache/spark/pull/4377
[SPARK-5596] [mllib] ML model import/export for GLMs, NaiveBayes
Joseph K. Bradley <joseph@databricks.com>
2015-02-04 22:46:48 -0800
Commit: 885bcbb, github.com/apache/spark/pull/4233
SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2.
Patrick Wendell <patrick@databricks.com>
2015-02-04 22:39:44 -0800
Commit: 59fb5c7, github.com/apache/spark/pull/4383
[SPARK-5602][SQL] Better support for creating DataFrame from local data collection
Reynold Xin <rxin@databricks.com>
2015-02-04 19:53:57 -0800
Commit: b8f9c00, github.com/apache/spark/pull/4372
[SPARK-5538][SQL] Fix flaky CachedTableSuite
Reynold Xin <rxin@databricks.com>
2015-02-04 19:52:41 -0800
Commit: 1901b19, github.com/apache/spark/pull/4379
[SQL][DataFrame] Minor cleanup.
Reynold Xin <rxin@databricks.com>
2015-02-04 19:51:48 -0800
Commit: f05bfa6, github.com/apache/spark/pull/4374
[SPARK-4520] [SQL] This pr fixes the ArrayIndexOutOfBoundsException as r...
Sadhan Sood <sadhan@tellapart.com>
2015-02-04 19:18:06 -0800
Commit: aa6f4ca, github.com/apache/spark/pull/4148
[SPARK-5605][SQL][DF] Allow using String to specify column name in DSL aggregate functions
Reynold Xin <rxin@databricks.com>
2015-02-04 18:35:51 -0800
Commit: 478ee3f, github.com/apache/spark/pull/4376
[SPARK-5411] Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext
Josh Rosen <joshrosen@databricks.com>
2015-02-04 17:18:03 -0800
Commit: 47e4d57, github.com/apache/spark/pull/4111
[SPARK-5577] Python udf for DataFrame
Davies Liu <davies@databricks.com>
2015-02-04 15:55:09 -0800
Commit: dc9ead9, github.com/apache/spark/pull/4351
[SPARK-5118][SQL] Fix: create table test stored as parquet as select ..
guowei2 <guowei2@asiainfo.com>
2015-02-04 15:26:10 -0800
Commit: 06da868, github.com/apache/spark/pull/3921
[SQL] Use HiveContext's sessionState in HiveMetastoreCatalog.hiveDefaultTableFilePath
Yin Huai <yhuai@databricks.com>
2015-02-04 15:22:40 -0800
Commit: cb4c3e5, github.com/apache/spark/pull/4355
[SQL] Correct the default size of TimestampType and expose NumericType
Yin Huai <yhuai@databricks.com>
2015-02-04 15:14:49 -0800
Commit: 513bb2c, github.com/apache/spark/pull/4314
[SQL][Hiveconsole] Bring hive console code up to date and update README.md
OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
2015-02-04 15:13:54 -0800
Commit: 2cdcfe3, github.com/apache/spark/pull/4330
[SPARK-5367][SQL] Support star expression in udfs
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
2015-02-04 15:12:07 -0800
Commit: 8b803f6, github.com/apache/spark/pull/4353
[SPARK-5426][SQL] Add SparkSQL Java API helper methods.
kul <kuldeep.bora@gmail.com>
2015-02-04 15:08:37 -0800
Commit: 38ab92e, github.com/apache/spark/pull/4243
[SPARK-5587][SQL] Support change database owner
wangfei <wangfei1@huawei.com>
2015-02-04 14:35:12 -0800
Commit: 7920791, github.com/apache/spark/pull/4357
[SPARK-5591][SQL] Fix NoSuchObjectException for CTAS
wangfei <wangfei1@huawei.com>
2015-02-04 14:33:07 -0800
Commit: c79dd1e, github.com/apache/spark/pull/4365
[SPARK-4939] move to next locality when no pending tasks
Davies Liu <davies@databricks.com>
2015-02-04 14:22:07 -0800
Commit: f9bb3cb, github.com/apache/spark/pull/3779
[SPARK-4707][STREAMING] Reliable Kafka Receiver can lose data if the blo...
Hari Shreedharan <hshreedharan@apache.org>
2015-02-04 14:20:44 -0800
Commit: 14c9f32, github.com/apache/spark/pull/3655
[SPARK-4964] [Streaming] Exactly-once semantics for Kafka
cody koeninger <cody@koeninger.org>
2015-02-04 12:06:34 -0800
Commit: a119cae, github.com/apache/spark/pull/3798
[SPARK-5588] [SQL] support select/filter by SQL expression
Davies Liu <davies@databricks.com>
2015-02-04 11:34:46 -0800
Commit: 950a0d3, github.com/apache/spark/pull/4359
[SPARK-5585] Flaky test in MLlib python
Davies Liu <davies@databricks.com>
2015-02-04 08:54:20 -0800
Commit: 84c6273, github.com/apache/spark/pull/4358
[SPARK-5574] use given name prefix in dir
Imran Rashid <irashid@cloudera.com>
2015-02-04 01:02:20 -0800
Commit: 5d9278a, github.com/apache/spark/pull/4344
[Minor] Fix incorrect warning log
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-04 00:52:41 -0800
Commit: 316a4bb, github.com/apache/spark/pull/4360
[SPARK-5379][Streaming] Add awaitTerminationOrTimeout
zsxwing <zsxwing@gmail.com>
2015-02-04 00:40:28 -0800
Commit: 4d3dbfd, github.com/apache/spark/pull/4171
[SPARK-5341] Use maven coordinates as dependencies in spark-shell and spark-submit
Burak Yavuz <brkyvz@gmail.com>
2015-02-03 22:39:17 -0800
Commit: 3b7acd2, github.com/apache/spark/pull/4215
[SPARK-4939] revive offers periodically in LocalBackend
Davies Liu <davies@databricks.com>
2015-02-03 22:30:23 -0800
Commit: e196da8, github.com/apache/spark/pull/4147
[SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming
freeman <the.freeman.lab@gmail.com>
2015-02-03 22:24:30 -0800
Commit: 9a33f89, github.com/apache/spark/pull/3803
[SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
Reynold Xin <rxin@databricks.com>
2015-02-03 22:15:35 -0800
Commit: cb7f783, github.com/apache/spark/pull/4348
[FIX][MLLIB] fix seed handling in Python GMM
Xiangrui Meng <meng@databricks.com>
2015-02-03 20:39:11 -0800
Commit: 679228b, github.com/apache/spark/pull/4349
[SPARK-4795][Core] Redesign the "primitive type => Writable" implicit APIs to make them be activated automatically
zsxwing <zsxwing@gmail.com>
2015-02-03 20:17:12 -0800
Commit: 5c63e05, github.com/apache/spark/pull/3642
[SPARK-5578][SQL][DataFrame] Provide a convenient way for Scala users to use UDFs
Reynold Xin <rxin@databricks.com>
2015-02-03 20:07:46 -0800
Commit: b22d5b5, github.com/apache/spark/pull/4345
[SPARK-5520][MLlib] Make FP-Growth implementation take generic item types (WIP)
Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
2015-02-03 17:02:42 -0800
Commit: 298ef5b, github.com/apache/spark/pull/4340
[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
Davies Liu <davies@databricks.com>
2015-02-03 16:01:56 -0800
Commit: 4640623, github.com/apache/spark/pull/4331
[STREAMING] SPARK-4986 Wait for receivers to deregister and receiver job to terminate
Jesper Lundgren <jesper.lundgren@vpon.com>
2015-02-03 14:53:39 -0800
Commit: 092d4ba, github.com/apache/spark/pull/4338
[SPARK-5153][Streaming][Test] Increased timeout to deal with flaky KafkaStreamSuite
Tathagata Das <tathagata.das1565@gmail.com>
2015-02-03 13:46:02 -0800
Commit: d644bd9, github.com/apache/spark/pull/4342
[SPARK-4508] [SQL] build native date type to conform behavior to Hive
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-03 12:21:45 -0800
Commit: 6e244cf, github.com/apache/spark/pull/4325
[SPARK-5383][SQL] Support alias for udtfs
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2015-02-03 12:16:31 -0800
Commit: 5dbeb21, github.com/apache/spark/pull/4186
[SPARK-5550] [SQL] Support the case insensitive for UDF
Cheng Hao <hao.cheng@intel.com>
2015-02-03 12:12:26 -0800
Commit: 654c992, github.com/apache/spark/pull/4326
[SPARK-4987] [SQL] parquet timestamp type support
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-03 12:06:06 -0800
Commit: 67d5220, github.com/apache/spark/pull/3820
[SQL] DataFrame API update
Reynold Xin <rxin@databricks.com>
2015-02-03 10:34:56 -0800
Commit: 4204a12, github.com/apache/spark/pull/4332
Minor: Fix TaskContext deprecated annotations.
Reynold Xin <rxin@databricks.com>
2015-02-03 10:34:16 -0800
Commit: f7948f3, github.com/apache/spark/pull/4333
[SPARK-5549] Define TaskContext interface in Scala.
Reynold Xin <rxin@databricks.com>
2015-02-03 00:46:04 -0800
Commit: bebf4c4, github.com/apache/spark/pull/4324
[SPARK-5551][SQL] Create type alias for SchemaRDD for source backward compatibility
Reynold Xin <rxin@databricks.com>
2015-02-03 00:29:23 -0800
Commit: 523a935, github.com/apache/spark/pull/4327
[SQL][DataFrame] Remove DataFrameApi, ExpressionApi, and GroupedDataFrameApi
Reynold Xin <rxin@databricks.com>
2015-02-03 00:29:04 -0800
Commit: 37df330, github.com/apache/spark/pull/4328
[minor] update streaming linear algorithms
Xiangrui Meng <meng@databricks.com>
2015-02-03 00:14:43 -0800
Commit: 659329f, github.com/apache/spark/pull/4329
[SPARK-1405] [mllib] Latent Dirichlet Allocation (LDA) using EM
Xiangrui Meng <meng@databricks.com>
2015-02-02 23:57:35 -0800
Commit: 980764f, github.com/apache/spark/pull/2388
[SPARK-5536] replace old ALS implementation by the new one
Xiangrui Meng <meng@databricks.com>
2015-02-02 23:49:09 -0800
Commit: 0cc7b88, github.com/apache/spark/pull/4321
[SPARK-5414] Add SparkFirehoseListener class for consuming all SparkListener events
Josh Rosen <joshrosen@databricks.com>
2015-02-02 23:35:07 -0800
Commit: b8ebebe, github.com/apache/spark/pull/4210
[SPARK-5501][SPARK-5420][SQL] Write support for the data source API
Yin Huai <yhuai@databricks.com>
2015-02-02 23:30:44 -0800
Commit: 13531dd, github.com/apache/spark/pull/4294
[SPARK-5012][MLLib][PySpark]Python API for Gaussian Mixture Model
FlytxtRnD <meethu.mathew@flytxt.com>
2015-02-02 23:04:55 -0800
Commit: 50a1a87, github.com/apache/spark/pull/4059
[SPARK-3778] newAPIHadoopRDD doesn't properly pass credentials for secure hdfs
Thomas Graves <tgraves@apache.org>
2015-02-02 22:45:55 -0800
Commit: c31c36c, github.com/apache/spark/pull/4292
[SPARK-4979][MLLIB] Streaming logistic regression
freeman <the.freeman.lab@gmail.com>
2015-02-02 22:42:15 -0800
Commit: eb0da6c, github.com/apache/spark/pull/4306
[SPARK-5219][Core] Add locks to avoid scheduling race conditions
zsxwing <zsxwing@gmail.com>
2015-02-02 21:42:18 -0800
Commit: c306555, github.com/apache/spark/pull/4019
[Doc] Minor: Fixes several formatting issues
Cheng Lian <lian@databricks.com>
2015-02-02 21:14:21 -0800
Commit: 60f67e7, github.com/apache/spark/pull/4316
SPARK-3996: Add jetty servlet and continuations.
Patrick Wendell <patrick@databricks.com>
2015-02-02 21:01:36 -0800
Commit: 7930d2b, github.com/apache/spark/pull/4323
SPARK-5542: Decouple publishing, packaging, and tagging in release script
Patrick Wendell <patrick@databricks.com>, Patrick Wendell <pwendell@gmail.com>
2015-02-02 21:00:30 -0800
Commit: 0ef38f5, github.com/apache/spark/pull/4319
[SPARK-5543][WebUI] Remove unused import JsonUtil from JsonProtocol
nemccarthy <nathan@nemccarthy.me>
2015-02-02 20:03:13 -0800
Commit: cb39f12, github.com/apache/spark/pull/4320
[SPARK-5472][SQL] A JDBC data source for Spark SQL.
Tor Myklebust <tmyklebu@gmail.com>
2015-02-02 19:50:14 -0800
Commit: 8f471a6, github.com/apache/spark/pull/4261
[SPARK-5512][Mllib] Run the PIC algorithm with initial vector suggested by the PIC paper
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-02 19:34:25 -0800
Commit: 1bcd465, github.com/apache/spark/pull/4301
[SPARK-5154] [PySpark] [Streaming] Kafka streaming support in Python
Davies Liu <davies@databricks.com>, Tathagata Das <tdas@databricks.com>
2015-02-02 19:16:27 -0800
Commit: 0561c45, github.com/apache/spark/pull/3715
[SQL] Improve DataFrame API error reporting
Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
2015-02-02 19:01:47 -0800
Commit: 554403f, github.com/apache/spark/pull/4296
Revert "[SPARK-4508] [SQL] build native date type to conform behavior to Hive"
Patrick Wendell <patrick@databricks.com>
2015-02-02 17:52:17 -0800
Commit: eccb9fb
Spark 3883: SSL support for HttpServer and Akka
Jacek Lewandowski <lewandowski.jacek@gmail.com>, Jacek Lewandowski <jacek.lewandowski@datastax.com>
2015-02-02 17:18:54 -0800
Commit: cfea300, github.com/apache/spark/pull/3571
[SPARK-5540] hide ALS.solveLeastSquares
Xiangrui Meng <meng@databricks.com>
2015-02-02 17:10:01 -0800
Commit: ef65cf0, github.com/apache/spark/pull/4318
[SPARK-5534] [graphx] Graph getStorageLevel fix
Joseph K. Bradley <joseph@databricks.com>
2015-02-02 17:02:29 -0800
Commit: f133dec, github.com/apache/spark/pull/4317
[SPARK-5514] DataFrame.collect should call executeCollect
Reynold Xin <rxin@databricks.com>
2015-02-02 16:55:36 -0800
Commit: 8aa3cff, github.com/apache/spark/pull/4313
[SPARK-5195][sql]Update HiveMetastoreCatalog.scala(override the MetastoreRelation's sameresult method only compare databasename and table name)
seayi <405078363@qq.com>, Michael Armbrust <michael@databricks.com>
2015-02-02 16:06:52 -0800
Commit: dca6faa, github.com/apache/spark/pull/3898
[SPARK-2309][MLlib] Multinomial Logistic Regression
DB Tsai <dbtsai@alpinenow.com>
2015-02-02 15:59:15 -0800
Commit: b1aa8fe, github.com/apache/spark/pull/3833
[SPARK-5513][MLLIB] Add nonnegative option to ml's ALS
Xiangrui Meng <meng@databricks.com>
2015-02-02 15:55:44 -0800
Commit: 46d50f1, github.com/apache/spark/pull/4302
[SPARK-4508] [SQL] build native date type to conform behavior to Hive
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-02 15:49:22 -0800
Commit: 1646f89, github.com/apache/spark/pull/3732
SPARK-5500. Document that feeding hadoopFile into a shuffle operation wi...
Sandy Ryza <sandy@cloudera.com>
2015-02-02 14:52:46 -0800
Commit: 8309349, github.com/apache/spark/pull/4293
[SPARK-5461] [graphx] Add isCheckpointed, getCheckpointedFiles methods to Graph
Joseph K. Bradley <joseph@databricks.com>
2015-02-02 14:34:48 -0800
Commit: 842d000, github.com/apache/spark/pull/4253
SPARK-5425: Use synchronised methods in system properties to create SparkConf
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-02-02 14:07:19 -0800
Commit: 5a55261, github.com/apache/spark/pull/4222
Disabling Utils.chmod700 for Windows
Martin Weindel <martin.weindel@gmail.com>, mweindel <m.weindel@usu-software.de>
2015-02-02 13:46:18 -0800
Commit: bff65b5, github.com/apache/spark/pull/4299
Make sure only owner can read / write to directories created for the job.
Josh Rosen <joshrosen@databricks.com>
2015-01-21 14:38:14 -0800
Commit: 52f5754
[HOTFIX] Add jetty references to build for YARN module.
Patrick Wendell <patrick@databricks.com>
2015-02-02 14:00:14 -0800
Commit: 2321dd1
[SPARK-4631][streaming][FIX] Wait for a receiver to start before publishing test data.
Iulian Dragos <jaguarul@gmail.com>
2015-02-02 14:00:33 -0800
Commit: e908322, github.com/apache/spark/pull/4270
[SPARK-5212][SQL] Add support of schema-less, custom field delimiter and SerDe for HiveQL transform
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-02 13:53:55 -0800
Commit: 683e938, github.com/apache/spark/pull/4014
[SPARK-5530] Add executor container to executorIdToContainer
Xutingjun <1039320815@qq.com>
2015-02-02 12:37:51 -0800
Commit: 62a93a1, github.com/apache/spark/pull/4309
[Docs] Fix Building Spark link text
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-02-02 12:33:49 -0800
Commit: 3f941b6, github.com/apache/spark/pull/4312
[SPARK-5173]support python application running on yarn cluster mode
lianhuiwang <lianhuiwang09@gmail.com>, Wang Lianhui <lianhuiwang09@gmail.com>
2015-02-02 12:32:28 -0800
Commit: f5e6375, github.com/apache/spark/pull/3976
SPARK-4585. Spark dynamic executor allocation should use minExecutors as...
Sandy Ryza <sandy@cloudera.com>
2015-02-02 12:27:08 -0800
Commit: b2047b5, github.com/apache/spark/pull/4051
[MLLIB] SPARK-5491 (ex SPARK-1473): Chi-square feature selection
Alexander Ulanov <nashb@yandex.ru>
2015-02-02 12:13:05 -0800
Commit: c081b21, github.com/apache/spark/pull/1484
SPARK-5492. Thread statistics can break with older Hadoop versions
Sandy Ryza <sandy@cloudera.com>
2015-02-02 00:54:06 -0800
Commit: 6f34131, github.com/apache/spark/pull/4305
[SPARK-5478][UI][Minor] Add missing right parentheses
jerryshao <saisai.shao@intel.com>
2015-02-01 23:56:13 -0800
Commit: 63dfe21, github.com/apache/spark/pull/4267
[SPARK-5353] Log failures in REPL class loading
Tobias Schlatter <tobias@meisch.ch>
2015-02-01 21:43:49 -0800
Commit: 9f0a6e1, github.com/apache/spark/pull/4130
[SPARK-3996]: Shade Jetty in Spark deliverables
Patrick Wendell <patrick@databricks.com>
2015-02-01 21:13:57 -0800
Commit: a15f6e3, github.com/apache/spark/pull/4285
[SPARK-4001][MLlib] adding parallel FP-Growth algorithm for frequent pattern mining in MLlib
Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
2015-02-01 20:07:25 -0800
Commit: 859f724, github.com/apache/spark/pull/2847
[Spark-5406][MLlib] LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound
Yuhao Yang <hhbyyh@gmail.com>
2015-02-01 19:40:26 -0800
Commit: d85cd4e, github.com/apache/spark/pull/4200
[SPARK-5465] [SQL] Fixes filter push-down for Parquet data source
Cheng Lian <lian@databricks.com>
2015-02-01 18:52:39 -0800
Commit: ec10032, github.com/apache/spark/pull/4255
[SPARK-5262] [SPARK-5244] [SQL] add coalesce in SQLParser and widen types for parameters of coalesce
Daoyuan Wang <daoyuan.wang@intel.com>
2015-02-01 18:51:38 -0800
Commit: 8cf4a1f, github.com/apache/spark/pull/4057
[SPARK-5196][SQL] Support `comment` in Create Table Field DDL
OopsOutOfMemory <victorshengli@126.com>
2015-02-01 18:41:49 -0800
Commit: 1b56f1d, github.com/apache/spark/pull/3999
[SPARK-1825] Make Windows Spark client work fine with Linux YARN cluster
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-02-01 18:26:28 -0800
Commit: 7712ed5, github.com/apache/spark/pull/3943
[SPARK-5176] The thrift server does not support cluster mode
Tom Panning <tom.panning@nextcentury.com>
2015-02-01 17:57:31 -0800
Commit: 1ca0a10, github.com/apache/spark/pull/4137
[SPARK-5155] Build fails with spark-ganglia-lgpl profile
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-02-01 17:53:56 -0800
Commit: c80194b, github.com/apache/spark/pull/4303
[Minor][SQL] Little refactor DataFrame related codes
Liang-Chi Hsieh <viirya@gmail.com>
2015-02-01 17:52:18 -0800
Commit: ef89b82, github.com/apache/spark/pull/4298
[SPARK-4859][Core][Streaming] Refactor LiveListenerBus and StreamingListenerBus
zsxwing <zsxwing@gmail.com>
2015-02-01 17:47:51 -0800
Commit: 883bc88, github.com/apache/spark/pull/4006
[SPARK-5424][MLLIB] make the new ALS impl take generic ID types
Xiangrui Meng <meng@databricks.com>
2015-02-01 14:13:31 -0800
Commit: 4a17122, github.com/apache/spark/pull/4281
[SPARK-5207] [MLLIB] StandardScalerModel mean and variance re-use
Octavian Geagla <ogeagla@gmail.com>
2015-02-01 09:21:14 -0800
Commit: bdb0680, github.com/apache/spark/pull/4140
[SPARK-5422] Add support for sending Graphite metrics via UDP
Ryan Williams <ryan.blake.williams@gmail.com>
2015-01-31 23:41:05 -0800
Commit: 80bd715, github.com/apache/spark/pull/4218
SPARK-3359 [CORE] [DOCS] `sbt/sbt unidoc` doesn't work with Java 8
Sean Owen <sowen@cloudera.com>
2015-01-31 10:40:42 -0800
Commit: c84d5a1, github.com/apache/spark/pull/4193
[SPARK-3975] Added support for BlockMatrix addition and multiplication
Burak Yavuz <brkyvz@gmail.com>, Burak Yavuz <brkyvz@dn51t42l.sunet>, Burak Yavuz <brkyvz@dn51t4rd.sunet>, Burak Yavuz <brkyvz@dn0a221430.sunet>, Burak Yavuz <brkyvz@dn0a22b17d.sunet>
2015-01-31 00:47:30 -0800
Commit: ef8974b, github.com/apache/spark/pull/4274
[MLLIB][SPARK-3278] Monotone (Isotonic) regression using parallel pool adjacent violators algorithm
martinzapletal <zapletal-martin@email.cz>, Xiangrui Meng <meng@databricks.com>, Martin Zapletal <zapletal-martin@email.cz>
2015-01-31 00:46:02 -0800
Commit: 34250a6, github.com/apache/spark/pull/3519
[SPARK-5307] Add a config option for SerializationDebugger.
Reynold Xin <rxin@databricks.com>
2015-01-31 00:06:36 -0800
Commit: 6364083, github.com/apache/spark/pull/4297
[SQL] remove redundant field "childOutput" from execution.Aggregate, use child.output instead
kai <kaizeng@eecs.berkeley.edu>
2015-01-30 23:19:10 -0800
Commit: f54c9f6, github.com/apache/spark/pull/4291
[SPARK-5307] SerializationDebugger
Reynold Xin <rxin@databricks.com>
2015-01-30 22:34:10 -0800
Commit: 740a568, github.com/apache/spark/pull/4098
[SPARK-5504] [sql] convertToCatalyst should support nested arrays
Joseph K. Bradley <joseph@databricks.com>
2015-01-30 15:40:14 -0800
Commit: e643de4, github.com/apache/spark/pull/4295
SPARK-5400 [MLlib] Changed name of GaussianMixtureEM to GaussianMixture
Travis Galoppo <tjg2107@columbia.edu>
2015-01-30 15:32:25 -0800
Commit: 9869773, github.com/apache/spark/pull/4290
[SPARK-4259][MLlib]: Add Power Iteration Clustering Algorithm with Gaussian Similarity Function
sboeschhuawei <stephen.boesch@huawei.com>, Fan Jiang <fanjiang.sc@huawei.com>, Jiang Fan <fjiang6@gmail.com>, Stephen Boesch <stephen.boesch@huawei.com>, Xiangrui Meng <meng@databricks.com>
2015-01-30 14:09:49 -0800
Commit: f377431, github.com/apache/spark/pull/4254
[SPARK-5486] Added validate method to BlockMatrix
Burak Yavuz <brkyvz@gmail.com>
2015-01-30 13:59:10 -0800
Commit: 6ee8338, github.com/apache/spark/pull/4279
[SPARK-5496][MLLIB] Allow both classification and Classification in Algo for trees.
Xiangrui Meng <meng@databricks.com>
2015-01-30 10:08:07 -0800
Commit: 0a95085, github.com/apache/spark/pull/4287
[MLLIB] SPARK-4846: throw a RuntimeException and give users hints to increase the minCount
Joseph J.C. Tang <jinntrance@gmail.com>
2015-01-30 10:07:26 -0800
Commit: 54d9575, github.com/apache/spark/pull/4247
SPARK-5393. Flood of util.RackResolver log messages after SPARK-1714
Sandy Ryza <sandy@cloudera.com>
2015-01-30 11:31:54 -0600
Commit: 254eaa4, github.com/apache/spark/pull/4192
[SPARK-5457][SQL] Add missing DSL for ApproxCountDistinct.
Takuya UESHIN <ueshin@happy-camper.st>
2015-01-30 01:21:35 -0800
Commit: 6f21dce, github.com/apache/spark/pull/4250
[SPARK-5094][MLlib] Add Python API for Gradient Boosted Trees
Kazuki Taniguchi <kazuki.t.1018@gmail.com>
2015-01-30 00:39:44 -0800
Commit: bc1fc9b, github.com/apache/spark/pull/3951
[SPARK-5322] Added transpose functionality to BlockMatrix
Burak Yavuz <brkyvz@gmail.com>
2015-01-29 21:26:29 -0800
Commit: dd4d84c, github.com/apache/spark/pull/4275
[SQL] Support df("*") to select all columns in a data frame.
Reynold Xin <rxin@databricks.com>
2015-01-29 19:09:08 -0800
Commit: 80def9d, github.com/apache/spark/pull/4283
[SPARK-5462] [SQL] Use analyzed query plan in DataFrame.apply()
Josh Rosen <joshrosen@databricks.com>
2015-01-29 18:23:05 -0800
Commit: 22271f9, github.com/apache/spark/pull/4282
[SPARK-5395] [PySpark] fix python process leak while coalesce()
Davies Liu <davies@databricks.com>
2015-01-29 17:28:37 -0800
Commit: 5c746ee, github.com/apache/spark/pull/4238
[SQL] DataFrame API improvements
Reynold Xin <rxin@databricks.com>
2015-01-29 17:24:00 -0800
Commit: ce9c43b, github.com/apache/spark/pull/4280
Revert "[WIP] [SPARK-3996]: Shade Jetty in Spark deliverables"
Patrick Wendell <patrick@databricks.com>
2015-01-29 17:14:27 -0800
Commit: d2071e8
remove 'return'
Yoshihiro Shimizu <shimizu@amoad.com>
2015-01-29 16:55:00 -0800
Commit: 5338772, github.com/apache/spark/pull/4268
[WIP] [SPARK-3996]: Shade Jetty in Spark deliverables
Patrick Wendell <patrick@databricks.com>
2015-01-29 16:31:19 -0800
Commit: f240fe3, github.com/apache/spark/pull/4252
[SPARK-5464] Fix help() for Python DataFrame instances
Josh Rosen <joshrosen@databricks.com>
2015-01-29 16:23:20 -0800
Commit: 0bb15f2, github.com/apache/spark/pull/4278
[SPARK-4296][SQL] Trims aliases when resolving and checking aggregate expressions
Yin Huai <yhuai@databricks.com>, Cheng Lian <lian@databricks.com>
2015-01-29 15:49:34 -0800
Commit: c00d517, github.com/apache/spark/pull/4010
[SPARK-5373][SQL] Literal in agg grouping expressions leads to incorrect result
wangfei <wangfei1@huawei.com>
2015-01-29 15:47:13 -0800
Commit: c1b3eeb, github.com/apache/spark/pull/4169
[SPARK-5367][SQL] Support star expression in udf
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
2015-01-29 15:44:53 -0800
Commit: fbaf9e0, github.com/apache/spark/pull/4163
[SPARK-4786][SQL]: Parquet filter pushdown for castable types
Yash Datta <Yash.Datta@guavus.com>
2015-01-29 15:42:23 -0800
Commit: de221ea, github.com/apache/spark/pull/4156
[SPARK-5309][SQL] Add support for dictionaries in PrimitiveConverter for Strin...
Michael Davies <Michael.BellDavies@gmail.com>
2015-01-29 15:40:59 -0800
Commit: 940f375, github.com/apache/spark/pull/4187
[SPARK-5429][SQL] Use javaXML plan serialization for Hive golden answers on Hive 0.13.1
Liang-Chi Hsieh <viirya@gmail.com>
2015-01-29 15:28:22 -0800
Commit: bce0ba1, github.com/apache/spark/pull/4223
[SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.
Reynold Xin <rxin@databricks.com>
2015-01-29 15:13:09 -0800
Commit: 7156322, github.com/apache/spark/pull/4276
[SPARK-5466] Add explicit guava dependencies where needed.
Marcelo Vanzin <vanzin@cloudera.com>
2015-01-29 13:00:45 -0800
Commit: f9e5694, github.com/apache/spark/pull/4272
[SPARK-5477] refactor stat.py
Xiangrui Meng <meng@databricks.com>
2015-01-29 10:11:44 -0800
Commit: a3dc618, github.com/apache/spark/pull/4266
[SQL] Various DataFrame DSL update.
Reynold Xin <rxin@databricks.com>
2015-01-29 00:01:10 -0800
Commit: 5ad78f6, github.com/apache/spark/pull/4260
[SPARK-3977] Conversion methods for BlockMatrix to other Distributed Matrices
Burak Yavuz <brkyvz@gmail.com>
2015-01-28 23:42:07 -0800
Commit: a63be1a, github.com/apache/spark/pull/4256
[SPARK-5445][SQL] Made DataFrame dsl usable in Java
Reynold Xin <rxin@databricks.com>
2015-01-28 19:10:32 -0800
Commit: 5b9760d, github.com/apache/spark/pull/4241
[SPARK-5430] move treeReduce and treeAggregate from mllib to core
Xiangrui Meng <meng@databricks.com>
2015-01-28 17:26:03 -0800
Commit: 4ee79c7, github.com/apache/spark/pull/4228
[SPARK-4586][MLLIB] Python API for ML pipeline and parameters
Xiangrui Meng <meng@databricks.com>, Davies Liu <davies@databricks.com>
2015-01-28 17:14:23 -0800
Commit: e80dc1c, github.com/apache/spark/pull/4151
[SPARK-5441][pyspark] Make SerDeUtil PairRDD to Python conversions more robust
Michael Nazario <mnazario@palantir.com>
2015-01-28 13:55:01 -0800
Commit: e023112, github.com/apache/spark/pull/4236
[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible
Yandu Oppacher <yandu.oppacher@jadedpixel.com>, Davies Liu <davies@databricks.com>
2015-01-28 13:48:06 -0800
Commit: 3bead67, github.com/apache/spark/pull/3255
[SPARK-5417] Remove redundant executor-id set() call
Ryan Williams <ryan.blake.williams@gmail.com>
2015-01-28 13:04:52 -0800
Commit: a731314, github.com/apache/spark/pull/4213
[SPARK-5434] [EC2] Preserve spaces in EC2 path
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-01-28 12:56:03 -0800
Commit: d44ee43, github.com/apache/spark/pull/4224
[SPARK-5437] Fix DriverSuite and SparkSubmitSuite timeout issues
Andrew Or <andrew@databricks.com>
2015-01-28 12:52:31 -0800
Commit: 84b6ecd, github.com/apache/spark/pull/4230
[SPARK-4955]With executor dynamic scaling enabled, executor should be added or killed in yarn-cluster mode.
lianhuiwang <lianhuiwang09@gmail.com>
2015-01-28 12:50:57 -0800
Commit: 81f8f34, github.com/apache/spark/pull/3962
[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
Michael Nazario <mnazario@palantir.com>
2015-01-28 12:47:12 -0800
Commit: 456c11f, github.com/apache/spark/pull/4237
SPARK-1934 [CORE] "this" reference escape to "selectorThread" during construction in ConnectionManager
Sean Owen <sowen@cloudera.com>
2015-01-28 12:44:35 -0800
Commit: 9b18009, github.com/apache/spark/pull/4225
[SPARK-5188][BUILD] make-distribution.sh should support curl, not only wget to get Tachyon
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-28 12:43:22 -0800
Commit: e902dc4, github.com/apache/spark/pull/3988
SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs
Sandy Ryza <sandy@cloudera.com>
2015-01-28 12:41:23 -0800
Commit: 406f6d3, github.com/apache/spark/pull/4251
[SPARK-5447][SQL] Replaced reference to SchemaRDD with DataFrame.
Reynold Xin <rxin@databricks.com>
2015-01-28 12:10:01 -0800
Commit: c8e934e, github.com/apache/spark/pull/4242
[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly
Winston Chen <wchen@quid.com>
2015-01-28 11:08:44 -0800
Commit: 453d799, github.com/apache/spark/pull/4146
[SPARK-5291][CORE] Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and SparkListenerExecutorRemoved
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-28 11:02:51 -0800
Commit: 0b35fcd, github.com/apache/spark/pull/4082
[SPARK-3974][MLlib] Distributed Block Matrix Abstractions
Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>, Burak Yavuz <brkyvz@dn51t42l.sunet>, Burak Yavuz <brkyvz@dn51t4rd.sunet>, Burak Yavuz <brkyvz@dn0a221430.sunet>
2015-01-28 10:06:37 -0800
Commit: eeb53bf, github.com/apache/spark/pull/3200
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-01-28 02:15:14 -0800
Commit: 622ff09, github.com/apache/spark/pull/1480
[SPARK-5415] bump sbt version to 0.13.7
Ryan Williams <ryan.blake.williams@gmail.com>
2015-01-28 02:13:06 -0800
Commit: 661d3f9, github.com/apache/spark/pull/4211
[SPARK-4809] Rework Guava library shading.
Marcelo Vanzin <vanzin@cloudera.com>
2015-01-28 00:29:29 -0800
Commit: 37a5e27, github.com/apache/spark/pull/3658
[SPARK-5097][SQL] Test cases for DataFrame expressions.
Reynold Xin <rxin@databricks.com>
2015-01-27 18:10:49 -0800
Commit: d743732, github.com/apache/spark/pull/4235
[SPARK-5097][SQL] DataFrame
Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
2015-01-27 16:08:24 -0800
Commit: 119f45d, github.com/apache/spark/pull/4173
SPARK-5199. FS read metrics should support CombineFileSplits and track bytes from all FSs
Sandy Ryza <sandy@cloudera.com>
2015-01-27 15:42:55 -0800
Commit: b1b35ca, github.com/apache/spark/pull/4050
[MLlib] fix python example of ALS in guide
Davies Liu <davies@databricks.com>
2015-01-27 15:33:01 -0800
Commit: fdaad4e, github.com/apache/spark/pull/4226
SPARK-5308 [BUILD] MD5 / SHA1 hash format doesn't match standard Maven output
Sean Owen <sowen@cloudera.com>
2015-01-27 10:22:50 -0800
Commit: ff356e2, github.com/apache/spark/pull/4161
[SPARK-5321] Support for transposing local matrices
Burak Yavuz <brkyvz@gmail.com>
2015-01-27 01:46:17 -0800
Commit: 9142674, github.com/apache/spark/pull/4109
[SPARK-5419][Mllib] Fix the logic in Vectors.sqdist
Liang-Chi Hsieh <viirya@gmail.com>
2015-01-27 01:29:14 -0800
Commit: 7b0ed79, github.com/apache/spark/pull/4217
[SPARK-3726] [MLlib] Allow sampling_rate not equal to 1.0 in RandomForests
MechCoder <manojkumarsivaraj334@gmail.com>
2015-01-26 19:46:17 -0800
Commit: d6894b1, github.com/apache/spark/pull/4073
[SPARK-5119] java.lang.ArrayIndexOutOfBoundsException on trying to train...
lewuathe <lewuathe@me.com>
2015-01-26 18:03:21 -0800
Commit: f2ba5c6, github.com/apache/spark/pull/3975
[SPARK-5052] Add common/base classes to fix guava methods signatures.
Elmer Garduno <elmerg@google.com>
2015-01-26 17:40:48 -0800
Commit: 661e0fc, github.com/apache/spark/pull/3874
SPARK-960 [CORE] [TEST] JobCancellationSuite "two jobs sharing the same stage" is broken
Sean Owen <sowen@cloudera.com>
2015-01-26 14:32:27 -0800
Commit: 0497ea5, github.com/apache/spark/pull/4180
Fix command spaces issue in make-distribution.sh
David Y. Ross <dyross@gmail.com>
2015-01-26 14:26:10 -0800
Commit: b38034e, github.com/apache/spark/pull/4126
SPARK-4147 [CORE] Reduce log4j dependency
Sean Owen <sowen@cloudera.com>
2015-01-26 14:23:42 -0800
Commit: 54e7b45, github.com/apache/spark/pull/4190
[SPARK-5339][BUILD] build/mvn doesn't work because of invalid URL for maven's tgz.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-26 13:07:49 -0800
Commit: c094c73, github.com/apache/spark/pull/4124
[SPARK-5355] use j.u.c.ConcurrentHashMap instead of TrieMap
Davies Liu <davies@databricks.com>
2015-01-26 12:51:32 -0800
Commit: 1420931, github.com/apache/spark/pull/4208
[SPARK-5384][mllib] Vectors.sqdist returns inconsistent results for sparse/dense vectors when the vectors have different lengths
Yuhao Yang <hhbyyh@gmail.com>
2015-01-25 22:18:09 -0800
Commit: 8125168, github.com/apache/spark/pull/4183
[SPARK-5268] don't stop CoarseGrainedExecutorBackend for irrelevant DisassociatedEvent
CodingCat <zhunansjtu@gmail.com>
2015-01-25 19:28:53 -0800
Commit: 8df9435, github.com/apache/spark/pull/4063
SPARK-4430 [STREAMING] [TEST] Apache RAT Checks fail spuriously on test files
Sean Owen <sowen@cloudera.com>
2015-01-25 19:16:44 -0800
Commit: 0528b85, github.com/apache/spark/pull/4189
[SPARK-5326] Show fetch wait time as optional metric in the UI
Kay Ousterhout <kayousterhout@gmail.com>
2015-01-25 16:48:26 -0800
Commit: fc2168f, github.com/apache/spark/pull/4110
[SPARK-5344][WebUI] HistoryServer cannot recognize that inprogress file was renamed to completed file
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-25 15:34:20 -0800
Commit: 8f5c827, github.com/apache/spark/pull/4132
SPARK-4506 [DOCS] Addendum: Update more docs to reflect that standalone works in cluster mode
Sean Owen <sowen@cloudera.com>
2015-01-25 15:25:05 -0800
Commit: 9f64357, github.com/apache/spark/pull/4160
SPARK-5382: Use SPARK_CONF_DIR in spark-class if it is defined
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-01-25 15:15:09 -0800
Commit: 1c30afd, github.com/apache/spark/pull/4179
SPARK-3782 [CORE] Direct use of log4j in AkkaUtils interferes with certain logging configurations
Sean Owen <sowen@cloudera.com>
2015-01-25 15:11:57 -0800
Commit: 383425a, github.com/apache/spark/pull/4184
SPARK-3852 [DOCS] Document spark.driver.extra* configs
Sean Owen <sowen@cloudera.com>
2015-01-25 15:08:05 -0800
Commit: c586b45, github.com/apache/spark/pull/4185
[SPARK-5402] log executor ID at executor-construction time
Ryan Williams <ryan.blake.williams@gmail.com>
2015-01-25 14:20:02 -0800
Commit: aea2548, github.com/apache/spark/pull/4195
[SPARK-5401] set executor ID before creating MetricsSystem
Ryan Williams <ryan.blake.williams@gmail.com>
2015-01-25 14:17:59 -0800
Commit: 2d9887b, github.com/apache/spark/pull/4194
Add comment about defaultMinPartitions
Idan Zalzberg <idanzalz@gmail.com>
2015-01-25 11:28:05 -0800
Commit: 412a58e, github.com/apache/spark/pull/4102
Closes #4157
Reynold Xin <rxin@databricks.com>
2015-01-25 00:24:59 -0800
Commit: d22ca1e
[SPARK-5214][Test] Add a test to demonstrate EventLoop can be stopped in the event thread
zsxwing <zsxwing@gmail.com>
2015-01-24 11:00:35 -0800
Commit: 0d1e67e, github.com/apache/spark/pull/4174
[SPARK-5058] Part 2. Typos and broken URL
Jongyoul Lee <jongyoul@gmail.com>
2015-01-23 23:34:11 -0800
Commit: 09e09c5, github.com/apache/spark/pull/4172
[SPARK-5351][GraphX] Do not use Partitioner.defaultPartitioner as a partitioner of EdgeRDDImp...
Takeshi Yamamuro <linguin.m.s@gmail.com>
2015-01-23 19:25:15 -0800
Commit: e224dbb, github.com/apache/spark/pull/4136
[SPARK-5063] More helpful error messages for several invalid operations
Josh Rosen <joshrosen@databricks.com>
2015-01-23 17:53:15 -0800
Commit: cef1f09, github.com/apache/spark/pull/3884
[SPARK-3541][MLLIB] New ALS implementation with improved storage
Xiangrui Meng <meng@databricks.com>
2015-01-22 22:09:13 -0800
Commit: ea74365, github.com/apache/spark/pull/3720
[SPARK-5315][Streaming] Fix reduceByWindow Java API not work bug
jerryshao <saisai.shao@intel.com>
2015-01-22 22:04:21 -0800
Commit: e0f7fb7, github.com/apache/spark/pull/4104
[SPARK-5233][Streaming] Fix error replaying of WAL introduced bug
jerryshao <saisai.shao@intel.com>
2015-01-22 21:58:53 -0800
Commit: 3c3fa63, github.com/apache/spark/pull/4032
SPARK-5370. [YARN] Remove some unnecessary synchronization in YarnAlloca...
Sandy Ryza <sandy@cloudera.com>
2015-01-22 13:49:35 -0600
Commit: 820ce03, github.com/apache/spark/pull/4164
[SPARK-5365][MLlib] Refactor KMeans to reduce redundant data
Liang-Chi Hsieh <viirya@gmail.com>
2015-01-22 08:16:35 -0800
Commit: 246111d, github.com/apache/spark/pull/4159
[SPARK-5147][Streaming] Delete the received data WAL log periodically
Tathagata Das <tathagata.das1565@gmail.com>, jerryshao <saisai.shao@intel.com>
2015-01-21 23:41:44 -0800
Commit: 3027f06, github.com/apache/spark/pull/4149
[SPARK-5317]Set BoostingStrategy.defaultParams With Enumeration Algo.Classification or Algo.Regression
Basin <jpsachilles@gmail.com>
2015-01-21 23:06:34 -0800
Commit: fcb3e18, github.com/apache/spark/pull/4103
[SPARK-3424][MLLIB] cache point distances during k-means|| init
Xiangrui Meng <meng@databricks.com>
2015-01-21 21:20:31 -0800
Commit: ca7910d, github.com/apache/spark/pull/4144
[SPARK-5202] [SQL] Add hql variable substitution support
Cheng Hao <hao.cheng@intel.com>
2015-01-21 17:34:18 -0800
Commit: 27bccc5, github.com/apache/spark/pull/4003
[SPARK-5355] make SparkConf thread-safe
Davies Liu <davies@databricks.com>
2015-01-21 16:51:42 -0800
Commit: 9bad062, github.com/apache/spark/pull/4143
[SPARK-4984][CORE][WEBUI] Adding a pop-up containing the full job description when it is very long
wangfei <wangfei1@huawei.com>
2015-01-21 15:27:42 -0800
Commit: 3be2a88, github.com/apache/spark/pull/3819
[SQL] [Minor] Remove deprecated parquet tests
Cheng Lian <lian@databricks.com>
2015-01-21 14:38:10 -0800
Commit: ba19689, github.com/apache/spark/pull/4116
Revert "[SPARK-5244] [SQL] add coalesce() in sql parser"
Josh Rosen <joshrosen@databricks.com>
2015-01-21 14:27:43 -0800
Commit: b328ac6
[SPARK-5009] [SQL] Long keyword support in SQL Parsers
Cheng Hao <hao.cheng@intel.com>
2015-01-21 13:05:56 -0800
Commit: 8361078, github.com/apache/spark/pull/3926
[SPARK-5244] [SQL] add coalesce() in sql parser
Daoyuan Wang <daoyuan.wang@intel.com>
2015-01-21 12:59:41 -0800
Commit: 812d367, github.com/apache/spark/pull/4040
[SPARK-5064][GraphX] Add numEdges upperbound validation for R-MAT graph generator to prevent infinite loop
Kenji Kikushima <kikushima.kenji@lab.ntt.co.jp>
2015-01-21 12:34:00 -0800
Commit: 3ee3ab5, github.com/apache/spark/pull/3950
[SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
nate.crosswhite <nate.crosswhite@stresearch.com>, nxwhite-str <nxwhite-str@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
2015-01-21 10:32:10 -0800
Commit: 7450a99, github.com/apache/spark/pull/3610
[MLlib] [SPARK-5301] Missing conversions and operations on IndexedRowMatrix and CoordinateMatrix
Reza Zadeh <reza@databricks.com>
2015-01-21 09:48:38 -0800
Commit: aa1e22b, github.com/apache/spark/pull/4089
SPARK-1714. Take advantage of AMRMClient APIs to simplify logic in YarnA...
Sandy Ryza <sandy@cloudera.com>
2015-01-21 10:31:54 -0600
Commit: 2eeada3, github.com/apache/spark/pull/3765
[SPARK-5336][YARN]spark.executor.cores must not be less than spark.task.cpus
WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
2015-01-21 09:42:30 -0600
Commit: 8c06a5f, github.com/apache/spark/pull/4123
[SPARK-5297][Streaming] Fix Java file stream type erasure problem
jerryshao <saisai.shao@intel.com>
2015-01-20 23:37:47 -0800
Commit: 424d8c6, github.com/apache/spark/pull/4101
[HOTFIX] Update pom.xml to pull MapR's Hadoop version 2.4.1.
Kannan Rajah <rkannan82@gmail.com>
2015-01-20 23:34:04 -0800
Commit: ec5b0f2, github.com/apache/spark/pull/4108
[SPARK-5275] [Streaming] include python source code
Davies Liu <davies@databricks.com>
2015-01-20 22:44:58 -0800
Commit: bad6c57, github.com/apache/spark/pull/4128
[SPARK-5294][WebUI] Hide tables in AllStagePages for "Active Stages, Completed Stages and Failed Stages" when they are empty
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-20 16:40:46 -0800
Commit: 9a151ce, github.com/apache/spark/pull/4083
[SPARK-5186] [MLLIB] Vector.equals and Vector.hashCode are very inefficient
Yuhao Yang <hhbyyh@gmail.com>, Yuhao Yang <yuhao@yuhaodevbox.sh.intel.com>
2015-01-20 15:20:20 -0800
Commit: 2f82c84, github.com/apache/spark/pull/3997
[SPARK-5323][SQL] Remove Row's Seq inheritance.
Reynold Xin <rxin@databricks.com>
2015-01-20 15:16:14 -0800
Commit: d181c2a, github.com/apache/spark/pull/4115
[SPARK-5287][SQL] Add defaultSizeOf to every data type.
Yin Huai <yhuai@databricks.com>
2015-01-20 13:26:36 -0800
Commit: bc20a52, github.com/apache/spark/pull/4081
SPARK-5019 [MLlib] - GaussianMixtureModel exposes instances of MultivariateGauss...
Travis Galoppo <tjg2107@columbia.edu>
2015-01-20 12:58:11 -0800
Commit: 23e2554, github.com/apache/spark/pull/4088
[SPARK-5329][WebUI] UIWorkloadGenerator should stop SparkContext.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-20 12:40:55 -0800
Commit: 769aced, github.com/apache/spark/pull/4112
SPARK-4660: Use correct class loader in JavaSerializer (copy of PR #3840...
Jacek Lewandowski <lewandowski.jacek@gmail.com>
2015-01-20 12:38:01 -0800
Commit: c93a57f, github.com/apache/spark/pull/4113
[SQL][Minor] Refactors deeply nested FP style code in BooleanSimplification
Cheng Lian <lian@databricks.com>
2015-01-20 11:20:14 -0800
Commit: 8140802, github.com/apache/spark/pull/4091
[SPARK-5333][Mesos] MesosTaskLaunchData occurs BufferUnderflowException
Jongyoul Lee <jongyoul@gmail.com>
2015-01-20 10:17:29 -0800
Commit: 9d9294a, github.com/apache/spark/pull/4119
[SPARK-4803] [streaming] Remove duplicate RegisterReceiver message
Ilayaperumal Gopinathan <igopinathan@pivotal.io>
2015-01-20 01:41:10 -0800
Commit: 4afad9c, github.com/apache/spark/pull/3648
[SQL][minor] Add a log4j file for catalyst test.
Reynold Xin <rxin@databricks.com>
2015-01-20 00:55:25 -0800
Commit: debc031, github.com/apache/spark/pull/4117
SPARK-5270 [CORE] Provide isEmpty() function in RDD API
Sean Owen <sowen@cloudera.com>
2015-01-19 22:50:44 -0800
Commit: 306ff18, github.com/apache/spark/pull/4074
[SPARK-5214][Core] Add EventLoop and change DAGScheduler to an EventLoop
zsxwing <zsxwing@gmail.com>
2015-01-19 18:15:51 -0800
Commit: e69fb8c, github.com/apache/spark/pull/4016
[SPARK-4504][Examples] fix run-example failure if multiple assembly jars exist
Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2015-01-19 11:58:16 -0800
Commit: 74de94e, github.com/apache/spark/pull/3377
[SPARK-5286][SQL] Fail to drop an invalid table when using the data source API
Yin Huai <yhuai@databricks.com>
2015-01-19 10:45:29 -0800
Commit: 2604bc3, github.com/apache/spark/pull/4076
[SPARK-5284][SQL] Insert into Hive throws NPE when an inner complex type field has a null value
Yin Huai <yhuai@databricks.com>
2015-01-19 10:44:12 -0800
Commit: cd5da42, github.com/apache/spark/pull/4077
[SPARK-5282][mllib]: RowMatrix easily gets int overflow in the memory size warning
Yuhao Yang <hhbyyh@gmail.com>
2015-01-19 10:10:15 -0800
Commit: 4432568, github.com/apache/spark/pull/4069
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-01-19 02:05:24 -0800
Commit: 1ac1c1d, github.com/apache/spark/pull/3584
[SPARK-5088] Use spark-class for running executors directly
Jongyoul Lee <jongyoul@gmail.com>
2015-01-19 02:01:56 -0800
Commit: 4a4f9cc, github.com/apache/spark/pull/3897
[SPARK-3288] All fields in TaskMetrics should be private and use getters/setters
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-01-19 01:32:22 -0800
Commit: 3453d57, github.com/apache/spark/pull/4020
SPARK-5217 Spark UI should report pending stages during job execution on AllStagesPage.
Prashant Sharma <prashant.s@imaginea.com>
2015-01-19 01:28:42 -0800
Commit: 851b6a9, github.com/apache/spark/pull/4043
[SQL] fix typo in class description
Jacky Li <jacky.likun@gmail.com>
2015-01-18 23:59:08 -0800
Commit: 7dbf1fd, github.com/apache/spark/pull/4100
[SQL][minor] Put DataTypes.java in java dir.
Reynold Xin <rxin@databricks.com>
2015-01-18 16:35:40 -0800
Commit: 1955645, github.com/apache/spark/pull/4097
[SQL][Minor] Update sql doc according to data type APIs changes
scwf <wangfei1@huawei.com>
2015-01-18 11:03:13 -0800
Commit: 1a200a3, github.com/apache/spark/pull/4095
[SPARK-5279][SQL] Use java.math.BigDecimal as the exposed Decimal type.
Reynold Xin <rxin@databricks.com>
2015-01-18 11:01:42 -0800
Commit: 1727e08, github.com/apache/spark/pull/4092
[HOTFIX]: Minor clean up regarding skipped artifacts in build files.
Patrick Wendell <patrick@databricks.com>
2015-01-17 23:15:12 -0800
Commit: ad16da1, github.com/apache/spark/pull/4080
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <patrick@databricks.com>
2015-01-17 20:39:54 -0800
Commit: e12b5b6, github.com/apache/spark/pull/681
[SQL][Minor] Added comments and examples to explain BooleanSimplification
Reynold Xin <rxin@databricks.com>
2015-01-17 17:35:53 -0800
Commit: e7884bc, github.com/apache/spark/pull/4090
[SPARK-5096] Use sbt tasks instead of vals to get hadoop version
Michael Armbrust <michael@databricks.com>
2015-01-17 17:03:07 -0800
Commit: 6999910, github.com/apache/spark/pull/3905
[SPARK-4937][SQL] Comment for the new optimization rules in `BooleanSimplification`
scwf <wangfei1@huawei.com>
2015-01-17 15:51:24 -0800
Commit: c1f3c27, github.com/apache/spark/pull/4086
[SQL][minor] Improved Row documentation.
Reynold Xin <rxin@databricks.com>
2015-01-17 00:11:08 -0800
Commit: f3bfc76, github.com/apache/spark/pull/4085
[SPARK-5193][SQL] Remove Spark SQL Java-specific API.
Reynold Xin <rxin@databricks.com>
2015-01-16 21:09:06 -0800
Commit: 61b427d, github.com/apache/spark/pull/4065
[SPARK-4937][SQL] Adding optimization to simplify the And, Or condition in spark sql
scwf <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
2015-01-16 14:01:22 -0800
Commit: ee1c1f3, github.com/apache/spark/pull/3778
[SPARK-733] Add documentation on use of accumulators in lazy transformation
Ilya Ganelin <ilya.ganelin@capitalone.com>
2015-01-16 13:25:17 -0800
Commit: fd3a8a1, github.com/apache/spark/pull/4022
[SPARK-4923][REPL] Add Developer API to REPL to allow re-publishing the REPL jar
Chip Senkbeil <rcsenkbe@us.ibm.com>, Chip Senkbeil <chip.senkbeil@gmail.com>
2015-01-16 12:56:40 -0800
Commit: d05c9ee, github.com/apache/spark/pull/4034
[WebUI] Fix collapse of WebUI layout
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-16 12:19:08 -0800
Commit: ecf943d, github.com/apache/spark/pull/3995
[SPARK-5231][WebUI] History Server shows wrong job submission time.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-16 10:05:11 -0800
Commit: e8422c5, github.com/apache/spark/pull/4029
[DOCS] Fix typo in return type of cogroup
Sean Owen <sowen@cloudera.com>
2015-01-16 09:28:44 -0800
Commit: f6b852a, github.com/apache/spark/pull/4072
[SPARK-5201][CORE] deal with int overflow in the ParallelCollectionRDD.slice method
Ye Xianjin <advancedxy@gmail.com>
2015-01-16 09:20:53 -0800
Commit: e200ac8, github.com/apache/spark/pull/4002
[SPARK-1507][YARN]specify # cores for ApplicationMaster
WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
2015-01-16 09:16:56 -0800
Commit: 2be82b1, github.com/apache/spark/pull/4018
[SPARK-4092] [CORE] Fix InputMetrics for coalesce'd Rdds
Kostas Sakellis <kostas@cloudera.com>
2015-01-15 18:48:39 -0800
Commit: a79a9f9, github.com/apache/spark/pull/3120
[SPARK-4857] [CORE] Adds Executor membership events to SparkListener
Kostas Sakellis <kostas@cloudera.com>
2015-01-15 17:53:42 -0800
Commit: 96c2c71, github.com/apache/spark/pull/3711
[Minor] Fix tiny typo in BlockManager
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-15 17:07:44 -0800
Commit: 65858ba, github.com/apache/spark/pull/4046
[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.
Reynold Xin <rxin@databricks.com>
2015-01-15 16:15:12 -0800
Commit: 1881431, github.com/apache/spark/pull/4056
[SPARK-5224] [PySpark] improve performance of parallelize list/ndarray
Davies Liu <davies@databricks.com>
2015-01-15 11:40:41 -0800
Commit: 3c8650c, github.com/apache/spark/pull/4024
[SPARK-5193][SQL] Tighten up HiveContext API
Reynold Xin <rxin@databricks.com>
2015-01-14 20:31:02 -0800
Commit: 4b325c7, github.com/apache/spark/pull/4054
[SPARK-5254][MLLIB] remove developers section from spark.ml guide
Xiangrui Meng <meng@databricks.com>
2015-01-14 18:54:17 -0800
Commit: 6abc45e, github.com/apache/spark/pull/4053
[SPARK-5193][SQL] Tighten up SQLContext API
Reynold Xin <rxin@databricks.com>
2015-01-14 18:36:15 -0800
Commit: cfa397c, github.com/apache/spark/pull/4049
[SPARK-5254][MLLIB] Update the user guide to position spark.ml better
Xiangrui Meng <meng@databricks.com>
2015-01-14 17:50:33 -0800
Commit: 13d2406, github.com/apache/spark/pull/4052
[SPARK-5234][ml]examples for ml don't have sparkContext.stop
Yuhao Yang <yuhao@yuhaodevbox.sh.intel.com>
2015-01-14 11:53:43 -0800
Commit: 76389c5, github.com/apache/spark/pull/4044
[SPARK-5235] Make SQLConf Serializable
Alex Baretta <alexbaretta@gmail.com>
2015-01-14 11:51:55 -0800
Commit: 2fd7f72, github.com/apache/spark/pull/4031
[SPARK-4014] Add TaskContext.attemptNumber and deprecate TaskContext.attemptId
Josh Rosen <joshrosen@databricks.com>
2015-01-14 11:45:40 -0800
Commit: 259936b, github.com/apache/spark/pull/3849
[SPARK-5228][WebUI] Hide tables for "Active Jobs/Completed Jobs/Failed Jobs" when they are empty
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-14 11:10:29 -0800
Commit: 9d4449c, github.com/apache/spark/pull/4028
[SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing
MechCoder <manojkumarsivaraj334@gmail.com>
2015-01-14 11:03:11 -0800
Commit: 5840f54, github.com/apache/spark/pull/4025
[SQL] some comments fix for GROUPING SETS
Daoyuan Wang <daoyuan.wang@intel.com>
2015-01-14 09:50:01 -0800
Commit: 38bdc99, github.com/apache/spark/pull/4000
[SPARK-5211][SQL]Restore HiveMetastoreTypes.toDataType
Yin Huai <yhuai@databricks.com>
2015-01-14 09:47:30 -0800
Commit: 81f72a0, github.com/apache/spark/pull/4026
[SPARK-5248] [SQL] move sql.types.decimal.Decimal to sql.types.Decimal
Daoyuan Wang <daoyuan.wang@intel.com>
2015-01-14 09:36:59 -0800
Commit: a3f7421, github.com/apache/spark/pull/4041
[SPARK-5167][SQL] Move Row into sql package and make it usable for Java.
Reynold Xin <rxin@databricks.com>
2015-01-14 00:38:55 -0800
Commit: d5eeb35, github.com/apache/spark/pull/4030
[SPARK-5123][SQL] Reconcile Java/Scala API for data types.
Reynold Xin <rxin@databricks.com>
2015-01-13 17:16:41 -0800
Commit: f996909, github.com/apache/spark/pull/3958
[SPARK-5168] Make SQLConf a field rather than mixin in SQLContext
Reynold Xin <rxin@databricks.com>
2015-01-13 13:30:35 -0800
Commit: 14e3f11, github.com/apache/spark/pull/3965
[SPARK-4912][SQL] Persistent tables for the Spark SQL data sources api
Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
2015-01-13 13:01:27 -0800
Commit: 6463e0b, github.com/apache/spark/pull/3960
[SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib
Davies Liu <davies@databricks.com>
2015-01-13 12:50:31 -0800
Commit: 8ead999, github.com/apache/spark/pull/4023
[SPARK-5131][Streaming][DOC]: There is a discrepancy in WAL implementation and configuration doc.
uncleGen <hustyugm@gmail.com>
2015-01-13 10:07:19 -0800
Commit: 39e333e, github.com/apache/spark/pull/3930
[SPARK-4697][YARN]System properties should override environment variables
WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
2015-01-13 09:43:48 -0800
Commit: 9dea64e, github.com/apache/spark/pull/3557
[SPARK-5006][Deploy]spark.port.maxRetries doesn't work
WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
2015-01-13 09:28:21 -0800
Commit: f7741a9, github.com/apache/spark/pull/3841
[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple
Gabe Mulley <gabe@edx.org>
2015-01-12 21:44:51 -0800
Commit: 1e42e96, github.com/apache/spark/pull/3978
[SPARK-5049][SQL] Fix ordering of partition columns in ParquetTableScan
Michael Armbrust <michael@databricks.com>
2015-01-12 15:19:09 -0800
Commit: 5d9fa55, github.com/apache/spark/pull/3990
[SPARK-4999][Streaming] Change storeInBlockManager to false by default
jerryshao <saisai.shao@intel.com>
2015-01-12 13:14:44 -0800
Commit: 3aed305, github.com/apache/spark/pull/3906
SPARK-5172 [BUILD] spark-examples-***.jar shades a wrong Hadoop distribution
Sean Owen <sowen@cloudera.com>
2015-01-12 12:15:34 -0800
Commit: aff49a3, github.com/apache/spark/pull/3992
[SPARK-5078] Optionally read from SPARK_LOCAL_HOSTNAME
Michael Armbrust <michael@databricks.com>
2015-01-12 11:57:59 -0800
Commit: a3978f3, github.com/apache/spark/pull/3893
SPARK-4159 [BUILD] Addendum: improve running of single test after enabling Java tests
Sean Owen <sowen@cloudera.com>
2015-01-12 11:00:56 -0800
Commit: 13e610b, github.com/apache/spark/pull/3993
[SPARK-5102][Core]subclass of MapStatus needs to be registered with Kryo
lianhuiwang <lianhuiwang09@gmail.com>
2015-01-12 10:57:12 -0800
Commit: ef9224e, github.com/apache/spark/pull/4007
[SPARK-5200] Disable web UI in Hive ThriftServer tests
Josh Rosen <joshrosen@databricks.com>
2015-01-12 10:47:12 -0800
Commit: 82fd38d, github.com/apache/spark/pull/3998
SPARK-5018 [MLlib] [WIP] Make MultivariateGaussian public
Travis Galoppo <tjg2107@columbia.edu>
2015-01-11 21:31:16 -0800
Commit: 2130de9, github.com/apache/spark/pull/3923
[SPARK-4033][Examples]Input of the SparkPi too big causes the emption exception
huangzhaowei <carlmartinmax@gmail.com>
2015-01-11 16:32:47 -0800
Commit: f38ef65, github.com/apache/spark/pull/2874
[SPARK-4951][Core] Fix the issue that a busy executor may be killed
zsxwing <zsxwing@gmail.com>
2015-01-11 16:23:28 -0800
Commit: 6942b97, github.com/apache/spark/pull/3783
[SPARK-5073] spark.storage.memoryMapThreshold have two default value
lewuathe <lewuathe@me.com>
2015-01-11 13:50:42 -0800
Commit: 1656aae, github.com/apache/spark/pull/3900
[SPARK-5032] [graphx] Remove GraphX MIMA exclude for 1.3
Joseph K. Bradley <joseph@databricks.com>
2015-01-10 17:25:39 -0800
Commit: 3313260, github.com/apache/spark/pull/3856
[SPARK-5029][SQL] Enable from follow multiple brackets
scwf <wangfei1@huawei.com>
2015-01-10 17:07:34 -0800
Commit: d22a31f, github.com/apache/spark/pull/3853
[SPARK-4871][SQL] Show sql statement in spark ui when run sql with spark-sql
wangfei <wangfei1@huawei.com>
2015-01-10 17:04:56 -0800
Commit: 92d9a70, github.com/apache/spark/pull/3718
[Minor]Resolve sbt warnings during build (MQTTStreamSuite.scala).
GuoQiang Li <witgo@qq.com>
2015-01-10 15:38:43 -0800
Commit: 8a29dc7, github.com/apache/spark/pull/3989
[SPARK-5181] do not print writing WAL log when WAL is disabled
CodingCat <zhunansjtu@gmail.com>
2015-01-10 15:35:41 -0800
Commit: f0d558b, github.com/apache/spark/pull/3985
[SPARK-4692] [SQL] Support ! boolean logic operator like NOT
YanTangZhai <hakeemzhai@tencent.com>, Michael Armbrust <michael@databricks.com>
2015-01-10 15:05:23 -0800
Commit: 0ca51cc, github.com/apache/spark/pull/3555
[SPARK-5187][SQL] Fix caching of tables with HiveUDFs in the WHERE clause
Michael Armbrust <michael@databricks.com>
2015-01-10 14:25:45 -0800
Commit: 3684fd2, github.com/apache/spark/pull/3987
SPARK-4963 [SQL] Add copy to SQL's Sample operator
Yanbo Liang <yanbohappy@gmail.com>
2015-01-10 14:16:37 -0800
Commit: 77106df, github.com/apache/spark/pull/3827
[SPARK-4861][SQL] Refactor command in spark sql
scwf <wangfei1@huawei.com>
2015-01-10 14:08:04 -0800
Commit: b3e86dc, github.com/apache/spark/pull/3948
[SPARK-4574][SQL] Adding support for defining schema in foreign DDL commands.
scwf <wangfei1@huawei.com>, Yin Huai <yhuai@databricks.com>, Fei Wang <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
2015-01-10 13:53:21 -0800
Commit: 693a323, github.com/apache/spark/pull/3431
[SPARK-4943][SQL] Allow table name having dot for db/catalog
Alex Liu <alex_liu68@yahoo.com>
2015-01-10 13:23:09 -0800
Commit: 4b39fd1, github.com/apache/spark/pull/3941
[SPARK-4925][SQL] Publish Spark SQL hive-thriftserver maven artifact
Alex Liu <alex_liu68@yahoo.com>
2015-01-10 13:19:12 -0800
Commit: 1e56eba, github.com/apache/spark/pull/3766
[SPARK-5141][SQL]CaseInsensitiveMap throws java.io.NotSerializableException
luogankun <luogankun@gmail.com>
2015-01-09 20:38:41 -0800
Commit: 545dfcb, github.com/apache/spark/pull/3944
[SPARK-4406] [MLlib] FIX: Validate k in SVD
MechCoder <manojkumarsivaraj334@gmail.com>
2015-01-09 17:45:18 -0800
Commit: 4554529, github.com/apache/spark/pull/3945
[SPARK-4990][Deploy]to find default properties file, search SPARK_CONF_DIR first
WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
2015-01-09 17:10:02 -0800
Commit: 8782eb9, github.com/apache/spark/pull/3823
[Minor] Fix import order and other coding style
bilna <bilnap@am.amrita.edu>, Bilna P <bilna.p@gmail.com>
2015-01-09 14:45:28 -0800
Commit: 4e1f12d, github.com/apache/spark/pull/3966
[DOC] Fixed Mesos version in doc from 0.18.1 to 0.21.0
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-09 14:40:45 -0800
Commit: ae62872, github.com/apache/spark/pull/3982
[SPARK-4737] Task set manager properly handles serialization errors
mcheah <mcheah@palantir.com>
2015-01-09 14:16:20 -0800
Commit: e0f28e0, github.com/apache/spark/pull/3638
[SPARK-1953][YARN]yarn client mode Application Master memory size is same as driver memory...
WangTaoTheTonic <barneystinson@aliyun.com>
2015-01-09 13:20:32 -0800
Commit: e966452, github.com/apache/spark/pull/3607
[SPARK-5015] [mllib] Random seed for GMM + make test suite deterministic
Joseph K. Bradley <joseph@databricks.com>
2015-01-09 13:00:15 -0800
Commit: 7e8e62a, github.com/apache/spark/pull/3981
[SPARK-3619] Upgrade to Mesos 0.21 to work around MESOS-1688
Jongyoul Lee <jongyoul@gmail.com>
2015-01-09 10:47:08 -0800
Commit: 454fe12, github.com/apache/spark/pull/3934
[SPARK-5145][Mllib] Add BLAS.dsyr and use it in GaussianMixtureEM
Liang-Chi Hsieh <viirya@gmail.com>
2015-01-09 10:27:33 -0800
Commit: e9ca16e, github.com/apache/spark/pull/3949
[SPARK-1143] Separate pool tests into their own suite.
Kay Ousterhout <kayousterhout@gmail.com>
2015-01-09 09:47:06 -0800
Commit: b6aa557, github.com/apache/spark/pull/3967
HOTFIX: Minor improvements to make-distribution.sh
Patrick Wendell <pwendell@gmail.com>
2015-01-09 09:40:18 -0800
Commit: 1790b38, github.com/apache/spark/pull/3973
SPARK-5136 [DOCS] Improve documentation around setting up Spark IntelliJ project
Sean Owen <sowen@cloudera.com>
2015-01-09 09:35:46 -0800
Commit: 547df97, github.com/apache/spark/pull/3952
[Minor] Fix test RetryingBlockFetcherSuite after changed config name
Aaron Davidson <aaron@databricks.com>
2015-01-09 09:20:16 -0800
Commit: b4034c3, github.com/apache/spark/pull/3972
[SPARK-5169][YARN]fetch the correct max attempts
WangTaoTheTonic <barneystinson@aliyun.com>
2015-01-09 08:10:09 -0600
Commit: f3da4bd, github.com/apache/spark/pull/3942
[SPARK-5122] Remove Shark from spark-ec2
Nicholas Chammas <nicholas.chammas@gmail.com>
2015-01-08 17:42:08 -0800
Commit: 167a5ab, github.com/apache/spark/pull/3939
[SPARK-4048] Enhance and extend hadoop-provided profile.
Marcelo Vanzin <vanzin@cloudera.com>
2015-01-08 17:15:13 -0800
Commit: 48cecf6, github.com/apache/spark/pull/2982
[SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...
RJ Nowling <rnowling@gmail.com>
2015-01-08 15:03:43 -0800
Commit: c9c8b21, github.com/apache/spark/pull/3955
[SPARK-4973][CORE] Local directory in the driver of client-mode continues remaining even if application finished when external shuffle is enabled
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-08 13:43:09 -0800
Commit: a00af6b, github.com/apache/spark/pull/3811
SPARK-5148 [MLlib] Make usersOut/productsOut storagelevel in ALS configurable
Fernando Otero (ZeoS) <fotero@gmail.com>
2015-01-08 12:42:54 -0800
Commit: 72df5a3, github.com/apache/spark/pull/3953
Document that groupByKey will OOM for large keys
Eric Moyer <eric_moyer@yahoo.com>
2015-01-08 11:55:23 -0800
Commit: 538f221, github.com/apache/spark/pull/3936
[SPARK-5130][Deploy]Take yarn-cluster as cluster mode in spark-submit
WangTaoTheTonic <barneystinson@aliyun.com>
2015-01-08 11:45:42 -0800
Commit: 0760787, github.com/apache/spark/pull/3929
[Minor] Fix the value represented by spark.executor.id for consistency.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2015-01-08 11:35:56 -0800
Commit: 0a59727, github.com/apache/spark/pull/3812
[SPARK-4989][CORE] avoid wrong eventlog conf cause cluster down in standalone mode
Zhang, Liye <liye.zhang@intel.com>
2015-01-08 10:40:26 -0800
Commit: 06dc4b5, github.com/apache/spark/pull/3824
[SPARK-4917] Add a function to convert into a graph with canonical edges in GraphOps
Takeshi Yamamuro <linguin.m.s@gmail.com>
2015-01-08 09:55:12 -0800
Commit: f825e19, github.com/apache/spark/pull/3760
SPARK-5087. [YARN] Merge yarn.Client and yarn.ClientBase
Sandy Ryza <sandy@cloudera.com>
2015-01-08 09:25:43 -0800
Commit: 8d45834, github.com/apache/spark/pull/3896
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2015-01-07 23:25:56 -0800
Commit: c082385, github.com/apache/spark/pull/3880
[SPARK-5116][MLlib] Add extractor for SparseVector and DenseVector
Shuo Xiang <shuoxiangpub@gmail.com>
2015-01-07 23:22:37 -0800
Commit: c66a976, github.com/apache/spark/pull/3919
[SPARK-5126][Core] Verify Spark urls before creating Actors so that invalid urls can crash the process.
zsxwing <zsxwing@gmail.com>
2015-01-07 23:01:30 -0800
Commit: 2b729d2, github.com/apache/spark/pull/3927
[SPARK-5132][Core]Correct stage Attempt Id key in stageInfofromJson
hushan[胡珊] <hushan@xiaomi.com>
2015-01-07 12:09:12 -0800
Commit: d345ebe, github.com/apache/spark/pull/3932
[SPARK-5128][MLLib] Add commonly used log1pExp API in MLUtils
DB Tsai <dbtsai@alpinenow.com>
2015-01-07 10:13:41 -0800
Commit: 60e2d9e, github.com/apache/spark/pull/3915
[SPARK-2458] Make failed application log visible on History Server
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2015-01-07 07:32:16 -0800
Commit: 6e74ede, github.com/apache/spark/pull/3467
[SPARK-2165][YARN]add support for setting maxAppAttempts in the ApplicationSubmissionContext
WangTaoTheTonic <barneystinson@aliyun.com>
2015-01-07 08:14:39 -0600
Commit: 8fdd489, github.com/apache/spark/pull/3878
[YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA
huangzhaowei <carlmartinmax@gmail.com>
2015-01-07 08:10:42 -0600
Commit: 5fde661, github.com/apache/spark/pull/3771
[SPARK-5099][Mllib] Simplify logistic loss function
Liang-Chi Hsieh <viirya@gmail.com>
2015-01-06 21:23:31 -0800
Commit: e21acc1, github.com/apache/spark/pull/3899
[SPARK-5050][Mllib] Add unit test for sqdist
Liang-Chi Hsieh <viirya@gmail.com>
2015-01-06 14:00:45 -0800
Commit: bb38ebb, github.com/apache/spark/pull/3869
SPARK-5017 [MLlib] - Use SVD to compute determinant and inverse of covariance matrix
Travis Galoppo <tjg2107@columbia.edu>
2015-01-06 13:57:42 -0800
Commit: 4108e5f, github.com/apache/spark/pull/3871
SPARK-4159 [CORE] Maven build doesn't run JUnit test suites
Sean Owen <sowen@cloudera.com>
2015-01-06 12:02:08 -0800
Commit: 4cba6eb, github.com/apache/spark/pull/3651
[Minor] Fix comments for GraphX 2D partitioning strategy
kj-ki <kikushima.kenji@lab.ntt.co.jp>
2015-01-06 09:49:37 -0800
Commit: 5e3ec11, github.com/apache/spark/pull/3904
[SPARK-1600] Refactor FileInputStream tests to remove Thread.sleep() calls and SystemClock usage
Josh Rosen <joshrosen@databricks.com>
2015-01-06 00:31:19 -0800
Commit: a6394bc, github.com/apache/spark/pull/3801
SPARK-4843 [YARN] Squash ExecutorRunnableUtil and ExecutorRunnable
Kostas Sakellis <kostas@cloudera.com>
2015-01-05 23:26:33 -0800
Commit: 451546a, github.com/apache/spark/pull/3696
[SPARK-5040][SQL] Support expressing unresolved attributes using $"attribute name" notation in SQL DSL.
Reynold Xin <rxin@databricks.com>
2015-01-05 15:34:22 -0800
Commit: 04d55d8, github.com/apache/spark/pull/3862
[SPARK-5093] Set spark.network.timeout to 120s consistently.
Reynold Xin <rxin@databricks.com>
2015-01-05 15:19:53 -0800
Commit: bbcba3a, github.com/apache/spark/pull/3903
[SPARK-5089][PYSPARK][MLLIB] Fix vector convert
freeman <the.freeman.lab@gmail.com>
2015-01-05 13:10:59 -0800
Commit: 6c6f325, github.com/apache/spark/pull/3902
[SPARK-4465] runAsSparkUser doesn't affect TaskRunner in Mesos environme...
Jongyoul Lee <jongyoul@gmail.com>
2015-01-05 12:05:09 -0800
Commit: 1c0e7ce, github.com/apache/spark/pull/3741
[SPARK-5057] Log message in failed askWithReply attempts
WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
2015-01-05 11:59:38 -0800
Commit: ce39b34, github.com/apache/spark/pull/3875
[SPARK-4688] Have a single shared network timeout in Spark
Varun Saxena <vsaxena.varun@gmail.com>, varunsaxena <vsaxena.varun@gmail.com>
2015-01-05 10:32:37 -0800
Commit: d3f07fd, github.com/apache/spark/pull/3562
[SPARK-5074][Core] Fix a non-deterministic test failure
zsxwing <zsxwing@gmail.com>
2015-01-04 21:18:33 -0800
Commit: 5c506ce, github.com/apache/spark/pull/3889
[SPARK-5083][Core] Fix a flaky test in TaskResultGetterSuite
zsxwing <zsxwing@gmail.com>
2015-01-04 21:09:21 -0800
Commit: 27e7f5a, github.com/apache/spark/pull/3894
[SPARK-5069][Core] Fix the race condition of TaskSchedulerImpl.dagScheduler
zsxwing <zsxwing@gmail.com>
2015-01-04 21:06:04 -0800
Commit: 6c726a3, github.com/apache/spark/pull/3887
[SPARK-5067][Core] Use '===' to compare well-defined case class
zsxwing <zsxwing@gmail.com>
2015-01-04 21:03:17 -0800
Commit: 7239652, github.com/apache/spark/pull/3886
[SPARK-4835] Disable validateOutputSpecs for Spark Streaming jobs
Josh Rosen <joshrosen@databricks.com>
2015-01-04 20:26:18 -0800
Commit: 939ba1f, github.com/apache/spark/pull/3832
[SPARK-4631] unit test for MQTT
bilna <bilnap@am.amrita.edu>, Bilna P <bilna.p@gmail.com>
2015-01-04 19:37:48 -0800
Commit: e767d7d, github.com/apache/spark/pull/3844
[SPARK-4787] Stop SparkContext if a DAGScheduler init error occurs
Dale <tigerquoll@outlook.com>
2015-01-04 13:28:37 -0800
Commit: 3fddc94, github.com/apache/spark/pull/3809
[SPARK-794][Core] Remove sleep() in ClusterScheduler.stop
Brennon York <brennon.york@capitalone.com>
2015-01-04 12:40:39 -0800
Commit: b96008d, github.com/apache/spark/pull/3851
[SPARK-5058] Updated broken links
sigmoidanalytics <mayur@sigmoidanalytics.com>
2015-01-03 19:46:08 -0800
Commit: 342612b, github.com/apache/spark/pull/3877
Fixed typos in streaming-kafka-integration.md
Akhil Das <akhld@darktech.ca>
2015-01-02 15:12:27 -0800
Commit: cdccc26, github.com/apache/spark/pull/3876
[SPARK-3325][Streaming] Add a parameter to the method print in class DStream
Yadong Qi <qiyadong2010@gmail.com>, q00251598 <qiyadong@huawei.com>, Tathagata Das <tathagata.das1565@gmail.com>, wangfei <wangfei1@huawei.com>
2015-01-02 15:09:41 -0800
Commit: bd88b71, github.com/apache/spark/pull/3865
[HOTFIX] Bind web UI to ephemeral port in DriverSuite
Josh Rosen <joshrosen@databricks.com>
2015-01-01 15:03:54 -0800
Commit: 0128398, github.com/apache/spark/pull/3873
[SPARK-5038] Add explicit return type for implicit functions.
Reynold Xin <rxin@databricks.com>
2014-12-31 17:07:47 -0800
Commit: 7749dd6, github.com/apache/spark/pull/3860
SPARK-2757 [BUILD] [STREAMING] Add Mima test for Spark Sink after 1.10 is released
Sean Owen <sowen@cloudera.com>
2014-12-31 16:59:17 -0800
Commit: 4bb1248, github.com/apache/spark/pull/3842
[SPARK-5035] [Streaming] ReceiverMessage trait should extend Serializable
Josh Rosen <joshrosen@databricks.com>
2014-12-31 16:02:47 -0800
Commit: fe6efac, github.com/apache/spark/pull/3857
SPARK-5020 [MLlib] GaussianMixtureModel.predictMembership() should take an RDD only
Travis Galoppo <tjg2107@columbia.edu>
2014-12-31 15:39:58 -0800
Commit: c4f0b4f, github.com/apache/spark/pull/3854
[SPARK-5028][Streaming]Add total received and processed records metrics to Streaming UI
jerryshao <saisai.shao@intel.com>
2014-12-31 14:45:31 -0800
Commit: fdc2aa4, github.com/apache/spark/pull/3852
[SPARK-4790][STREAMING] Fix ReceivedBlockTrackerSuite waits for old file...
Hari Shreedharan <hshreedharan@apache.org>
2014-12-31 14:35:07 -0800
Commit: 3610d3c, github.com/apache/spark/pull/3726
[SPARK-5038][SQL] Add explicit return type for implicit functions in Spark SQL
Reynold Xin <rxin@databricks.com>
2014-12-31 14:25:03 -0800
Commit: c88a3d7, github.com/apache/spark/pull/3859
[HOTFIX] Disable Spark UI in SparkSubmitSuite tests
Josh Rosen <joshrosen@databricks.com>
2014-12-12 12:38:37 -0800
Commit: e24d3a9
SPARK-4547 [MLLIB] OOM when making bins in BinaryClassificationMetrics
Sean Owen <sowen@cloudera.com>
2014-12-31 13:37:04 -0800
Commit: 3d194cc, github.com/apache/spark/pull/3702
[SPARK-4298][Core] - The spark-submit cannot read Main-Class from Manifest.
Brennon York <brennon.york@capitalone.com>
2014-12-31 11:54:10 -0800
Commit: 8e14c5e, github.com/apache/spark/pull/3561
[SPARK-4797] Replace breezeSquaredDistance
Liang-Chi Hsieh <viirya@gmail.com>
2014-12-31 11:50:53 -0800
Commit: 06a9aa5, github.com/apache/spark/pull/3643
[SPARK-1010] Clean up uses of System.setProperty in unit tests
Josh Rosen <joshrosen@databricks.com>
2014-12-30 18:12:20 -0800
Commit: 352ed6b, github.com/apache/spark/pull/3739
[SPARK-4998][MLlib]delete the "train" function
Liu Jiongzhou <ljzzju@163.com>
2014-12-30 15:55:56 -0800
Commit: 035bac8, github.com/apache/spark/pull/3836
[SPARK-4813][Streaming] Fix the issue that ContextWaiter didn't handle 'spurious wakeup'
zsxwing <zsxwing@gmail.com>
2014-12-30 14:39:13 -0800
Commit: 6a89782, github.com/apache/spark/pull/3661
[SPARK-4995] Replace Vector.toBreeze.activeIterator with foreachActive
Jakub Dubovsky <dubovsky@avast.com>
2014-12-30 14:19:07 -0800
Commit: 0f31992, github.com/apache/spark/pull/3846
SPARK-3955 part 2 [CORE] [HOTFIX] Different versions between jackson-mapper-asl and jackson-core-asl
Sean Owen <sowen@cloudera.com>
2014-12-30 14:00:57 -0800
Commit: b239ea1, github.com/apache/spark/pull/3829
[SPARK-4570][SQL]add BroadcastLeftSemiJoinHash
wangxiaojing <u9jing@gmail.com>
2014-12-30 13:54:12 -0800
Commit: 07fa191, github.com/apache/spark/pull/3442
[SPARK-4935][SQL] When hive.cli.print.header is configured, spark-sql aborts if passed an invalid SQL statement
wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2014-12-30 13:44:30 -0800
Commit: 8f29b7c, github.com/apache/spark/pull/3761
[SPARK-4386] Improve performance when writing Parquet files
Michael Davies <Michael.BellDavies@gmail.com>
2014-12-30 13:40:51 -0800
Commit: 7425bec, github.com/apache/spark/pull/3843
[SPARK-4937][SQL] Normalizes conjunctions and disjunctions to eliminate common predicates
Cheng Lian <lian@databricks.com>
2014-12-30 13:38:27 -0800
Commit: 61a99f6, github.com/apache/spark/pull/3784
[SPARK-4928][SQL] Fix: Operator '>,<,>=,<=' with decimal between different precision report error
guowei2 <guowei2@asiainfo.com>
2014-12-30 12:21:00 -0800
Commit: a75dd83, github.com/apache/spark/pull/3767
[SPARK-4930][SQL][DOCS]Update SQL programming guide, CACHE TABLE is eager
luogankun <luogankun@gmail.com>
2014-12-30 12:18:55 -0800
Commit: 2deac74, github.com/apache/spark/pull/3773
[SPARK-4916][SQL][DOCS]Update SQL programming guide about cache section
luogankun <luogankun@gmail.com>
2014-12-30 12:17:49 -0800
Commit: f7a41a0, github.com/apache/spark/pull/3759
[SPARK-4493][SQL] Tests for IsNull / IsNotNull in the ParquetFilterSuite
Cheng Lian <lian@databricks.com>
2014-12-30 12:16:45 -0800
Commit: 19a8802, github.com/apache/spark/pull/3748
[SPARK-4512] [SQL] Unresolved Attribute Exception in Sort By
Cheng Hao <hao.cheng@intel.com>
2014-12-30 12:11:44 -0800
Commit: 53f0a00, github.com/apache/spark/pull/3386
[SPARK-5002][SQL] Use ascending by default when no order is specified in ORDER BY
wangfei <wangfei1@huawei.com>
2014-12-30 12:07:24 -0800
Commit: daac221, github.com/apache/spark/pull/3838
[SPARK-4904] [SQL] Remove the unnecessary code change in Generic UDF
Cheng Hao <hao.cheng@intel.com>
2014-12-30 11:47:08 -0800
Commit: 63b84b7, github.com/apache/spark/pull/3745
[SPARK-4959] [SQL] Attributes are case sensitive when using a select query from a projection
Cheng Hao <hao.cheng@intel.com>
2014-12-30 11:33:47 -0800
Commit: 5595eaa, github.com/apache/spark/pull/3796
[SPARK-4975][SQL] Fix HiveInspectorSuite test failure
scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
2014-12-30 11:30:47 -0800
Commit: 65357f1, github.com/apache/spark/pull/3814
[SQL] enable view test
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-30 11:29:13 -0800
Commit: 94d60b7, github.com/apache/spark/pull/3826
[SPARK-4908][SQL] Prevent multiple concurrent hive native commands
Michael Armbrust <michael@databricks.com>
2014-12-30 11:24:46 -0800
Commit: 480bd1d, github.com/apache/spark/pull/3834
[SPARK-4882] Register PythonBroadcast with Kryo so that PySpark works with KryoSerializer
Josh Rosen <joshrosen@databricks.com>
2014-12-30 09:29:52 -0800
Commit: efa80a53, github.com/apache/spark/pull/3831
[SPARK-4920][UI] add version on master and worker page for standalone mode
Zhang, Liye <liye.zhang@intel.com>
2014-12-30 09:19:47 -0800
Commit: 9077e72, github.com/apache/spark/pull/3769
[SPARK-4972][MLlib] Updated the scala doc for lasso and ridge regression for the change of LeastSquaresGradient
DB Tsai <dbtsai@alpinenow.com>
2014-12-29 17:17:12 -0800
Commit: 040d6f2, github.com/apache/spark/pull/3808
Added setMinCount to Word2Vec.scala
ganonp <ganonp@gmail.com>
2014-12-29 15:31:19 -0800
Commit: 343db39, github.com/apache/spark/pull/3693
SPARK-4156 [MLLIB] EM algorithm for GMMs
Travis Galoppo <tjg2107@columbia.edu>, Travis Galoppo <travis@localhost.localdomain>, tgaloppo <tjg2107@columbia.edu>, FlytxtRnD <meethu.mathew@flytxt.com>
2014-12-29 15:29:15 -0800
Commit: 6cf6fdf, github.com/apache/spark/pull/3022
SPARK-4968: takeOrdered to skip reduce step in case mappers return no partitions
Yash Datta <Yash.Datta@guavus.com>
2014-12-29 13:49:45 -0800
Commit: 9bc0df6, github.com/apache/spark/pull/3830
[SPARK-4409][MLlib] Additional Linear Algebra Utils
Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-12-29 13:24:26 -0800
Commit: 02b55de, github.com/apache/spark/pull/3319
[Minor] Fix a typo of type parameter in JavaUtils.scala
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-29 12:05:08 -0800
Commit: 8d72341, github.com/apache/spark/pull/3789
[SPARK-4946] [CORE] Using AkkaUtils.askWithReply in MapOutputTracker.askTracker to reduce the chance of the communicating problem
YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>
2014-12-29 11:30:54 -0800
Commit: 815de54, github.com/apache/spark/pull/3785
Added LICENSE Header to build/mvn, build/sbt and sbt/sbt
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-29 10:48:53 -0800
Commit: 4cef05e, github.com/apache/spark/pull/3817
[SPARK-4982][DOC] `spark.ui.retainedJobs` description is wrong in Spark UI configuration guide
wangxiaojing <u9jing@gmail.com>
2014-12-29 10:45:14 -0800
Commit: 6645e52, github.com/apache/spark/pull/3818
[SPARK-4966][YARN]The MemoryOverhead value is not set correctly
meiyoula <1039320815@qq.com>
2014-12-29 08:20:30 -0600
Commit: 14fa87b, github.com/apache/spark/pull/3797
[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/scalac
Brennon York <brennon.york@capitalone.com>
2014-12-27 13:25:18 -0800
Commit: a3e51cc, github.com/apache/spark/pull/3707
[SPARK-4952][Core]Handle ConcurrentModificationExceptions in SparkEnv.environmentDetails
GuoQiang Li <witgo@qq.com>
2014-12-26 23:31:29 -0800
Commit: 080ceb7, github.com/apache/spark/pull/3788
[SPARK-4954][Core] add spark version information in log for standalone mode
Zhang, Liye <liye.zhang@intel.com>
2014-12-26 23:23:13 -0800
Commit: 786808a, github.com/apache/spark/pull/3790
[SPARK-3955] Different versions between jackson-mapper-asl and jackson-c...
Jongyoul Lee <jongyoul@gmail.com>
2014-12-26 22:59:34 -0800
Commit: 2483c1e, github.com/apache/spark/pull/3716
HOTFIX: Slight tweak on previous commit.
Patrick Wendell <pwendell@gmail.com>
2014-12-26 22:55:04 -0800
Commit: 82bf4be
[SPARK-3787][BUILD] Assembly jar name is wrong when we build with sbt omitting -Dhadoop.version
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-26 22:52:04 -0800
Commit: de95c57, github.com/apache/spark/pull/3046
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-12-26 22:39:56 -0800
Commit: 534f24b, github.com/apache/spark/pull/3456
SPARK-4971: Fix typo in BlockGenerator comment
CodingCat <zhunansjtu@gmail.com>
2014-12-26 12:03:22 -0800
Commit: fda4331, github.com/apache/spark/pull/3807
[SPARK-4608][Streaming] Reorganize StreamingContext implicit to improve API convenience
zsxwing <zsxwing@gmail.com>
2014-12-25 19:46:05 -0800
Commit: f9ed2b6, github.com/apache/spark/pull/3464
[SPARK-4537][Streaming] Expand StreamingSource to add more metrics
jerryshao <saisai.shao@intel.com>
2014-12-25 19:39:49 -0800
Commit: f205fe4, github.com/apache/spark/pull/3466
[EC2] Update mesos/spark-ec2 branch to branch-1.3
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-12-25 14:16:50 -0800
Commit: ac82785, github.com/apache/spark/pull/3804
[EC2] Update default Spark version to 1.2.0
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-12-25 14:13:12 -0800
Commit: b6b6393, github.com/apache/spark/pull/3793
Fix "Building Spark With Maven" link in README.md
Denny Lee <denny.g.lee@gmail.com>
2014-12-25 14:05:55 -0800
Commit: 08b18c7, github.com/apache/spark/pull/3802
[SPARK-4953][Doc] Fix the description of building Spark with YARN
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-25 07:05:43 -0800
Commit: 11dd993, github.com/apache/spark/pull/3787
[SPARK-4873][Streaming] Use `Future.zip` instead of `Future.flatMap`(for-loop) in WriteAheadLogBasedBlockHandler
zsxwing <zsxwing@gmail.com>
2014-12-24 19:49:41 -0800
Commit: b4d0db8, github.com/apache/spark/pull/3721
SPARK-4297 [BUILD] Build warning fixes omnibus
Sean Owen <sowen@cloudera.com>
2014-12-24 13:32:51 -0800
Commit: 29fabb1, github.com/apache/spark/pull/3157
[SPARK-4881][Minor] Use SparkConf#getBoolean instead of get().toBoolean
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-23 19:14:34 -0800
Commit: 199e59a, github.com/apache/spark/pull/3733
[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`
jbencook <jbenjamincook@gmail.com>, J. Benjamin Cook <jbenjamincook@gmail.com>
2014-12-23 17:46:24 -0800
Commit: fd41eb9, github.com/apache/spark/pull/3764
[SPARK-4606] Send EOF to child JVM when there's no more data to read.
Marcelo Vanzin <vanzin@cloudera.com>
2014-12-23 16:02:59 -0800
Commit: 7e2deb7, github.com/apache/spark/pull/3460
[SPARK-4671][Streaming]Do not replicate streaming block when WAL is enabled
jerryshao <saisai.shao@intel.com>
2014-12-23 15:45:53 -0800
Commit: 3f5f4cc, github.com/apache/spark/pull/3534
[SPARK-4802] [streaming] Remove receiverInfo once receiver is de-registered
Ilayaperumal Gopinathan <igopinathan@pivotal.io>
2014-12-23 15:14:54 -0800
Commit: 10d69e9, github.com/apache/spark/pull/3647
[SPARK-4913] Fix incorrect event log path
Liang-Chi Hsieh <viirya@gmail.com>
2014-12-23 14:58:33 -0800
Commit: 96281cd, github.com/apache/spark/pull/3755
[SPARK-4730][YARN] Warn against deprecated YARN settings
Andrew Or <andrew@databricks.com>
2014-12-23 14:28:36 -0800
Commit: 27c5399, github.com/apache/spark/pull/3590
[SPARK-4914][Build] Cleans lib_managed before compiling with Hive 0.13.1
Cheng Lian <lian@databricks.com>
2014-12-23 12:54:20 -0800
Commit: 395b771, github.com/apache/spark/pull/3756
[SPARK-4932] Add help comments in Analytics
Takeshi Yamamuro <linguin.m.s@gmail.com>
2014-12-23 12:39:41 -0800
Commit: 9c251c5, github.com/apache/spark/pull/3775
[SPARK-4834] [standalone] Clean up application files after app finishes.
Marcelo Vanzin <vanzin@cloudera.com>
2014-12-23 12:02:08 -0800
Commit: dd15536, github.com/apache/spark/pull/3705
[SPARK-4931][Yarn][Docs] Fix the format of running-on-yarn.md
zsxwing <zsxwing@gmail.com>
2014-12-23 11:18:06 -0800
Commit: 2d215ae, github.com/apache/spark/pull/3774
[SPARK-4890] Ignore downloaded EC2 libs
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-12-23 11:12:16 -0800
Commit: 2823c7f, github.com/apache/spark/pull/3770
[Docs] Minor typo fixes
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-12-22 22:54:32 -0800
Commit: 0e532cc, github.com/apache/spark/pull/3772
[SPARK-4907][MLlib] Inconsistent loss and gradient in LeastSquaresGradient compared with R
DB Tsai <dbtsai@alpinenow.com>
2014-12-22 16:42:55 -0800
Commit: a96b727, github.com/apache/spark/pull/3746
[SPARK-4818][Core] Add 'iterator' to reduce memory consumed by join
zsxwing <zsxwing@gmail.com>
2014-12-22 14:26:28 -0800
Commit: c233ab3, github.com/apache/spark/pull/3671
[SPARK-4920][UI]:current spark version in UI is not striking.
genmao.ygm <genmao.ygm@alibaba-inc.com>
2014-12-22 14:14:39 -0800
Commit: de9d7d2, github.com/apache/spark/pull/3763
[Minor] Fix scala doc
Liang-Chi Hsieh <viirya@gmail.com>
2014-12-22 14:13:31 -0800
Commit: a61aa66, github.com/apache/spark/pull/3751
[SPARK-4864] Add documentation to Netty-based configs
Aaron Davidson <aaron@databricks.com>
2014-12-22 13:09:22 -0800
Commit: fbca6b6, github.com/apache/spark/pull/3713
[SPARK-4079] [CORE] Consolidates Errors if a CompressionCodec is not available
Kostas Sakellis <kostas@cloudera.com>
2014-12-22 13:07:01 -0800
Commit: 7c0ed13, github.com/apache/spark/pull/3119
SPARK-4447. Remove layers of abstraction in YARN code no longer needed after dropping yarn-alpha
Sandy Ryza <sandy@cloudera.com>
2014-12-22 12:23:43 -0800
Commit: d62da64, github.com/apache/spark/pull/3652
[SPARK-4733] Add missing parameter comments in ShuffleDependency
Takeshi Yamamuro <linguin.m.s@gmail.com>
2014-12-22 12:19:23 -0800
Commit: fb8e85e, github.com/apache/spark/pull/3594
[Minor] Improve some code in BroadcastTest for short
carlmartin <carlmartinmax@gmail.com>
2014-12-22 12:13:53 -0800
Commit: 1d9788e, github.com/apache/spark/pull/3750
[SPARK-4883][Shuffle] Add a name to the directoryCleaner thread
zsxwing <zsxwing@gmail.com>
2014-12-22 12:11:36 -0800
Commit: 8773705, github.com/apache/spark/pull/3734
[SPARK-4870] Add spark version to driver log
Zhang, Liye <liye.zhang@intel.com>
2014-12-22 11:36:49 -0800
Commit: 39272c8, github.com/apache/spark/pull/3717
[SPARK-4915][YARN] Fix classname to be specified for external shuffle service.
Tsuyoshi Ozawa <ozawa.tsuyoshi@lab.ntt.co.jp>
2014-12-22 11:28:05 -0800
Commit: 96606f6, github.com/apache/spark/pull/3757
[SPARK-4918][Core] Reuse Text in saveAsTextFile
zsxwing <zsxwing@gmail.com>
2014-12-22 11:20:00 -0800
Commit: 93b2f3a, github.com/apache/spark/pull/3762
[SPARK-2075][Core] Make the compiler generate the same bytecode for Hadoop 1.+ and Hadoop 2.+
zsxwing <zsxwing@gmail.com>
2014-12-21 22:10:19 -0800
Commit: 6ee6aa7, github.com/apache/spark/pull/3740
SPARK-4910 [CORE] build failed (use of FileStatus.isFile in Hadoop 1.x)
Sean Owen <sowen@cloudera.com>
2014-12-21 13:16:57 -0800
Commit: c6a3c0d, github.com/apache/spark/pull/3754
[Minor] Build Failed: value defaultProperties not found
huangzhaowei <carlmartinmax@gmail.com>
2014-12-19 23:32:56 -0800
Commit: a764960, github.com/apache/spark/pull/3749
[SPARK-4140] Document dynamic allocation
Andrew Or <andrew@databricks.com>, Tsuyoshi Ozawa <ozawa.tsuyoshi@gmail.com>
2014-12-19 19:36:20 -0800
Commit: 15c03e1, github.com/apache/spark/pull/3731
[SPARK-4831] Do not include SPARK_CLASSPATH if empty
Daniel Darabos <darabos.daniel@gmail.com>
2014-12-19 19:32:39 -0800
Commit: 7cb3f54, github.com/apache/spark/pull/3678
SPARK-2641: Passing num executors to spark arguments from properties file
Kanwaljit Singh <kanwaljit.singh@guavus.com>
2014-12-19 19:25:39 -0800
Commit: 1d64812, github.com/apache/spark/pull/1657
[SPARK-3060] spark-shell.cmd doesn't accept application options in Windows OS
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-12-19 19:19:53 -0800
Commit: 8d93247, github.com/apache/spark/pull/3350
change signature of example to match released code
Eran Medan <ehrann.mehdan@gmail.com>
2014-12-19 18:29:36 -0800
Commit: c25c669, github.com/apache/spark/pull/3747
[SPARK-2261] Make event logger use a single file.
Marcelo Vanzin <vanzin@cloudera.com>
2014-12-19 18:21:15 -0800
Commit: 4564519, github.com/apache/spark/pull/1222
[SPARK-4890] Upgrade Boto to 2.34.0; automatically download Boto from PyPi instead of packaging it
Josh Rosen <joshrosen@databricks.com>
2014-12-19 17:02:37 -0800
Commit: c28083f, github.com/apache/spark/pull/3737
[SPARK-4896] don't redundantly overwrite executor JAR deps
Ryan Williams <ryan.blake.williams@gmail.com>
2014-12-19 15:24:41 -0800
Commit: 7981f96, github.com/apache/spark/pull/2848
[SPARK-4889] update history server example cmds
Ryan Williams <ryan.blake.williams@gmail.com>
2014-12-19 13:56:04 -0800
Commit: cdb2c64, github.com/apache/spark/pull/3736
Small refactoring to pass SparkEnv into Executor rather than creating SparkEnv in Executor.
Reynold Xin <rxin@databricks.com>
2014-12-19 12:51:12 -0800
Commit: 336cd34, github.com/apache/spark/pull/3738
[Build] Remove spark-staging-1038
scwf <wangfei1@huawei.com>
2014-12-19 08:29:38 -0800
Commit: 8e253eb, github.com/apache/spark/pull/3743
[SPARK-4901] [SQL] Hot fix for ByteWritables.copyBytes
Cheng Hao <hao.cheng@intel.com>
2014-12-19 08:04:41 -0800
Commit: 5479450, github.com/apache/spark/pull/3742
SPARK-3428. TaskMetrics for running tasks is missing GC time metrics
Sandy Ryza <sandy@cloudera.com>
2014-12-18 22:40:44 -0800
Commit: 283263f, github.com/apache/spark/pull/3684
[SPARK-4674] Refactor getCallSite
Liang-Chi Hsieh <viirya@gmail.com>
2014-12-18 21:41:02 -0800
Commit: d7fc69a, github.com/apache/spark/pull/3532
[SPARK-4728][MLLib] Add exponential, gamma, and log normal sampling to MLlib da...
RJ Nowling <rnowling@gmail.com>
2014-12-18 21:00:49 -0800
Commit: ee1fb97, github.com/apache/spark/pull/3680
[SPARK-4861][SQL] Refactor command in Spark SQL
wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
2014-12-18 20:24:56 -0800
Commit: c3d91da, github.com/apache/spark/pull/3712
[SPARK-4573] [SQL] Add SettableStructObjectInspector support in "wrap" function
Cheng Hao <hao.cheng@intel.com>
2014-12-18 20:21:52 -0800
Commit: ae9f128, github.com/apache/spark/pull/3429
[SPARK-2554][SQL] Supporting SumDistinct partial aggregation
ravipesala <ravindra.pesala@huawei.com>
2014-12-18 20:19:10 -0800
Commit: 7687415, github.com/apache/spark/pull/3348
[SPARK-4693] [SQL] PruningPredicates may be wrong if predicates contain an empty AttributeSet() reference
YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>
2014-12-18 20:13:46 -0800
Commit: e7de7e5, github.com/apache/spark/pull/3556
[SPARK-4756][SQL] FIX: sessionToActivePool grow infinitely, even as sessions expire
guowei2 <guowei2@asiainfo.com>
2014-12-18 20:10:23 -0800
Commit: 22ddb6e, github.com/apache/spark/pull/3617
[SPARK-3928][SQL] Support wildcard matches on Parquet files.
Thu Kyaw <trk007@gmail.com>
2014-12-18 20:08:32 -0800
Commit: b68bc6d, github.com/apache/spark/pull/3407
[SPARK-2663] [SQL] Support the Grouping Set
Cheng Hao <hao.cheng@intel.com>
2014-12-18 18:58:29 -0800
Commit: f728e0f, github.com/apache/spark/pull/1567
[SPARK-4754] Refactor SparkContext into ExecutorAllocationClient
Andrew Or <andrew@databricks.com>
2014-12-18 17:37:42 -0800
Commit: 9804a75, github.com/apache/spark/pull/3614
[SPARK-4837] NettyBlockTransferService should use spark.blockManager.port config
Aaron Davidson <aaron@databricks.com>
2014-12-18 16:43:16 -0800
Commit: 105293a, github.com/apache/spark/pull/3688
SPARK-4743 - Use SparkEnv.serializer instead of closureSerializer in aggregateByKey and foldByKey
Ivan Vergiliev <ivan@leanplum.com>
2014-12-18 16:29:36 -0800
Commit: f9f58b9, github.com/apache/spark/pull/3605
[SPARK-4884]: Improve Partition docs
Madhu Siddalingaiah <madhu@madhu.com>
2014-12-18 16:00:53 -0800
Commit: d5a596d, github.com/apache/spark/pull/3722
[SPARK-4880] remove spark.locality.wait in Analytics
Ernest <earneyzxl@gmail.com>
2014-12-18 15:42:26 -0800
Commit: a7ed6f3, github.com/apache/spark/pull/3730
[SPARK-4887][MLlib] Fix a bad unittest in LogisticRegressionSuite
DB Tsai <dbtsai@alpinenow.com>
2014-12-18 13:55:49 -0800
Commit: 59a49db, github.com/apache/spark/pull/3735
[SPARK-3607] ConnectionManager threads.max configs on the thread pools don't work
Ilya Ganelin <ilya.ganelin@capitalone.com>
2014-12-18 12:53:18 -0800
Commit: 3720057, github.com/apache/spark/pull/3664
Add mesos specific configurations into doc
Timothy Chen <tnachen@gmail.com>
2014-12-18 12:15:53 -0800
Commit: d9956f8, github.com/apache/spark/pull/3349
SPARK-3779. yarn spark.yarn.applicationMaster.waitTries config should be...
Sandy Ryza <sandy@cloudera.com>
2014-12-18 12:19:07 -0600
Commit: 253b72b, github.com/apache/spark/pull/3471
[SPARK-4461][YARN] pass extra java options to yarn application master
Zhan Zhang <zhazhan@gmail.com>
2014-12-18 10:01:46 -0600
Commit: 3b76469, github.com/apache/spark/pull/3409
[SPARK-4822] Use sphinx tags for Python doc annotations
lewuathe <lewuathe@me.com>
2014-12-17 17:31:24 -0800
Commit: 3cd5161, github.com/apache/spark/pull/3685
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-12-17 15:50:10 -0800
Commit: ca12608, github.com/apache/spark/pull/3137
[SPARK-3891][SQL] Add array support to percentile, percentile_approx and constant inspectors support
Venkata Ramana G <ramana.gollamudihuawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
2014-12-17 15:41:35 -0800
Commit: f33d550, github.com/apache/spark/pull/2802
[SPARK-4856] [SQL] NullType instead of StringType when sampling against empty string or nul...
Cheng Hao <hao.cheng@intel.com>
2014-12-17 15:01:59 -0800
Commit: 8d0d2a6, github.com/apache/spark/pull/3708
[HOTFIX][SQL] Fix parquet filter suite
Michael Armbrust <michael@databricks.com>
2014-12-17 14:27:02 -0800
Commit: 19c0faa, github.com/apache/spark/pull/3727
[SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc
Joseph K. Bradley <joseph@databricks.com>
2014-12-17 14:12:46 -0800
Commit: affc3f4, github.com/apache/spark/pull/3669
[SPARK-3739] [SQL] Update the split num based on block size for table scanning
Cheng Hao <hao.cheng@intel.com>
2014-12-17 13:39:36 -0800
Commit: 636d9fc, github.com/apache/spark/pull/2589
[SPARK-4755] [SQL] sqrt(negative value) should return null
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-17 12:51:27 -0800
Commit: 902e4d5, github.com/apache/spark/pull/3616
[SPARK-4493][SQL] Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq predicates with nulls for Parquet
Cheng Lian <lian@databricks.com>
2014-12-17 12:48:04 -0800
Commit: 6277135, github.com/apache/spark/pull/3367
[SPARK-3698][SQL] Fix case insensitive resolution of GetField.
Michael Armbrust <michael@databricks.com>
2014-12-17 12:43:51 -0800
Commit: 7ad579e, github.com/apache/spark/pull/3724
[SPARK-4694]Fix HiveThriftServer2 can't stop in YARN HA mode.
carlmartin <carlmartinmax@gmail.com>
2014-12-17 12:24:03 -0800
Commit: 4782def, github.com/apache/spark/pull/3576
[SPARK-4625] [SQL] Add sort by for DSL & SimpleSqlParser
Cheng Hao <hao.cheng@intel.com>
2014-12-17 12:01:57 -0800
Commit: 5fdcbdc, github.com/apache/spark/pull/3481
[SPARK-4595][Core] Fix MetricsServlet not working issue
Saisai Shao <saisai.shao@intel.com>, Josh Rosen <joshrosen@databricks.com>, jerryshao <saisai.shao@intel.com>
2014-12-17 11:47:44 -0800
Commit: cf50631, github.com/apache/spark/pull/3444
[HOTFIX] Fix RAT exclusion for known_translations file
Josh Rosen <joshrosen@databricks.com>
2014-12-16 23:00:25 -0800
Commit: 3d0c37b, github.com/apache/spark/pull/3719
[Release] Update contributors list format and sort it
Andrew Or <andrew@databricks.com>
2014-12-16 22:11:03 -0800
Commit: 4e1112e
[SPARK-4618][SQL] Make foreign DDL commands options case-insensitive
scwf <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
2014-12-16 21:26:36 -0800
Commit: 6069880, github.com/apache/spark/pull/3470
[SPARK-4866] support StructType as key in MapType
Davies Liu <davies@databricks.com>
2014-12-16 21:23:28 -0800
Commit: ec5c427, github.com/apache/spark/pull/3714
[SPARK-4375] [SQL] Add 0 argument support for udf
Cheng Hao <hao.cheng@intel.com>
2014-12-16 21:21:11 -0800
Commit: 770d815, github.com/apache/spark/pull/3595
[SPARK-4720][SQL] Remainder should also return null if the divider is 0.
Takuya UESHIN <ueshin@happy-camper.st>
2014-12-16 21:19:57 -0800
Commit: ddc7ba3, github.com/apache/spark/pull/3581
[SPARK-4744] [SQL] Short circuit evaluation for AND & OR in CodeGen
Cheng Hao <hao.cheng@intel.com>
2014-12-16 21:18:39 -0800
Commit: 0aa834a, github.com/apache/spark/pull/3606
[SPARK-4798][SQL] A new set of Parquet testing API and test suites
Cheng Lian <lian@databricks.com>
2014-12-16 21:16:03 -0800
Commit: 3b395e1, github.com/apache/spark/pull/3644
[Release] Cache known author translations locally
Andrew Or <andrew@databricks.com>
2014-12-16 19:28:43 -0800
Commit: b85044e
[Release] Major improvements to generate contributors script
Andrew Or <andrew@databricks.com>
2014-12-16 17:55:27 -0800
Commit: 6f80b74
[SPARK-4269][SQL] make wait time configurable in BroadcastHashJoin
Jacky Li <jacky.likun@huawei.com>
2014-12-16 15:34:59 -0800
Commit: fa66ef6, github.com/apache/spark/pull/3133
[SPARK-4827][SQL] Fix resolution of deeply nested Project(attr, Project(Star,...)).
Michael Armbrust <michael@databricks.com>
2014-12-16 15:31:19 -0800
Commit: a66c23e, github.com/apache/spark/pull/3674
[SPARK-4483][SQL]Optimization about reduce memory costs during the HashOuterJoin
tianyi <tianyi@asiainfo-linkage.com>, tianyi <tianyi.asiainfo@gmail.com>
2014-12-16 15:22:29 -0800
Commit: 30f6b85, github.com/apache/spark/pull/3375
[SPARK-4527][SQL]Add BroadcastNestedLoopJoin operator selection testsuite
wangxiaojing <u9jing@gmail.com>
2014-12-16 14:45:56 -0800
Commit: ea1315e, github.com/apache/spark/pull/3395
SPARK-4767: Add support for launching in a specified placement group to spark_ec2
Holden Karau <holden@pigscanfly.ca>
2014-12-16 14:37:04 -0800
Commit: b0dfdbd, github.com/apache/spark/pull/3623
[SPARK-4812][SQL] Fix the initialization issue of 'codegenEnabled'
zsxwing <zsxwing@gmail.com>
2014-12-16 14:13:40 -0800
Commit: 6530243, github.com/apache/spark/pull/3660
[SPARK-4847][SQL]Fix "extraStrategies cannot take effect in SQLContext" issue
jerryshao <saisai.shao@intel.com>
2014-12-16 14:08:28 -0800
Commit: dc8280d, github.com/apache/spark/pull/3698
[DOCS][SQL] Add a Note on jsonFile having separate JSON objects per line
Peter Vandenabeele <peter@vandenabeele.com>
2014-12-16 13:57:55 -0800
Commit: 1a9e35e, github.com/apache/spark/pull/3517
[SQL] SPARK-4700: Add HTTP protocol spark thrift server
Judy Nash <judynash@microsoft.com>, judynash <judynash@microsoft.com>
2014-12-16 12:37:26 -0800
Commit: 17688d1, github.com/apache/spark/pull/3672
[SPARK-3405] add subnet-id and vpc-id options to spark_ec2.py
Mike Jennings <mvj101@gmail.com>, Mike Jennings <mvj@google.com>
2014-12-16 12:13:21 -0800
Commit: d12c071, github.com/apache/spark/pull/2872
[SPARK-4855][mllib] testing the Chi-squared hypothesis test
jbencook <jbenjamincook@gmail.com>
2014-12-16 11:37:23 -0800
Commit: cb48447, github.com/apache/spark/pull/3679
[SPARK-4437] update doc for WholeCombineFileRecordReader
Davies Liu <davies@databricks.com>, Josh Rosen <joshrosen@databricks.com>
2014-12-16 11:19:36 -0800
Commit: ed36200, github.com/apache/spark/pull/3301
[SPARK-4841] fix zip with textFile()
Davies Liu <davies@databricks.com>
2014-12-15 22:58:26 -0800
Commit: c246b95, github.com/apache/spark/pull/3706
[SPARK-4792] Add error message when making local dir unsuccessfully
meiyoula <1039320815@qq.com>
2014-12-15 22:30:18 -0800
Commit: c762877, github.com/apache/spark/pull/3635
SPARK-4814 [CORE] Enable assertions in SBT, Maven tests / AssertionError from Hive's LazyBinaryInteger
Sean Owen <sowen@cloudera.com>
2014-12-15 17:12:05 -0800
Commit: 81112e4, github.com/apache/spark/pull/3692
[Minor][Core] fix comments in MapOutputTracker
wangfei <wangfei1@huawei.com>
2014-12-15 16:46:21 -0800
Commit: 5c24759, github.com/apache/spark/pull/3700
SPARK-785 [CORE] ClosureCleaner not invoked on most PairRDDFunctions
Sean Owen <sowen@cloudera.com>
2014-12-15 16:06:15 -0800
Commit: 2a28bc6, github.com/apache/spark/pull/3690
[SPARK-4668] Fix some documentation typos.
Ryan Williams <ryan.blake.williams@gmail.com>
2014-12-15 14:52:17 -0800
Commit: 8176b7a, github.com/apache/spark/pull/3523
[SPARK-1037] The name of findTaskFromList & findTask in TaskSetManager.scala is confusing
Ilya Ganelin <ilya.ganelin@capitalone.com>
2014-12-15 14:51:15 -0800
Commit: 38703bb, github.com/apache/spark/pull/3665
[SPARK-4826] Fix generation of temp file names in WAL tests
Josh Rosen <joshrosen@databricks.com>
2014-12-15 14:33:43 -0800
Commit: f6b8591, github.com/apache/spark/pull/3695
[SPARK-4494][mllib] IDFModel.transform() add support for single vector
Yuu ISHIKAWA <yuu.ishikawa@gmail.com>
2014-12-15 13:44:15 -0800
Commit: 8098fab, github.com/apache/spark/pull/3603
HOTFIX: Disabling failing block manager test
Patrick Wendell <pwendell@gmail.com>
2014-12-15 10:54:45 -0800
Commit: 4c06738
fixed spelling errors in documentation
Peter Klipfel <peter@klipfel.me>
2014-12-14 00:01:16 -0800
Commit: 2a2983f, github.com/apache/spark/pull/3691
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-12-11 23:38:40 -0800
Commit: ef84dab, github.com/apache/spark/pull/3488
[SPARK-4829] [SQL] add rule to fold count(expr) if expr is not null
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-11 22:56:42 -0800
Commit: 41a3f93, github.com/apache/spark/pull/3676
[SPARK-4742][SQL] The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded
Sasaki Toru <sasakitoa@nttdata.co.jp>
2014-12-11 22:54:21 -0800
Commit: 8091dd6, github.com/apache/spark/pull/3602
[SPARK-4825] [SQL] CTAS fails to resolve when created using saveAsTable
Cheng Hao <hao.cheng@intel.com>
2014-12-11 22:51:49 -0800
Commit: 0abbff2, github.com/apache/spark/pull/3673
[SQL] enable empty aggr test case
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-11 22:50:18 -0800
Commit: cbb634a, github.com/apache/spark/pull/3445
[SPARK-4828] [SQL] sum and avg on empty table should always return null
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-11 22:49:27 -0800
Commit: acb3be6, github.com/apache/spark/pull/3675
[SQL] Remove unnecessary case in HiveContext.toHiveString
scwf <wangfei1@huawei.com>
2014-12-11 22:48:03 -0800
Commit: d8cf678, github.com/apache/spark/pull/3563
[SPARK-4293][SQL] Make Cast be able to handle complex types.
Takuya UESHIN <ueshin@happy-camper.st>
2014-12-11 22:45:25 -0800
Commit: 3344803, github.com/apache/spark/pull/3150
[SPARK-4639] [SQL] Pass maxIterations in as a parameter in Analyzer
Jacky Li <jacky.likun@huawei.com>
2014-12-11 22:44:27 -0800
Commit: c152dde, github.com/apache/spark/pull/3499
[SPARK-4662] [SQL] Whitelist more unittest
Cheng Hao <hao.cheng@intel.com>
2014-12-11 22:43:02 -0800
Commit: a7f07f5, github.com/apache/spark/pull/3522
[SPARK-4713] [SQL] SchemaRDD.unpersist() should not raise exception if it is not persisted
Cheng Hao <hao.cheng@intel.com>
2014-12-11 22:41:36 -0800
Commit: bf40cf8, github.com/apache/spark/pull/3572
[SPARK-4806] Streaming doc update for 1.2
Tathagata Das <tathagata.das1565@gmail.com>, Josh Rosen <joshrosen@databricks.com>, Josh Rosen <rosenville@gmail.com>
2014-12-11 06:21:23 -0800
Commit: b004150, github.com/apache/spark/pull/3653
[SPARK-4791] [sql] Infer schema from case class with multiple constructors
Joseph K. Bradley <joseph@databricks.com>
2014-12-10 23:41:15 -0800
Commit: 2a5b5fd, github.com/apache/spark/pull/3646
[CORE]codeStyle: uniform ConcurrentHashMap define in StorageLevel.scala with other places
Zhang, Liye <liye.zhang@intel.com>
2014-12-10 20:44:59 -0800
Commit: 57d37f9, github.com/apache/spark/pull/2793
SPARK-3526 Add section about data locality to the tuning guide
Andrew Ash <andrew@andrewash.com>
2014-12-10 15:01:15 -0800
Commit: 652b781, github.com/apache/spark/pull/2519
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-12-10 14:41:16 -0800
Commit: 36bdb5b, github.com/apache/spark/pull/2883
[SPARK-4759] Fix driver hanging from coalescing partitions
Andrew Or <andrew@databricks.com>
2014-12-10 14:27:53 -0800
Commit: 4f93d0c, github.com/apache/spark/pull/3633
[SPARK-4569] Rename 'externalSorting' in Aggregator
Ilya Ganelin <ilya.ganelin@capitalone.com>
2014-12-10 14:19:37 -0800
Commit: 447ae2d, github.com/apache/spark/pull/3666
[SPARK-4793] [Deploy] ensure .jar at end of line
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-10 13:29:27 -0800
Commit: e230da1, github.com/apache/spark/pull/3641
[SPARK-4215] Allow requesting / killing executors only in YARN mode
Andrew Or <andrew@databricks.com>
2014-12-10 12:48:24 -0800
Commit: faa8fd8, github.com/apache/spark/pull/3615
[SPARK-4771][Docs] Document standalone cluster supervise mode
Andrew Or <andrew@databricks.com>
2014-12-10 12:41:36 -0800
Commit: 5621283, github.com/apache/spark/pull/3627
[SPARK-4329][WebUI] HistoryPage pagination
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-10 12:29:00 -0800
Commit: 0fc637b, github.com/apache/spark/pull/3194
[SPARK-4161]Spark shell class path is not correctly set if "spark.driver.extraClassPath" is set in defaults.conf
GuoQiang Li <witgo@qq.com>
2014-12-10 12:24:04 -0800
Commit: 742e709, github.com/apache/spark/pull/3050
[SPARK-4772] Clear local copies of accumulators as soon as we're done with them
Nathan Kronenfeld <nkronenfeld@oculusinfo.com>
2014-12-09 23:53:17 -0800
Commit: 94b377f, github.com/apache/spark/pull/3570
[Minor] Use <sup> tag for help icon in web UI page header
Josh Rosen <joshrosen@databricks.com>
2014-12-09 23:47:05 -0800
Commit: f79c1cf, github.com/apache/spark/pull/3659
Config updates for the new shuffle transport.
Reynold Xin <rxin@databricks.com>
2014-12-09 19:29:09 -0800
Commit: 9bd9334, github.com/apache/spark/pull/3657
[SPARK-4740] Create multiple concurrent connections between two peer nodes in Netty.
Reynold Xin <rxin@databricks.com>
2014-12-09 17:49:59 -0800
Commit: 2b9b726, github.com/apache/spark/pull/3625
SPARK-4805 [CORE] BlockTransferMessage.toByteArray() trips assertion
Sean Owen <sowen@cloudera.com>
2014-12-09 16:38:27 -0800
Commit: d8f84f2, github.com/apache/spark/pull/3650
SPARK-4567. Make SparkJobInfo and SparkStageInfo serializable
Sandy Ryza <sandy@cloudera.com>
2014-12-09 16:26:07 -0800
Commit: 5e4c06f, github.com/apache/spark/pull/3426
[SPARK-4714] BlockManager.dropFromMemory() should check whether block has been removed after synchronizing on BlockInfo instance.
hushan[胡珊] <hushan@xiaomi.com>
2014-12-09 15:11:20 -0800
Commit: 30dca92, github.com/apache/spark/pull/3574
[SPARK-4765] Make GC time always shown in UI.
Kay Ousterhout <kayousterhout@gmail.com>
2014-12-09 15:10:36 -0800
Commit: 1f51106, github.com/apache/spark/pull/3622
[SPARK-4691][shuffle] Restructure a few lines in shuffle code
maji2014 <maji3@asiainfo.com>
2014-12-09 13:13:12 -0800
Commit: b310744, github.com/apache/spark/pull/3553
[SPARK-874] adding a --wait flag
jbencook <jbenjamincook@gmail.com>
2014-12-09 12:16:19 -0800
Commit: 61f1a70, github.com/apache/spark/pull/3567
SPARK-4338. [YARN] Ditch yarn-alpha.
Sandy Ryza <sandy@cloudera.com>
2014-12-09 11:02:43 -0800
Commit: 912563a, github.com/apache/spark/pull/3215
[SPARK-4785][SQL] Initialize Hive UDFs on the driver and serialize them with a wrapper
Cheng Hao <hao.cheng@intel.com>, Cheng Lian <lian@databricks.com>
2014-12-09 10:28:15 -0800
Commit: 383c555, github.com/apache/spark/pull/3640
[SPARK-3154][STREAMING] Replace ConcurrentHashMap with mutable.HashMap and remove @volatile from 'stopped'
zsxwing <zsxwing@gmail.com>
2014-12-08 23:54:15 -0800
Commit: bcb5cda, github.com/apache/spark/pull/3634
[SPARK-4769] [SQL] CTAS does not work when reading from temporary tables
Cheng Hao <hao.cheng@intel.com>
2014-12-08 17:39:12 -0800
Commit: 51b1fe1, github.com/apache/spark/pull/3336
[SQL] remove unnecessary import in spark-sql
Jacky Li <jacky.likun@huawei.com>
2014-12-08 17:27:46 -0800
Commit: 9443843, github.com/apache/spark/pull/3630
SPARK-4770. [DOC] [YARN] spark.scheduler.minRegisteredResourcesRatio doc...
Sandy Ryza <sandy@cloudera.com>
2014-12-08 16:28:36 -0800
Commit: cda94d1, github.com/apache/spark/pull/3624
SPARK-3926 [CORE] Reopened: result of JavaRDD collectAsMap() is not serializable
Sean Owen <sowen@cloudera.com>
2014-12-08 16:13:03 -0800
Commit: e829bfa, github.com/apache/spark/pull/3587
[SPARK-4750] Dynamic allocation - synchronize kills
Andrew Or <andrew@databricks.com>
2014-12-08 16:02:33 -0800
Commit: 65f929d, github.com/apache/spark/pull/3612
[SPARK-4774] [SQL] Makes HiveFromSpark more portable
Kostas Sakellis <kostas@cloudera.com>
2014-12-08 15:44:18 -0800
Commit: d6a972b, github.com/apache/spark/pull/3628
[SPARK-4764] Ensure that files are fetched atomically
Christophe Préaud <christophe.preaud@kelkoo.com>
2014-12-08 11:44:54 -0800
Commit: ab2abcb, github.com/apache/spark/pull/2855
[SPARK-4620] Add unpersist in Graph and GraphImpl
Takeshi Yamamuro <linguin.m.s@gmail.com>
2014-12-07 19:42:02 -0800
Commit: 8817fc7, github.com/apache/spark/pull/3476
[SPARK-4646] Replace Scala.util.Sorting.quickSort with Sorter(TimSort) in Spark
Takeshi Yamamuro <linguin.m.s@gmail.com>
2014-12-07 19:36:08 -0800
Commit: 2e6b736, github.com/apache/spark/pull/3507
[SPARK-3623][GraphX] GraphX should support the checkpoint operation
GuoQiang Li <witgo@qq.com>
2014-12-06 00:56:51 -0800
Commit: e895e0c, github.com/apache/spark/pull/2631
Streaming doc : do you mean inadvertently?
CrazyJvm <crazyjvm@gmail.com>
2014-12-05 13:42:13 -0800
Commit: 6eb1b6f, github.com/apache/spark/pull/3620
[SPARK-4005][CORE] handle message replies in receive instead of in the individual private methods
Zhang, Liye <liye.zhang@intel.com>
2014-12-05 12:00:32 -0800
Commit: 98a7d09, github.com/apache/spark/pull/2853
[SPARK-4761][SQL] Enables Kryo by default in Spark SQL Thrift server
Cheng Lian <lian@databricks.com>
2014-12-05 10:27:40 -0800
Commit: 6f61e1f, github.com/apache/spark/pull/3621
[SPARK-4753][SQL] Use catalyst for partition pruning in newParquet.
Michael Armbrust <michael@databricks.com>
2014-12-04 22:25:21 -0800
Commit: f5801e8, github.com/apache/spark/pull/3613
Revert "SPARK-2624 add datanucleus jars to the container in yarn-cluster"
Andrew Or <andrew@databricks.com>
2014-12-04 21:53:49 -0800
Commit: fd85253
Revert "[HOT FIX] [YARN] Check whether `/lib` exists before listing its files"
Andrew Or <andrew@databricks.com>
2014-12-04 21:53:38 -0800
Commit: 87437df
[SPARK-4464] Description about configuration options need to be modified in docs.
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-12-04 19:33:02 -0800
Commit: ca37903, github.com/apache/spark/pull/3329
Fix typo in Spark SQL docs.
Andy Konwinski <andykonwinski@gmail.com>
2014-12-04 18:27:02 -0800
Commit: 15cf3b0, github.com/apache/spark/pull/3611
[SPARK-4421] Wrong link in spark-standalone.html
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-12-04 18:14:36 -0800
Commit: ddfc09c, github.com/apache/spark/pull/3279
[SPARK-4397] Move object RDD to the front of RDD.scala.
Reynold Xin <rxin@databricks.com>
2014-12-04 16:32:20 -0800
Commit: ed92b47, github.com/apache/spark/pull/3580
[SPARK-4652][DOCS] Add docs about spark-git-repo option
lewuathe <lewuathe@me.com>, Josh Rosen <joshrosen@databricks.com>
2014-12-04 15:14:36 -0800
Commit: ab8177d, github.com/apache/spark/pull/3513
[SPARK-4459] Change groupBy type parameter from K to U
Saldanha <saldaal1@phusca-l24858.wlan.na.novartis.net>
2014-12-04 14:22:09 -0800
Commit: 743a889, github.com/apache/spark/pull/3327
[SPARK-4745] Fix get_existing_cluster() function with multiple security groups
alexdebrie <alexdebrie1@gmail.com>
2014-12-04 14:13:59 -0800
Commit: 794f3ae, github.com/apache/spark/pull/3596
[HOTFIX] Fixing two issues with the release script.
Patrick Wendell <pwendell@gmail.com>
2014-12-04 12:11:41 -0800
Commit: 8dae26f, github.com/apache/spark/pull/3608
[SPARK-4253] Ignore spark.driver.host in yarn-cluster and standalone-cluster modes
WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
2014-12-04 11:52:47 -0800
Commit: 8106b1e, github.com/apache/spark/pull/3112
[SPARK-4683][SQL] Add a beeline.cmd to run on Windows
Cheng Lian <lian@databricks.com>
2014-12-04 10:21:03 -0800
Commit: 28c7aca, github.com/apache/spark/pull/3599
[FIX][DOC] Fix broken links in ml-guide.md
Xiangrui Meng <meng@databricks.com>
2014-12-04 20:16:35 +0800
Commit: 7e758d7, github.com/apache/spark/pull/3601
[SPARK-4575] [mllib] [docs] spark.ml pipelines doc + bug fixes
Joseph K. Bradley <joseph@databricks.com>, jkbradley <joseph.kurata.bradley@gmail.com>, Xiangrui Meng <meng@databricks.com>
2014-12-04 17:00:06 +0800
Commit: 469a6e5, github.com/apache/spark/pull/3588
[docs] Fix outdated comment in tuning guide
Joseph K. Bradley <joseph@databricks.com>
2014-12-04 00:59:32 -0800
Commit: 529439b, github.com/apache/spark/pull/3592
[SQL] Minor: Avoid calling Seq#size in a loop
Aaron Davidson <aaron@databricks.com>
2014-12-04 00:58:42 -0800
Commit: c6c7165, github.com/apache/spark/pull/3593
[SPARK-4685] Include all spark.ml and spark.mllib packages in JavaDoc's MLlib group
lewuathe <lewuathe@me.com>, Xiangrui Meng <meng@databricks.com>
2014-12-04 16:51:41 +0800
Commit: 20bfea4, github.com/apache/spark/pull/3554
[SPARK-4719][API] Consolidate various narrow dep RDD classes with MapPartitionsRDD
Reynold Xin <rxin@databricks.com>
2014-12-04 00:45:57 -0800
Commit: c3ad486, github.com/apache/spark/pull/3578
[SQL] remove unnecessary import
Jacky Li <jacky.likun@huawei.com>
2014-12-04 00:43:55 -0800
Commit: ed88db4, github.com/apache/spark/pull/3585
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-12-03 22:15:46 -0800
Commit: 3cdae03, github.com/apache/spark/pull/1875
[Release] Correctly translate contributors name in release notes
Andrew Or <andrew@databricks.com>
2014-12-03 19:08:29 -0800
Commit: a4dfb4e
[SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + DecisionTree API fix
Joseph K. Bradley <joseph@databricks.com>, Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
2014-12-04 09:57:50 +0800
Commit: 657a888, github.com/apache/spark/pull/3461
[SPARK-4711] [mllib] [docs] Programming guide advice on choosing optimizer
Joseph K. Bradley <joseph@databricks.com>
2014-12-04 08:58:03 +0800
Commit: 27ab0b8, github.com/apache/spark/pull/3569
[SPARK-4085] Propagate FetchFailedException when Spark fails to read local shuffle file.
Reynold Xin <rxin@databricks.com>
2014-12-03 16:28:24 -0800
Commit: 1826372, github.com/apache/spark/pull/3579
[SPARK-4498][core] Don't transition ExecutorInfo to RUNNING until Driver adds Executor
Mark Hamstra <markhamstra@gmail.com>
2014-12-03 15:08:01 -0800
Commit: 96b2785, github.com/apache/spark/pull/3550
[SPARK-4552][SQL] Avoid exception when reading empty parquet data through Hive
Michael Armbrust <michael@databricks.com>
2014-12-03 14:13:35 -0800
Commit: 513ef82, github.com/apache/spark/pull/3586
[HOT FIX] [YARN] Check whether `/lib` exists before listing its files
Andrew Or <andrew@databricks.com>
2014-12-03 13:56:23 -0800
Commit: 90ec643, github.com/apache/spark/pull/3589
[SPARK-4642] Add description about spark.yarn.queue to running-on-YARN document.
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-12-03 13:16:24 -0800
Commit: 692f493, github.com/apache/spark/pull/3500
[SPARK-4715][Core] Make sure tryToAcquire won't return a negative value
zsxwing <zsxwing@gmail.com>
2014-12-03 12:19:40 -0800
Commit: edd3cd4, github.com/apache/spark/pull/3575
[SPARK-4701] Typo in sbt/sbt
Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
2014-12-03 12:08:00 -0800
Commit: 96786e3, github.com/apache/spark/pull/3560
SPARK-2624 add datanucleus jars to the container in yarn-cluster
Jim Lim <jim@quixey.com>
2014-12-03 11:16:02 -0800
Commit: a975dc3, github.com/apache/spark/pull/3238
[SPARK-4717][MLlib] Optimize BLAS library to avoid de-reference multiple times in loop
DB Tsai <dbtsai@alpinenow.com>
2014-12-03 22:31:39 +0800
Commit: d005429, github.com/apache/spark/pull/3577
[SPARK-4708][MLLib] Make k-means run two/three times faster with dense/sparse sample
DB Tsai <dbtsai@alpinenow.com>
2014-12-03 19:01:56 +0800
Commit: 7fc49ed, github.com/apache/spark/pull/3565
[SPARK-4710] [mllib] Eliminate MLlib compilation warnings
Joseph K. Bradley <joseph@databricks.com>
2014-12-03 18:50:03 +0800
Commit: 4ac2151, github.com/apache/spark/pull/3568
[SPARK-4397][Core] Change the 'since' value of '@deprecated' to '1.3.0'
zsxwing <zsxwing@gmail.com>
2014-12-03 02:05:17 -0800
Commit: 8af551f, github.com/apache/spark/pull/3573
[SPARK-4672][Core]Checkpoint() should clear f to shorten the serialization chain
JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
2014-12-02 23:53:29 -0800
Commit: 77be8b9, github.com/apache/spark/pull/3545
[SPARK-4672][GraphX]Non-transient PartitionsRDDs will lead to StackOverflow error
JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
2014-12-02 17:14:11 -0800
Commit: 17c162f, github.com/apache/spark/pull/3544
[SPARK-4672][GraphX]Perform checkpoint() on PartitionsRDD to shorten the lineage
JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
2014-12-02 17:08:02 -0800
Commit: fc0a147, github.com/apache/spark/pull/3549
[Release] Translate unknown author names automatically
Andrew Or <andrew@databricks.com>
2014-12-02 16:36:12 -0800
Commit: 5da21f0
Minor nit style cleanup in GraphX.
Reynold Xin <rxin@databricks.com>
2014-12-02 14:40:26 -0800
Commit: 2d4f6e7
[SPARK-4695][SQL] Get result using executeCollect
wangfei <wangfei1@huawei.com>
2014-12-02 14:30:44 -0800
Commit: 3ae0cda, github.com/apache/spark/pull/3547
[SPARK-4670] [SQL] wrong symbol for bitwise not
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-02 14:25:12 -0800
Commit: 1f5ddf1, github.com/apache/spark/pull/3528
[SPARK-4593][SQL] Return null when denominator is 0
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-02 14:21:12 -0800
Commit: f6df609, github.com/apache/spark/pull/3443
[SPARK-4676][SQL] JavaSchemaRDD.schema may throw NullType MatchError if sql has null
YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>, Michael Armbrust <michael@databricks.com>
2014-12-02 14:12:48 -0800
Commit: 1066427, github.com/apache/spark/pull/3538
[SPARK-4663][sql]add finally to avoid resource leak
baishuo <vc_java@hotmail.com>
2014-12-02 12:12:03 -0800
Commit: 69b6fed, github.com/apache/spark/pull/3526
[SPARK-4536][SQL] Add sqrt and abs to Spark SQL DSL
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-02 12:07:52 -0800
Commit: e75e04f, github.com/apache/spark/pull/3401
Indent license header properly for interfaces.scala.
Reynold Xin <rxin@databricks.com>
2014-12-02 11:59:15 -0800
Commit: b1f8fe3, github.com/apache/spark/pull/3552
[SPARK-4686] Link to allowed master URLs is broken
Kay Ousterhout <kayousterhout@gmail.com>
2014-12-02 09:06:02 -0800
Commit: d9a148b, github.com/apache/spark/pull/3542
[SPARK-4397][Core] Cleanup 'import SparkContext._' in core
zsxwing <zsxwing@gmail.com>
2014-12-02 00:18:41 -0800
Commit: 6dfe38a, github.com/apache/spark/pull/3530
[SPARK-4611][MLlib] Implement the efficient vector norm
DB Tsai <dbtsai@alpinenow.com>
2014-12-02 11:40:43 +0800
Commit: 64f3175, github.com/apache/spark/pull/3462
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-12-01 17:27:14 -0800
Commit: b0a46d8, github.com/apache/spark/pull/1612
[SPARK-4268][SQL] Use #::: to get benefit from Stream in SqlLexical.allCaseVersions
zsxwing <zsxwing@gmail.com>
2014-12-01 16:39:54 -0800
Commit: d3e02dd, github.com/apache/spark/pull/3132
[SPARK-4529] [SQL] support view with column alias
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-01 16:08:51 -0800
Commit: 4df60a8, github.com/apache/spark/pull/3396
[SQL][DOC] Date type in SQL programming guide
Daoyuan Wang <daoyuan.wang@intel.com>
2014-12-01 14:03:57 -0800
Commit: 5edbcbf, github.com/apache/spark/pull/3535
[SQL] Minor fix for doc and comment
wangfei <wangfei1@huawei.com>
2014-12-01 14:02:02 -0800
Commit: 7b79957, github.com/apache/spark/pull/3533
[SPARK-4658][SQL] Code documentation issue in DDL of datasource API
ravipesala <ravindra.pesala@huawei.com>
2014-12-01 13:31:27 -0800
Commit: bc35381, github.com/apache/spark/pull/3516
[SPARK-4650][SQL] Supporting multi column support in countDistinct function like count(distinct c1,c2..) in Spark SQL
ravipesala <ravindra.pesala@huawei.com>, Michael Armbrust <michael@databricks.com>
2014-12-01 13:26:44 -0800
Commit: 6a9ff19, github.com/apache/spark/pull/3511
[SPARK-4358][SQL] Let BigDecimal do checking type compatibility
Liang-Chi Hsieh <viirya@gmail.com>
2014-12-01 13:17:56 -0800
Commit: b57365a, github.com/apache/spark/pull/3208
[SQL] add @group tab in limit() and count()
Jacky Li <jacky.likun@gmail.com>
2014-12-01 13:12:30 -0800
Commit: bafee67, github.com/apache/spark/pull/3458
[SPARK-4258][SQL][DOC] Documents spark.sql.parquet.filterPushdown
Cheng Lian <lian@databricks.com>
2014-12-01 13:09:51 -0800
Commit: 5db8dca, github.com/apache/spark/pull/3440
Documentation: add description for repartitionAndSortWithinPartitions
Madhu Siddalingaiah <madhu@madhu.com>
2014-12-01 08:45:34 -0800
Commit: 2b233f5, github.com/apache/spark/pull/3390
[SPARK-4661][Core] Minor code and docs cleanup
zsxwing <zsxwing@gmail.com>
2014-12-01 00:35:01 -0800
Commit: 30a86ac, github.com/apache/spark/pull/3521
[SPARK-4664][Core] Throw an exception when spark.akka.frameSize > 2047
zsxwing <zsxwing@gmail.com>
2014-12-01 00:32:54 -0800
Commit: 1d238f2, github.com/apache/spark/pull/3527
SPARK-2192 [BUILD] Examples Data Not in Binary Distribution
Sean Owen <sowen@cloudera.com>
2014-12-01 16:31:04 +0800
Commit: 6384f42, github.com/apache/spark/pull/3480
Fix wrong file name pattern in .gitignore
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-12-01 00:29:28 -0800
Commit: 97eb6d7, github.com/apache/spark/pull/3529
[SPARK-4632] version update
Prabeesh K <prabsmails@gmail.com>
2014-11-30 20:51:53 -0800
Commit: 5e7a6dc, github.com/apache/spark/pull/3495
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-11-30 20:51:13 -0800
Commit: 06dc1b1, github.com/apache/spark/pull/2915
[DOC] Fixes formatting typo in SQL programming guide
Cheng Lian <lian@databricks.com>
2014-11-30 19:04:07 -0800
Commit: 2a4d389, github.com/apache/spark/pull/3498
[SPARK-4656][Doc] Typo in Programming Guide markdown
lewuathe <lewuathe@me.com>
2014-11-30 17:18:50 -0800
Commit: a217ec5, github.com/apache/spark/pull/3412
[SPARK-4623]Add some error information if using spark-sql in yarn-cluster mode
carlmartin <carlmartinmax@gmail.com>, huangzhaowei <carlmartinmax@gmail.com>
2014-11-30 16:19:41 -0800
Commit: aea7a99, github.com/apache/spark/pull/3479
SPARK-2143 [WEB UI] Add Spark version to UI footer
Sean Owen <sowen@cloudera.com>
2014-11-30 11:40:08 -0800
Commit: 048ecca, github.com/apache/spark/pull/3410
[DOCS][BUILD] Add instruction to use change-version-to-2.11.sh in 'Building for Scala 2.11'.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-30 00:10:31 -0500
Commit: 0fcd24c, github.com/apache/spark/pull/3361
SPARK-4507: PR merge script should support closing multiple JIRA tickets
Takayuki Hasegawa <takayuki.hasegawa0311@gmail.com>
2014-11-29 23:12:10 -0500
Commit: 4316a7b, github.com/apache/spark/pull/3428
[SPARK-4505][Core] Add a ClassTag parameter to CompactBuffer[T]
zsxwing <zsxwing@gmail.com>
2014-11-29 20:23:08 -0500
Commit: c062224, github.com/apache/spark/pull/3378
[SPARK-4057] Use -agentlib instead of -Xdebug in sbt-launch-lib.bash for debugging
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-29 20:14:14 -0500
Commit: 938dc14, github.com/apache/spark/pull/2904
Include the key name when failing on an invalid value.
Stephen Haberman <stephen@exigencecorp.com>
2014-11-29 20:12:05 -0500
Commit: 95290bf, github.com/apache/spark/pull/3514
[SPARK-3398] [SPARK-4325] [EC2] Use EC2 status checks.
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-11-29 00:31:06 -0800
Commit: 317e114, github.com/apache/spark/pull/3195
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-11-29 00:24:35 -0500
Commit: 047ff57, github.com/apache/spark/pull/3451
[SPARK-4597] Use proper exception and reset variable in Utils.createTempDir()
Liang-Chi Hsieh <viirya@gmail.com>
2014-11-28 18:04:05 -0800
Commit: 49fe879, github.com/apache/spark/pull/3449
SPARK-1450 [EC2] Specify the default zone in the EC2 script help
Sean Owen <sowen@cloudera.com>
2014-11-28 17:43:38 -0500
Commit: 48223d8, github.com/apache/spark/pull/3454
[SPARK-4584] [yarn] Remove security manager from Yarn AM.
Marcelo Vanzin <vanzin@cloudera.com>
2014-11-28 15:15:30 -0500
Commit: 915f8ee, github.com/apache/spark/pull/3484
[SPARK-4193][BUILD] Disable doclint in Java 8 to prevent from build error.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-28 13:00:15 -0500
Commit: e464f0a, github.com/apache/spark/pull/3058
[SPARK-4643] [Build] Remove unneeded staging repositories from build
Daoyuan Wang <daoyuan.wang@intel.com>
2014-11-28 12:41:38 -0500
Commit: 53ed7f1, github.com/apache/spark/pull/3504
Delete unnecessary function
KaiXinXiaoLei <huleilei1@huawei.com>
2014-11-28 12:34:07 -0500
Commit: 052e658, github.com/apache/spark/pull/3224
[SPARK-4645][SQL] Disables asynchronous execution in Hive 0.13.1 HiveThriftServer2
Cheng Lian <lian@databricks.com>
2014-11-28 11:42:40 -0500
Commit: 5b99bf2, github.com/apache/spark/pull/3506
[SPARK-4619][Storage]delete redundant time suffix
maji2014 <maji3@asiainfo.com>
2014-11-28 00:36:22 -0800
Commit: ceb6281, github.com/apache/spark/pull/3475
[SPARK-4613][Core] Java API for JdbcRDD
Cheng Lian <lian@databricks.com>
2014-11-27 18:01:14 -0800
Commit: 120a350, github.com/apache/spark/pull/3478
[SPARK-4626] Kill a task only if the executorId is (still) registered with the scheduler
roxchkplusony <roxchkplusony@gmail.com>
2014-11-27 15:54:40 -0800
Commit: 84376d3, github.com/apache/spark/pull/3483
SPARK-4170 [CORE] Closure problems when running Scala app that "extends App"
Sean Owen <sowen@cloudera.com>
2014-11-27 09:03:17 -0800
Commit: 5d7fe17, github.com/apache/spark/pull/3497
[Release] Automate generation of contributors list
Andrew Or <andrew@databricks.com>
2014-11-26 23:16:23 -0800
Commit: c86e9bc
[SPARK-732][SPARK-3628][CORE][RESUBMIT] eliminate duplicate update on accumulator
CodingCat <zhunansjtu@gmail.com>
2014-11-26 16:52:04 -0800
Commit: 5af53ad, github.com/apache/spark/pull/2524
[SPARK-4614][MLLIB] Slight API changes in Matrix and Matrices
Xiangrui Meng <meng@databricks.com>
2014-11-26 08:22:50 -0800
Commit: 561d31d, github.com/apache/spark/pull/3468
Removing confusing TripletFields
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-11-26 00:55:28 -0800
Commit: 288ce58, github.com/apache/spark/pull/3472
[SPARK-4612] Reduce task latency and increase scheduling throughput by making configuration initialization lazy
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-25 23:15:58 -0800
Commit: e7f4d25, github.com/apache/spark/pull/3463
[SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily
Aaron Davidson <aaron@databricks.com>
2014-11-26 00:32:45 -0500
Commit: 346bc17, github.com/apache/spark/pull/3465
[SPARK-4516] Cap default number of Netty threads at 8
Aaron Davidson <aaron@databricks.com>
2014-11-25 23:57:04 -0500
Commit: f5f2d27, github.com/apache/spark/pull/3469
[SPARK-4604][MLLIB] make MatrixFactorizationModel public
Xiangrui Meng <meng@databricks.com>
2014-11-25 20:11:40 -0800
Commit: b5fb141, github.com/apache/spark/pull/3459
[HOTFIX]: Adding back without-hive dist
Patrick Wendell <pwendell@gmail.com>
2014-11-25 23:10:19 -0500
Commit: 4d95526
[SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates
Joseph K. Bradley <joseph@databricks.com>
2014-11-25 20:10:15 -0800
Commit: c251fd7, github.com/apache/spark/pull/3439
[Spark-4509] Revert EC2 tag-based cluster membership patch
Xiangrui Meng <meng@databricks.com>
2014-11-25 16:07:09 -0800
Commit: 7eba0fb, github.com/apache/spark/pull/3453
Fix SPARK-4471: blockManagerIdFromJson function throws exception while B...
hushan[胡珊] <hushan@xiaomi.com>
2014-11-25 15:51:08 -0800
Commit: 9bdf5da, github.com/apache/spark/pull/3340
[SPARK-4546] Improve HistoryServer first time user experience
Andrew Or <andrew@databricks.com>
2014-11-25 15:48:02 -0800
Commit: 9afcbe4, github.com/apache/spark/pull/3411
[SPARK-4592] Avoid duplicate worker registrations in standalone mode
Andrew Or <andrew@databricks.com>
2014-11-25 15:46:26 -0800
Commit: 1b2ab1c, github.com/apache/spark/pull/3447
[SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-25 14:16:27 -0800
Commit: 8838ad7, github.com/apache/spark/pull/3457
[SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance
DB Tsai <dbtsai@alpinenow.com>
2014-11-25 11:07:11 -0800
Commit: bf1a6aa, github.com/apache/spark/pull/3435
[SPARK-4601][Streaming] Set correct call site for streaming jobs so that it is displayed correctly on the Spark UI
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-25 06:50:36 -0800
Commit: 69cd53e, github.com/apache/spark/pull/3455
[SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first
arahuja <aahuja11@gmail.com>
2014-11-25 08:23:41 -0600
Commit: d240760, github.com/apache/spark/pull/3209
[SPARK-4381][Streaming]Add warning log when user set spark.master to local in Spark Streaming and there's no job executed
jerryshao <saisai.shao@intel.com>
2014-11-25 05:36:29 -0800
Commit: fef27b2, github.com/apache/spark/pull/3244
[SPARK-4535][Streaming] Fix the error in comments
q00251598 <qiyadong@huawei.com>
2014-11-25 04:01:56 -0800
Commit: a51118a, github.com/apache/spark/pull/3400
[SPARK-4526][MLLIB] GradientDescent gets a wrong gradient value according to the gradient formula.
GuoQiang Li <witgo@qq.com>
2014-11-25 02:01:19 -0800
Commit: f515f94, github.com/apache/spark/pull/3399
[SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner
DB Tsai <dbtsai@alpinenow.com>
2014-11-25 01:57:34 -0800
Commit: 89f9122, github.com/apache/spark/pull/3446
[DOC][Build] Wrong cmd for building Spark with Apache Hadoop 2.4.X and Hive 12
wangfei <wangfei1@huawei.com>
2014-11-24 22:32:39 -0800
Commit: 0fe54cf, github.com/apache/spark/pull/3335
[SQL] Compute timeTaken correctly
w00228970 <wangfei1@huawei.com>
2014-11-24 21:17:24 -0800
Commit: 723be60, github.com/apache/spark/pull/3423
[SPARK-4582][MLLIB] get raw vectors for further processing in Word2Vec
tkaessmann <tobias.kaessmanns24.com>, tkaessmann <tobias.kaessmann@s24.com>
2014-11-24 19:58:01 -0800
Commit: 9ce2bf3, github.com/apache/spark/pull/3309
[SPARK-4525] Mesos should decline unused offers
Patrick Wendell <pwendell@gmail.com>, Jongyoul Lee <jongyoul@gmail.com>
2014-11-24 19:14:14 -0800
Commit: f0afb62, github.com/apache/spark/pull/3436
Revert "[SPARK-4525] Mesos should decline unused offers"
Patrick Wendell <pwendell@gmail.com>
2014-11-24 19:16:53 -0800
Commit: a68d442
[SPARK-4525] Mesos should decline unused offers
Patrick Wendell <pwendell@gmail.com>, Jongyoul Lee <jongyoul@gmail.com>
2014-11-24 19:14:14 -0800
Commit: b043c27, github.com/apache/spark/pull/3436
[SPARK-4266] [Web-UI] Reduce stage page load time.
Kay Ousterhout <kayousterhout@gmail.com>
2014-11-24 18:03:10 -0800
Commit: d24d5bf, github.com/apache/spark/pull/3328
[SPARK-4548] [SPARK-4517] improve performance of python broadcast
Davies Liu <davies@databricks.com>
2014-11-24 17:17:03 -0800
Commit: 6cf5076, github.com/apache/spark/pull/3417
[SPARK-4578] fix asDict() with nested Row()
Davies Liu <davies@databricks.com>
2014-11-24 16:41:23 -0800
Commit: 050616b, github.com/apache/spark/pull/3434
[SPARK-4562] [MLlib] speedup vector
Davies Liu <davies@databricks.com>
2014-11-24 16:37:14 -0800
Commit: b660de7, github.com/apache/spark/pull/3420
[SPARK-4518][SPARK-4519][Streaming] Refactored file stream to prevent files from being processed multiple times
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-24 13:50:20 -0800
Commit: cb0e9b0, github.com/apache/spark/pull/3419
[SPARK-4145] Web UI job pages
Josh Rosen <joshrosen@databricks.com>
2014-11-24 13:18:14 -0800
Commit: 4a90276, github.com/apache/spark/pull/3009
[SPARK-4487][SQL] Fix attribute reference resolution error when using ORDER BY.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-24 12:54:37 -0800
Commit: dd1c9cb, github.com/apache/spark/pull/3363
[SQL] Fix path in HiveFromSpark
scwf <wangfei1@huawei.com>
2014-11-24 12:49:08 -0800
Commit: b384119, github.com/apache/spark/pull/3415
[SQL] Fix comment in HiveShim
Daniel Darabos <darabos.daniel@gmail.com>
2014-11-24 12:45:07 -0800
Commit: d5834f0, github.com/apache/spark/pull/3432
[SPARK-4479][SQL] Avoids unnecessary defensive copies when sort based shuffle is on
Cheng Lian <lian@databricks.com>
2014-11-24 12:43:45 -0800
Commit: a6d7b61, github.com/apache/spark/pull/3422
SPARK-4457. Document how to build for Hadoop versions greater than 2.4
Sandy Ryza <sandy@cloudera.com>
2014-11-24 13:28:48 -0600
Commit: 29372b6, github.com/apache/spark/pull/3322
[SPARK-4377] Fixed serialization issue by switching to akka provided serializer.
Prashant Sharma <prashant.s@imaginea.com>
2014-11-22 14:05:38 -0800
Commit: 9b2a3c6, github.com/apache/spark/pull/3402
[SPARK-4431][MLlib] Implement efficient foreachActive for dense and sparse vector
DB Tsai <dbtsai@alpinenow.com>
2014-11-21 18:15:07 -0800
Commit: b5d17ef, github.com/apache/spark/pull/3288
[SPARK-4531] [MLlib] cache serialized java object
Davies Liu <davies@databricks.com>
2014-11-21 15:02:31 -0800
Commit: ce95bd8, github.com/apache/spark/pull/3397
SPARK-4532: Fix bug in detection of Hive in Spark 1.2
Patrick Wendell <pwendell@gmail.com>
2014-11-21 12:10:04 -0800
Commit: a81918c, github.com/apache/spark/pull/3398
[SPARK-4397][Core] Reorganize 'implicit's to improve the API convenience
zsxwing <zsxwing@gmail.com>
2014-11-21 10:06:30 -0800
Commit: 65b987c, github.com/apache/spark/pull/3262
[SPARK-4472][Shell] Print "Spark context available as sc." only when SparkContext is created...
zsxwing <zsxwing@gmail.com>
2014-11-21 00:42:43 -0800
Commit: f1069b8, github.com/apache/spark/pull/3341
[Doc][GraphX] Remove unused png files.
Reynold Xin <rxin@databricks.com>
2014-11-21 00:30:58 -0800
Commit: 28fdc6f
[Doc][GraphX] Remove Motivation section and did some minor update.
Reynold Xin <rxin@databricks.com>
2014-11-21 00:29:02 -0800
Commit: b97070e
[SPARK-4522][SQL] Parse schema with missing metadata.
Michael Armbrust <michael@databricks.com>
2014-11-20 20:34:43 -0800
Commit: 90a6a46, github.com/apache/spark/pull/3392
add Sphinx as a dependency of building docs
Davies Liu <davies@databricks.com>
2014-11-20 19:12:45 -0800
Commit: 8cd6eea, github.com/apache/spark/pull/3388
[SPARK-4413][SQL] Parquet support through datasource API
Michael Armbrust <michael@databricks.com>
2014-11-20 18:31:02 -0800
Commit: 02ec058, github.com/apache/spark/pull/3269
[SPARK-4244] [SQL] Support Hive Generic UDFs with constant object inspector parameters
Cheng Hao <hao.cheng@intel.com>
2014-11-20 16:50:59 -0800
Commit: 84d79ee, github.com/apache/spark/pull/3109
[SPARK-4477] [PySpark] remove numpy from RDDSampler
Davies Liu <davies@databricks.com>, Xiangrui Meng <meng@databricks.com>
2014-11-20 16:40:25 -0800
Commit: d39f2e9, github.com/apache/spark/pull/3351
[SQL] fix function description mistake
Jacky Li <jacky.likun@gmail.com>
2014-11-20 15:48:36 -0800
Commit: ad5f1f3, github.com/apache/spark/pull/3344
[SPARK-2918] [SQL] Support the CTAS in EXPLAIN command
Cheng Hao <hao.cheng@intel.com>
2014-11-20 15:46:00 -0800
Commit: 6aa0fc9, github.com/apache/spark/pull/3357
[SPARK-4318][SQL] Fix empty sum distinct.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-20 15:41:24 -0800
Commit: 2c2e7a4, github.com/apache/spark/pull/3184
[SPARK-4513][SQL] Support relational operator '<=>' in Spark SQL
ravipesala <ravindra.pesala@huawei.com>
2014-11-20 15:34:03 -0800
Commit: 98e9419, github.com/apache/spark/pull/3387
[SPARK-4439] [MLlib] add python api for random forest
Davies Liu <davies@databricks.com>
2014-11-20 15:31:28 -0800
Commit: 1c53a5d, github.com/apache/spark/pull/3320
[SPARK-4228][SQL] SchemaRDD to JSON
Dan McClary <dan.mcclary@gmail.com>
2014-11-20 13:36:50 -0800
Commit: b8e6886, github.com/apache/spark/pull/3213
[SPARK-3938][SQL] Names in-memory columnar RDD with corresponding table name
Cheng Lian <lian@databricks.com>
2014-11-20 13:12:24 -0800
Commit: abf2918, github.com/apache/spark/pull/3383
[SPARK-4486][MLLIB] Improve GradientBoosting APIs and doc
Xiangrui Meng <meng@databricks.com>
2014-11-20 00:48:59 -0800
Commit: 15cacc8, github.com/apache/spark/pull/3374
[SPARK-4446] [SPARK CORE]
Leolh <leosandylh@gmail.com>
2014-11-19 18:18:55 -0800
Commit: e216ffa, github.com/apache/spark/pull/3306
[SPARK-4480] Avoid many small spills in external data structures
Andrew Or <andrew@databricks.com>
2014-11-19 18:07:27 -0800
Commit: 0eb4a7f, github.com/apache/spark/pull/3353
[Spark-4484] Treat maxResultSize as unlimited when set to 0; improve error message
Nishkam Ravi <nravi@cloudera.com>, nravi <nravi@c1704.halxg.cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>
2014-11-19 17:23:42 -0800
Commit: 73fedf5, github.com/apache/spark/pull/3360
[SPARK-4478] Keep totalRegisteredExecutors up-to-date
Akshat Aranya <aaranya@quantcast.com>
2014-11-19 17:20:20 -0800
Commit: 9ccc53c, github.com/apache/spark/pull/3373
Updating GraphX programming guide and documentation
Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
2014-11-19 16:53:33 -0800
Commit: 377b068, github.com/apache/spark/pull/3359
[SPARK-4495] Fix memory leak in JobProgressListener
Josh Rosen <joshrosen@databricks.com>
2014-11-19 16:50:21 -0800
Commit: 04d462f, github.com/apache/spark/pull/3372
[SPARK-4294][Streaming] UnionDStream stream should express the requirements in the same way as TransformedDStream
Yadong Qi <qiyadong2010@gmail.com>
2014-11-19 15:53:06 -0800
Commit: c3002c4, github.com/apache/spark/pull/3152
[SPARK-4384] [PySpark] improve sort spilling
Davies Liu <davies@databricks.com>
2014-11-19 15:45:37 -0800
Commit: 73c8ea8, github.com/apache/spark/pull/3252
[SPARK-4429][BUILD] Build for Scala 2.11 using sbt fails.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-19 14:40:21 -0800
Commit: f9adda9, github.com/apache/spark/pull/3342
[DOC][PySpark][Streaming] Fix docstring for sphinx
Ken Takagiwa <ugw.gi.world@gmail.com>
2014-11-19 14:23:18 -0800
Commit: 9b7bbce, github.com/apache/spark/pull/3311
SPARK-3962 Marked scope as provided for external projects.
Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
2014-11-19 14:18:10 -0800
Commit: 1c93841, github.com/apache/spark/pull/2959
[HOT FIX] MiMa tests are broken
Andrew Or <andrew@databricks.com>
2014-11-19 14:03:44 -0800
Commit: 0df02ca, github.com/apache/spark/pull/3371
[SPARK-4481][Streaming][Doc] Fix the wrong description of updateFunc
zsxwing <zsxwing@gmail.com>
2014-11-19 13:17:15 -0800
Commit: 3bf7cee, github.com/apache/spark/pull/3356
[SPARK-4482][Streaming] Disable ReceivedBlockTracker's write ahead log by default
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-19 13:06:48 -0800
Commit: 22fc4e7, github.com/apache/spark/pull/3358
[SPARK-4470] Validate number of threads in local mode
Kenichi Maehashi <webmaster@kenichimaehashi.com>
2014-11-19 12:11:09 -0800
Commit: eacc788, github.com/apache/spark/pull/3337
[SPARK-4467] fix elements read count for ExternalSorter
Tianshuo Deng <tdeng@twitter.com>
2014-11-19 10:01:09 -0800
Commit: d75579d, github.com/apache/spark/pull/3302
SPARK-4455 Exclude dependency on hbase-annotations module
tedyu <yuzhihong@gmail.com>
2014-11-19 00:55:39 -0800
Commit: 5f5ac2d, github.com/apache/spark/pull/3286
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-11-19 00:27:31 -0800
Commit: 8327df6, github.com/apache/spark/pull/2777
[Spark-4432]close InStream after the block is accessed
Mingfei <mingfei.shi@intel.com>
2014-11-18 22:17:06 -0800
Commit: 165cec9, github.com/apache/spark/pull/3290
[SPARK-4441] Close Tachyon client when TachyonBlockManager is shutdown
Mingfei <mingfei.shi@intel.com>
2014-11-18 22:16:36 -0800
Commit: 67e9876, github.com/apache/spark/pull/3299
Bumping version to 1.3.0-SNAPSHOT.
Marcelo Vanzin <vanzin@cloudera.com>
2014-11-18 21:24:18 -0800
Commit: 397d3aa, github.com/apache/spark/pull/3277
[SPARK-4468][SQL] Fixes Parquet filter creation for inequality predicates with literals on the left hand side
Cheng Lian <lian@databricks.com>
2014-11-18 17:41:54 -0800
Commit: 423baea, github.com/apache/spark/pull/3334
[SPARK-4327] [PySpark] Python API for RDD.randomSplit()
Davies Liu <davies@databricks.com>
2014-11-18 16:37:35 -0800
Commit: 7f22fa8, github.com/apache/spark/pull/3193
[SPARK-4433] fix a race condition in zipWithIndex
Xiangrui Meng <meng@databricks.com>
2014-11-18 16:25:44 -0800
Commit: bb46046, github.com/apache/spark/pull/3291
[SPARK-3721] [PySpark] broadcast objects larger than 2G
Davies Liu <davies@databricks.com>, Davies Liu <davies.liu@gmail.com>
2014-11-18 16:17:51 -0800
Commit: 4a377af, github.com/apache/spark/pull/2659
[SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGS
Davies Liu <davies@databricks.com>
2014-11-18 15:57:33 -0800
Commit: d2e2951, github.com/apache/spark/pull/3307
[SPARK-4463] Add (de)select all button for add'l metrics.
Kay Ousterhout <kayousterhout@gmail.com>
2014-11-18 15:01:06 -0800
Commit: 010bc86, github.com/apache/spark/pull/3331
[SPARK-4017] show progress bar in console
Davies Liu <davies@databricks.com>
2014-11-18 13:37:21 -0800
Commit: e34f38f, github.com/apache/spark/pull/3029
[SPARK-4404] remove sys.exit() in shutdown hook
Davies Liu <davies@databricks.com>
2014-11-18 13:11:38 -0800
Commit: 80f3177, github.com/apache/spark/pull/3289
[SPARK-4075][SPARK-4434] Fix the URI validation logic for Application Jar name.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-18 12:17:33 -0800
Commit: bfebfd8, github.com/apache/spark/pull/3326
[SQL] Support partitioned parquet tables that have the key in both the directory and the file
Michael Armbrust <michael@databricks.com>
2014-11-18 12:13:23 -0800
Commit: 90d72ec, github.com/apache/spark/pull/3272
[SPARK-4396] allow lookup by index in Python's Rating
Xiangrui Meng <meng@databricks.com>
2014-11-18 10:35:29 -0800
Commit: b54c6ab, github.com/apache/spark/pull/3261
[SPARK-4435] [MLlib] [PySpark] improve classification
Davies Liu <davies@databricks.com>
2014-11-18 10:11:13 -0800
Commit: 8fbf72b, github.com/apache/spark/pull/3305
ALS implicit: added missing parameter alpha in doc string
Felix Maximilian Möller <felixmaximilian.moeller@immobilienscout24.de>
2014-11-18 10:08:24 -0800
Commit: cedc3b5, github.com/apache/spark/pull/3343
SPARK-4466: Provide support for publishing Scala 2.11 artifacts to Maven
Patrick Wendell <pwendell@gmail.com>
2014-11-17 21:07:50 -0800
Commit: c6e0c2a, github.com/apache/spark/pull/3332
[SPARK-4453][SPARK-4213][SQL] Simplifies Parquet filter generation code
Cheng Lian <lian@databricks.com>
2014-11-17 16:55:12 -0800
Commit: 36b0956, github.com/apache/spark/pull/3317
[SPARK-4448] [SQL] unwrap for the ConstantObjectInspector
Cheng Hao <hao.cheng@intel.com>
2014-11-17 16:35:49 -0800
Commit: ef7c464, github.com/apache/spark/pull/3308
[SPARK-4443][SQL] Fix statistics for external table in spark sql hive
w00228970 <wangfei1@huawei.com>
2014-11-17 16:33:50 -0800
Commit: 42389b1, github.com/apache/spark/pull/3304
[SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types
Cheng Lian <lian@databricks.com>
2014-11-17 16:31:05 -0800
Commit: 6b7f2f7, github.com/apache/spark/pull/3298
[SQL] Construct the MutableRow from an Array
Cheng Hao <hao.cheng@intel.com>
2014-11-17 16:29:52 -0800
Commit: 69e858c, github.com/apache/spark/pull/3217
[SPARK-4425][SQL] Handle NaN or Infinity cast to Timestamp correctly.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-17 16:28:07 -0800
Commit: 566c791, github.com/apache/spark/pull/3283
[SPARK-4420][SQL] Change nullability of Cast from DoubleType/FloatType to DecimalType.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-17 16:26:48 -0800
Commit: 3a81a1c, github.com/apache/spark/pull/3278
[SQL] Makes conjunction pushdown more aggressive for in-memory table
Cheng Lian <lian@databricks.com>
2014-11-17 15:33:13 -0800
Commit: 5ce7dae, github.com/apache/spark/pull/3318
[SPARK-4180] [Core] Prevent creation of multiple active SparkContexts
Josh Rosen <joshrosen@databricks.com>
2014-11-17 12:48:18 -0800
Commit: 0f3ceb5, github.com/apache/spark/pull/3121
[DOCS][SQL] Fix broken link to Row class scaladoc
Andy Konwinski <andykonwinski@gmail.com>
2014-11-17 11:52:23 -0800
Commit: cec1116, github.com/apache/spark/pull/3323
Revert "[SPARK-4075] [Deploy] Jar url validation is not enough for Jar file"
Andrew Or <andrew@databricks.com>
2014-11-17 11:24:28 -0800
Commit: dbb9da5
[SPARK-4444] Drop VD type parameter from EdgeRDD
Ankur Dave <ankurdave@gmail.com>
2014-11-17 11:06:31 -0800
Commit: 9ac2bb1, github.com/apache/spark/pull/3303
SPARK-2811 upgrade algebird to 0.8.1
Adam Pingel <adam@axle-lang.org>
2014-11-17 10:47:29 -0800
Commit: e7690ed, github.com/apache/spark/pull/3282
SPARK-4445, Don't display storage level in toDebugString unless RDD is persisted.
Prashant Sharma <prashant.s@imaginea.com>
2014-11-17 10:40:33 -0800
Commit: 5c92d47, github.com/apache/spark/pull/3310
[SPARK-4410][SQL] Add support for external sort
Michael Armbrust <michael@databricks.com>
2014-11-16 21:55:57 -0800
Commit: 64c6b9b, github.com/apache/spark/pull/3268
[SPARK-4422][MLLIB]In some cases, Vectors.fromBreeze get wrong results.
GuoQiang Li <witgo@qq.com>
2014-11-16 21:31:51 -0800
Commit: 5168c6c, github.com/apache/spark/pull/3281
Revert "[SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types"
Michael Armbrust <michael@databricks.com>
2014-11-16 15:05:04 -0800
Commit: 45ce327, github.com/apache/spark/pull/3292
[SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types
Cheng Lian <lian@databricks.com>
2014-11-16 14:26:41 -0800
Commit: cb6bd83, github.com/apache/spark/pull/3178
[SPARK-4393] Fix memory leak in ConnectionManager ACK timeout TimerTasks; use HashedWheelTimer
Josh Rosen <joshrosen@databricks.com>
2014-11-16 00:44:15 -0800
Commit: 7850e0c, github.com/apache/spark/pull/3259
[SPARK-4426][SQL][Minor] The symbol of BitwiseOr is wrong, should not be '&'
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-15 22:23:47 -0800
Commit: 84468b2, github.com/apache/spark/pull/3284
[SPARK-4419] Upgrade snappy-java to 1.1.1.6
Josh Rosen <joshrosen@databricks.com>
2014-11-15 22:22:34 -0800
Commit: 7d8e152, github.com/apache/spark/pull/3287
[SPARK-2321] Several progress API improvements / refactorings
Josh Rosen <joshrosen@databricks.com>
2014-11-14 23:46:25 -0800
Commit: 40eb8b6, github.com/apache/spark/pull/3197
Added contains(key) to Metadata
kai <kaizeng@eecs.berkeley.edu>
2014-11-14 23:44:23 -0800
Commit: cbddac2, github.com/apache/spark/pull/3273
[SPARK-4260] Httpbroadcast should set connection timeout.
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-14 22:36:56 -0800
Commit: 60969b0, github.com/apache/spark/pull/3122
[SPARK-4363][Doc] Update the Broadcast example
zsxwing <zsxwing@gmail.com>
2014-11-14 22:28:48 -0800
Commit: 861223e, github.com/apache/spark/pull/3226
[SPARK-4379][Core] Change Exception to SparkException in checkpoint
zsxwing <zsxwing@gmail.com>
2014-11-14 22:25:41 -0800
Commit: dba1405, github.com/apache/spark/pull/3241
[SPARK-4415] [PySpark] JVM should exit after Python exit
Davies Liu <davies@databricks.com>
2014-11-14 20:13:46 -0800
Commit: 7fe08b4, github.com/apache/spark/pull/3274
[SPARK-4404]SparkSubmitDriverBootstrapper should stop after its SparkSubmit sub-proc...
WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
2014-11-14 20:11:51 -0800
Commit: 303a4e4, github.com/apache/spark/pull/3266
SPARK-4214. With dynamic allocation, avoid outstanding requests for more...
Sandy Ryza <sandy@cloudera.com>
2014-11-14 15:51:05 -0800
Commit: ad42b28, github.com/apache/spark/pull/3204
[SPARK-4412][SQL] Fix Spark's control of Parquet logging.
Jim Carroll <jim@dontcallme.com>
2014-11-14 15:33:21 -0800
Commit: 37482ce, github.com/apache/spark/pull/3271
[SPARK-4365][SQL] Remove unnecessary filter call on records returned from parquet library
Yash Datta <Yash.Datta@guavus.com>
2014-11-14 15:16:36 -0800
Commit: 63ca3af, github.com/apache/spark/pull/3229
[SPARK-4386] Improve performance when writing Parquet files.
Jim Carroll <jim@dontcallme.com>
2014-11-14 15:11:53 -0800
Commit: f76b968, github.com/apache/spark/pull/3254
[SPARK-4322][SQL] Enables struct fields as sub expressions of grouping fields
Cheng Lian <lian@databricks.com>
2014-11-14 15:09:36 -0800
Commit: 0c7b66b, github.com/apache/spark/pull/3248
[SQL] Don't shuffle code generated rows
Michael Armbrust <michael@databricks.com>
2014-11-14 15:03:23 -0800
Commit: 4b4b50c, github.com/apache/spark/pull/3263
[SQL] Minor cleanup of comments, errors and override.
Michael Armbrust <michael@databricks.com>
2014-11-14 15:00:42 -0800
Commit: f805025, github.com/apache/spark/pull/3257
[SPARK-4391][SQL] Configure parquet filters using SQLConf
Michael Armbrust <michael@databricks.com>
2014-11-14 14:59:35 -0800
Commit: e47c387, github.com/apache/spark/pull/3258
[SPARK-4390][SQL] Handle NaN cast to decimal correctly
Michael Armbrust <michael@databricks.com>
2014-11-14 14:56:57 -0800
Commit: a0300ea, github.com/apache/spark/pull/3256
[SPARK-4062][Streaming]Add ReliableKafkaReceiver in Spark Streaming Kafka connector
jerryshao <saisai.shao@intel.com>, Tathagata Das <tathagata.das1565@gmail.com>, Saisai Shao <saisai.shao@intel.com>
2014-11-14 14:33:37 -0800
Commit: 5930f64, github.com/apache/spark/pull/2991
[SPARK-4333][SQL] Correctly log number of iterations in RuleExecutor
DoingDone9 <799203320@qq.com>
2014-11-14 14:28:06 -0800
Commit: 0cbdb01, github.com/apache/spark/pull/3180
SPARK-4375. no longer require -Pscala-2.10
Sandy Ryza <sandy@cloudera.com>
2014-11-14 14:21:57 -0800
Commit: f5f757e, github.com/apache/spark/pull/3239
[SPARK-4245][SQL] Fix containsNull of the result ArrayType of CreateArray expression.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-14 14:21:16 -0800
Commit: bbd8f5b, github.com/apache/spark/pull/3110
[SPARK-4239] [SQL] support view in HiveQl
Daoyuan Wang <daoyuan.wang@intel.com>
2014-11-14 13:51:20 -0800
Commit: ade72c4, github.com/apache/spark/pull/3131
Update failed assert text to match code in SizeEstimatorSuite
Jeff Hammerbacher <jeff.hammerbacher@gmail.com>
2014-11-14 13:37:48 -0800
Commit: c258db9, github.com/apache/spark/pull/3242
[SPARK-4313][WebUI][Yarn] Fix link issue of the executor thread dump page in yarn-cluster mode
zsxwing <zsxwing@gmail.com>
2014-11-14 13:36:13 -0800
Commit: 156cf33, github.com/apache/spark/pull/3183
SPARK-3663 Document SPARK_LOG_DIR and SPARK_PID_DIR
Andrew Ash <andrew@andrewash.com>
2014-11-14 13:33:35 -0800
Commit: 5c265cc, github.com/apache/spark/pull/2518
[Spark Core] SPARK-4380 Edit spilling log from MB to B
Hong Shen <hongshen@tencent.com>
2014-11-14 13:29:41 -0800
Commit: 0c56a03, github.com/apache/spark/pull/3243
[SPARK-4398][PySpark] specialize sc.parallelize(xrange)
Xiangrui Meng <meng@databricks.com>
2014-11-14 12:43:17 -0800
Commit: abd5817, github.com/apache/spark/pull/3264
[SPARK-4394][SQL] Data Sources API Improvements
Michael Armbrust <michael@databricks.com>
2014-11-14 12:00:08 -0800
Commit: 77e845c, github.com/apache/spark/pull/3260
[SPARK-3722][Docs]minor improvement and fix in docs
WangTao <barneystinson@aliyun.com>
2014-11-14 08:09:42 -0600
Commit: e421072, github.com/apache/spark/pull/2579
[SPARK-4310][WebUI] Sort 'Submitted' column in Stage page by time
zsxwing <zsxwing@gmail.com>
2014-11-13 14:37:04 -0800
Commit: 825709a, github.com/apache/spark/pull/3179
[SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala and Python
Xiangrui Meng <meng@databricks.com>
2014-11-13 13:54:16 -0800
Commit: 3221830, github.com/apache/spark/pull/3232
[SPARK-4326] fix unidoc
Xiangrui Meng <meng@databricks.com>
2014-11-13 13:16:20 -0800
Commit: 4b0c1ed, github.com/apache/spark/pull/3253
[HOT FIX] make-distribution.sh fails if Yarn shuffle jar DNE
Andrew Or <andrew@databricks.com>
2014-11-13 11:54:45 -0800
Commit: a0fa1ba, github.com/apache/spark/pull/3250
[SPARK-4378][MLLIB] make ALS more Java-friendly
Xiangrui Meng <meng@databricks.com>
2014-11-13 11:42:27 -0800
Commit: ca26a21, github.com/apache/spark/pull/3240
[SPARK-4348] [PySpark] [MLlib] rename random.py to rand.py
Davies Liu <davies@databricks.com>
2014-11-13 10:24:54 -0800
Commit: ce0333f, github.com/apache/spark/pull/3216
[SPARK-4256] Make Binary Evaluation Metrics functions defined in cases where there ar...
Andrew Bullen <andrew.bullen@workday.com>
2014-11-12 22:14:44 -0800
Commit: 484fecb, github.com/apache/spark/pull/3118
[SPARK-4370] [Core] Limit number of Netty cores based on executor size
Aaron Davidson <aaron@databricks.com>
2014-11-12 18:46:37 -0800
Commit: b9e1c2e, github.com/apache/spark/pull/3155
[SPARK-4373][MLLIB] fix MLlib maven tests
Xiangrui Meng <meng@databricks.com>
2014-11-12 18:15:14 -0800
Commit: 23f5bdf, github.com/apache/spark/pull/3235
[Release] Bring audit scripts up-to-date
Andrew Or <andrew@databricks.com>
2014-11-13 00:30:58 +0000
Commit: 723a86b
[SPARK-2672] support compressed file in wholeTextFile
Davies Liu <davies@databricks.com>
2014-11-12 15:58:12 -0800
Commit: d7d54a4, github.com/apache/spark/pull/3005
[SPARK-4369] [MLLib] fix TreeModel.predict() with RDD
Davies Liu <davies@databricks.com>
2014-11-12 13:56:41 -0800
Commit: bd86118, github.com/apache/spark/pull/3230
[SPARK-3666] Extract interfaces for EdgeRDD and VertexRDD
Ankur Dave <ankurdave@gmail.com>
2014-11-12 13:49:20 -0800
Commit: a5ef581, github.com/apache/spark/pull/2530
[Release] Correct make-distribution.sh log path
Andrew Or <andrew@databricks.com>
2014-11-12 13:46:26 -0800
Commit: c3afd32
Internal cleanup for aggregateMessages
Ankur Dave <ankurdave@gmail.com>
2014-11-12 13:44:49 -0800
Commit: 0402be9, github.com/apache/spark/pull/3231
[SPARK-4281][Build] Package Yarn shuffle service into its own jar
Andrew Or <andrew@databricks.com>
2014-11-12 13:39:45 -0800
Commit: aa43a8d, github.com/apache/spark/pull/3147
[Test] Better exception message from SparkSubmitSuite
Andrew Or <andrew@databricks.com>
2014-11-12 13:35:48 -0800
Commit: 6e3c5a2, github.com/apache/spark/pull/3212
[SPARK-3660][STREAMING] Initial RDD for updateStateByKey transformation
Soumitra Kumar <kumar.soumitra@gmail.com>
2014-11-12 12:25:31 -0800
Commit: 36ddeb7, github.com/apache/spark/pull/2665
[SPARK-3530][MLLIB] pipeline and parameters with examples
Xiangrui Meng <meng@databricks.com>
2014-11-12 10:38:57 -0800
Commit: 4b736db, github.com/apache/spark/pull/3099
[SPARK-4355][MLLIB] fix OnlineSummarizer.merge when other.mean is zero
Xiangrui Meng <meng@databricks.com>
2014-11-12 01:50:11 -0800
Commit: 84324fb, github.com/apache/spark/pull/3220
[SPARK-3936] Add aggregateMessages, which supersedes mapReduceTriplets
Ankur Dave <ankurdave@gmail.com>
2014-11-11 23:38:27 -0800
Commit: faeb41d, github.com/apache/spark/pull/3100
[MLLIB] SPARK-4347: Reducing GradientBoostingSuite run time.
Manish Amde <manish9ue@gmail.com>
2014-11-11 22:47:53 -0800
Commit: 2ef016b, github.com/apache/spark/pull/3214
Support cross building for Scala 2.11
Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
2014-11-11 21:36:48 -0800
Commit: daaca14, github.com/apache/spark/pull/3159
[Release] Log build output for each distribution
Andrew Or <andrew@databricks.com>
2014-11-11 18:02:59 -0800
Commit: 2ddb141
SPARK-2269 Refactor mesos scheduler resourceOffers and add unit test
Timothy Chen <tnachen@gmail.com>
2014-11-11 14:29:18 -0800
Commit: a878660, github.com/apache/spark/pull/1487
[SPARK-4282][YARN] Stopping flag in YarnClientSchedulerBackend should be volatile
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-11 12:33:53 -0600
Commit: 7f37188, github.com/apache/spark/pull/3143
SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module
Sean Owen <sowen@cloudera.com>
2014-11-11 12:30:35 -0600
Commit: f820b56, github.com/apache/spark/pull/3167
SPARK-1830 Deploy failover, Make Persistence engine and LeaderAgent Pluggable
Prashant Sharma <prashant.s@imaginea.com>
2014-11-11 09:29:48 -0800
Commit: deefd9d, github.com/apache/spark/pull/771
[Streaming][Minor]Replace some 'if-else' in Clock
huangzhaowei <carlmartinmax@gmail.com>
2014-11-11 03:02:12 -0800
Commit: 6e03de3, github.com/apache/spark/pull/3088
[SPARK-2492][Streaming] kafkaReceiver minor changes to align with Kafka 0.8
jerryshao <saisai.shao@intel.com>
2014-11-11 02:22:23 -0800
Commit: c8850a3, github.com/apache/spark/pull/1420
[SPARK-4295][External]Fix exception in SparkSinkSuite
maji2014 <maji3@asiainfo.com>
2014-11-11 02:18:27 -0800
Commit: f8811a5, github.com/apache/spark/pull/3177
[SPARK-4307] Initialize FileDescriptor lazily in FileRegion.
Reynold Xin <rxin@databricks.com>, Reynold Xin <rxin@apache.org>
2014-11-11 00:25:31 -0800
Commit: ef29a9a, github.com/apache/spark/pull/3172
[SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib API
Davies Liu <davies@databricks.com>
2014-11-10 22:26:16 -0800
Commit: 65083e9, github.com/apache/spark/pull/3189
[SPARK-4330][Doc] Link to proper URL for YARN overview
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-10 22:18:00 -0800
Commit: 3c07b8f, github.com/apache/spark/pull/3196
[SPARK-3649] Remove GraphX custom serializers
Ankur Dave <ankurdave@gmail.com>
2014-11-10 19:31:52 -0800
Commit: 300887b, github.com/apache/spark/pull/2503
[SPARK-4274] [SQL] Fix NPE in printing the details of the query plan
Cheng Hao <hao.cheng@intel.com>
2014-11-10 17:46:05 -0800
Commit: c764d0a, github.com/apache/spark/pull/3139
[SPARK-3954][Streaming] Optimization to FileInputDStream
surq <surq@asiainfo.com>
2014-11-10 17:37:16 -0800
Commit: ce6ed2a, github.com/apache/spark/pull/2811
[SPARK-4149][SQL] ISO 8601 support for json date time strings
Daoyuan Wang <daoyuan.wang@intel.com>
2014-11-10 17:26:03 -0800
Commit: a1fc059, github.com/apache/spark/pull/3012
[SPARK-4250] [SQL] Fix bug of constant null value mapping to ConstantObjectInspector
Cheng Hao <hao.cheng@intel.com>
2014-11-10 17:22:57 -0800
Commit: fa77783, github.com/apache/spark/pull/3114
[SQL] remove a decimal case branch that has no effect at runtime
Xiangrui Meng <meng@databricks.com>
2014-11-10 17:20:52 -0800
Commit: d793d80, github.com/apache/spark/pull/3192
[SPARK-4308][SQL] Sets SQL operation state to ERROR when exception is thrown
Cheng Lian <lian@databricks.com>
2014-11-10 16:56:36 -0800
Commit: acb55ae, github.com/apache/spark/pull/3175
[SPARK-4000][Build] Uploads HiveCompatibilitySuite logs
Cheng Lian <lian@databricks.com>
2014-11-10 16:17:52 -0800
Commit: 534b231, github.com/apache/spark/pull/2993
[SPARK-4319][SQL] Enable an ignored test "null count".
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-10 15:55:15 -0800
Commit: dbf1058, github.com/apache/spark/pull/3185
Revert "[SPARK-2703][Core]Make Tachyon related unit tests execute without deploying a Tachyon system locally."
Patrick Wendell <pwendell@gmail.com>
2014-11-10 14:56:06 -0800
Commit: 6e7a309
[SPARK-4047] - Generate runtime warnings for example implementation of PageRank
Varadharajan Mukundan <srinathsmn@gmail.com>
2014-11-10 14:32:29 -0800
Commit: 974d334, github.com/apache/spark/pull/2894
SPARK-1297 Upgrade HBase dependency to 0.98
tedyu <yuzhihong@gmail.com>
2014-11-10 13:23:33 -0800
Commit: b32734e, github.com/apache/spark/pull/3115
SPARK-4230. Doc for spark.default.parallelism is incorrect
Sandy Ryza <sandy@cloudera.com>
2014-11-10 12:40:41 -0800
Commit: c6f4e70, github.com/apache/spark/pull/3107
[SPARK-4312] bash doesn't have "die"
Jey Kottalam <jey@kottalam.net>
2014-11-10 12:37:56 -0800
Commit: c5db8e2, github.com/apache/spark/pull/2898
Update RecoverableNetworkWordCount.scala
comcmipi <pitonak@fns.uniba.sk>
2014-11-10 12:33:48 -0800
Commit: 0340c56, github.com/apache/spark/pull/2735
SPARK-2548 [STREAMING] JavaRecoverableWordCount is missing
Sean Owen <sowen@cloudera.com>
2014-11-10 11:47:27 -0800
Commit: 3a02d41, github.com/apache/spark/pull/2564
[SPARK-4169] [Core] Accommodate non-English Locales in unit tests
Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
2014-11-10 11:37:38 -0800
Commit: ed8bf1e, github.com/apache/spark/pull/3036
[SQL] support udt to hive types conversion (hive->udt is not supported)
Xiangrui Meng <meng@databricks.com>
2014-11-10 11:04:12 -0800
Commit: 894a724, github.com/apache/spark/pull/3164
[SPARK-2703][Core]Make Tachyon related unit tests execute without deploying a Tachyon system locally.
RongGu <gurongwalker@gmail.com>
2014-11-09 23:48:15 -0800
Commit: bd86cb1, github.com/apache/spark/pull/3030
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-11-09 23:07:14 -0800
Commit: 227488d, github.com/apache/spark/pull/2898
SPARK-3179. Add task OutputMetrics.
Sandy Ryza <sandy@cloudera.com>
2014-11-09 22:29:03 -0800
Commit: 3c2cff4, github.com/apache/spark/pull/2968
SPARK-1209 [CORE] (Take 2) SparkHadoop{MapRed,MapReduce}Util should not use package org.apache.hadoop
Sean Owen <sowen@cloudera.com>
2014-11-09 22:11:20 -0800
Commit: f8e5732, github.com/apache/spark/pull/3048
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-11-09 18:16:20 -0800
Commit: f73b56f, github.com/apache/spark/pull/464
SPARK-1344 [DOCS] Scala API docs for top methods
Sean Owen <sowen@cloudera.com>
2014-11-09 17:42:08 -0800
Commit: d136265, github.com/apache/spark/pull/3168
SPARK-971 [DOCS] Link to Confluence wiki from project website / documentation
Sean Owen <sowen@cloudera.com>
2014-11-09 17:40:48 -0800
Commit: 8c99a47, github.com/apache/spark/pull/3169
[SPARK-4301] StreamingContext should not allow start() to be called after calling stop()
Josh Rosen <joshrosen@databricks.com>
2014-11-08 18:10:23 -0800
Commit: 7b41b17, github.com/apache/spark/pull/3160
[Minor] [Core] Don't NPE on closeQuietly(null)
Aaron Davidson <aaron@databricks.com>
2014-11-08 13:03:51 -0800
Commit: 4af5c7e, github.com/apache/spark/pull/3166
[SPARK-4291][Build] Rename network module projects
Andrew Or <andrew@databricks.com>
2014-11-07 23:16:13 -0800
Commit: 7afc856, github.com/apache/spark/pull/3148
[MLLIB] [PYTHON] SPARK-4221: Expose nonnegative ALS in the python API
Michelangelo D'Agostino <mdagostino@civisanalytics.com>
2014-11-07 22:53:01 -0800
Commit: 7e9d975, github.com/apache/spark/pull/3095
[SPARK-4304] [PySpark] Fix sort on empty RDD
Davies Liu <davies@databricks.com>
2014-11-07 20:53:03 -0800
Commit: 7779109, github.com/apache/spark/pull/3162
MAINTENANCE: Automated closing of pull requests.
Patrick Wendell <pwendell@gmail.com>
2014-11-07 13:08:25 -0800
Commit: 5923dd9, github.com/apache/spark/pull/3016
Update JavaCustomReceiver.java
xiao321 <1042460381@qq.com>
2014-11-07 12:56:49 -0800
Commit: 7c9ec52, github.com/apache/spark/pull/3153
[SPARK-4292][SQL] Result set iterator bug in JDBC/ODBC
wangfei <wangfei1@huawei.com>
2014-11-07 12:55:11 -0800
Commit: d6e5552, github.com/apache/spark/pull/3149
[SPARK-4203][SQL] Partition directories in random order when inserting into hive table
Matthew Taylor <matthew.t@tbfe.net>
2014-11-07 12:53:08 -0800
Commit: ac70c97, github.com/apache/spark/pull/3076
[SPARK-4270][SQL] Fix Cast from DateType to DecimalType.
Takuya UESHIN <ueshin@happy-camper.st>
2014-11-07 12:30:47 -0800
Commit: a6405c5, github.com/apache/spark/pull/3134
[SPARK-4272] [SQL] Add more unwrapper functions for primitive type in TableReader
Cheng Hao <hao.cheng@intel.com>
2014-11-07 12:15:53 -0800
Commit: 60ab80f, github.com/apache/spark/pull/3136
[SPARK-4213][SQL] ParquetFilters - No support for LT, LTE, GT, GTE operators
Kousuke Saruta <sarutak@oss.nttdata.co.jp>
2014-11-07 11:56:40 -0800
Commit: 14c54f1, github.com/apache/spark/pull/3083
[SQL] Modify keyword val location according to ordering
Jacky Li <jacky.likun@gmail.com>
2014-11-07 11:52:08 -0800
Commit: 68609c5, github.com/apache/spark/pull/3080
[SQL] Support ScalaReflection of schema in different universes
Michael Armbrust <michael@databricks.com>
2014-11-07 11:51:20 -0800
Commit: 8154ed7, github.com/apache/spark/pull/3096
[SPARK-4225][SQL] Resorts to SparkContext.version to inspect Spark version
Cheng Lian <lian@databricks.com>
2014-11-07 11:45:25 -0800
Commit: 86e9eaa, github.com/apache/spark/pull/3105
[SQL][DOC][Minor] Spark SQL Hive now supports dynamic partitioning
wangfei <wangfei1@huawei.com>
2014-11-07 11:43:35 -0800
Commit: 636d7bc, github.com/apache/spark/pull/3127
[SPARK-4187] [Core] Switch to binary protocol for external shuffle service messages
Aaron Davidson <aaron@databricks.com>
2014-11-07 09:42:21 -0800
Commit: d4fa04e, github.com/apache/spark/pull/3146
[SPARK-4204][Core][WebUI] Change Utils.exceptionString to contain the inner exceptions and make the error information in Web UI more friendly
zsxwing <zsxwing@gmail.com>
2014-11-06 21:52:12 -0800
Commit: 3abdb1b, github.com/apache/spark/pull/3073
[SPARK-4236] Cleanup removed applications' files in shuffle service
Aaron Davidson <aaron@databricks.com>
2014-11-06 19:54:32 -0800
Commit: 48a19a6, github.com/apache/spark/pull/3126
[SPARK-4188] [Core] Perform network-level retry of shuffle file fetches
Aaron Davidson <aaron@databricks.com>
2014-11-06 18:39:14 -0800
Commit: f165b2b, github.com/apache/spark/pull/3101
[SPARK-4277] Support external shuffle service on Standalone Worker
Aaron Davidson <aaron@databricks.com>
2014-11-06 17:20:46 -0800
Commit: 6e9ef10, github.com/apache/spark/pull/3142
[SPARK-3797] Minor addendum to Yarn shuffle service
Andrew Or <andrew@databricks.com>
2014-11-06 17:18:49 -0800
Commit: 96136f2, github.com/apache/spark/pull/3144
[HOT FIX] Make distribution fails
Andrew Or <andrew@databricks.com>
2014-11-06 15:31:07 -0800
Commit: 470881b, github.com/apache/spark/pull/3145
[SPARK-4249][GraphX] fix a problem of EdgePartitionBuilder in GraphX
lianhuiwang <lianhuiwang09@gmail.com>
2014-11-06 10:46:45 -0800
Commit: d15c6e9, github.com/apache/spark/pull/3138
[SPARK-4264] Completion iterator should only invoke callback once
Aaron Davidson <aaron@databricks.com>
2014-11-06 10:45:46 -0800
Commit: 23eaf0e, github.com/apache/spark/pull/3128
[SPARK-4186] add binaryFiles and binaryRecords in Python
Davies Liu <davies@databricks.com>
2014-11-06 00:22:19 -0800
Commit: b41a39e, github.com/apache/spark/pull/3078
[SPARK-4255] Fix incorrect table striping
Kay Ousterhout <kayousterhout@gmail.com>
2014-11-06 00:03:03 -0800
Commit: 5f27ae1, github.com/apache/spark/pull/3117
[SPARK-4137] [EC2] Don't change working dir on user
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-11-05 20:45:35 -0800
Commit: db45f5a, github.com/apache/spark/pull/2988
[SPARK-4262][SQL] add .schemaRDD to JavaSchemaRDD
Xiangrui Meng <meng@databricks.com>
2014-11-05 19:56:16 -0800
Commit: 3d2b5bc, github.com/apache/spark/pull/3125
[SPARK-4254] [mllib] MovieLensALS bug fix
Joseph K. Bradley <joseph@databricks.com>
2014-11-05 19:51:18 -0800
Commit: c315d13, github.com/apache/spark/pull/3116
[SPARK-4158] Fix for missing resources.
Brenden Matthews <brenden@diddyinc.com>
2014-11-05 16:02:44 -0800
Commit: cb0eae3, github.com/apache/spark/pull/3024
SPARK-3223 runAsSparkUser cannot change HDFS write permission properly i...
Jongyoul Lee <jongyoul@gmail.com>
2014-11-05 15:49:42 -0800
Commit: f7ac8c2, github.com/apache/spark/pull/3034
SPARK-4040. Update documentation to exemplify use of local (n) value, fo...
jay@apache.org <jayunit100>
2014-11-05 15:45:34 -0800
Commit: 868cd4c, github.com/apache/spark/pull/2964
[SPARK-3797] Run external shuffle service in Yarn NM
Andrew Or <andrew@databricks.com>
2014-11-05 15:42:05 -0800
Commit: 61a5cce, github.com/apache/spark/pull/3082
SPARK-4222 [CORE] use readFully in FixedLengthBinaryRecordReader
industrial-sloth <industrial-sloth@users.noreply.github.com>
2014-11-05 15:38:48 -0800
Commit: f37817b, github.com/apache/spark/pull/3093
[SPARK-3984] [SPARK-3983] Fix incorrect scheduler delay and display task deserialization time in UI
Kay Ousterhout <kayousterhout@gmail.com>
2014-11-05 15:30:31 -0800
Commit: a46497e, github.com/apache/spark/pull/2832
[SPARK-4242] [Core] Add SASL to external shuffle service
Aaron Davidson <aaron@databricks.com>
2014-11-05 14:38:43 -0800
Commit: 4c42986, github.com/apache/spark/pull/3108
[SPARK-4197] [mllib] GradientBoosting API cleanup and examples in Scala, Java
Joseph K. Bradley <joseph@databricks.com>
2014-11-05 10:33:13 -0800
Commit: 5b3b6f6, github.com/apache/spark/pull/3094
[SPARK-4029][Streaming] Update streaming driver to reliably save and recover received block metadata on driver failures
Tathagata Das <tathagata.das1565@gmail.com>
2014-11-05 01:21:53 -0800
Commit: 5f13759, github.com/apache/spark/pull/3026
[SPARK-3964] [MLlib] [PySpark] add Hypothesis test Python API
Davies Liu <davies@databricks.com>
2014-11-04 21:35:52 -0800
Commit: c8abddc, github.com/apache/spark/pull/3091
[SQL] Add String option for DSL AS
Michael Armbrust <michael@databricks.com>
2014-11-04 18:14:28 -0800
Commit: 515abb9, github.com/apache/spark/pull/3097
[SPARK-2938] Support SASL authentication in NettyBlockTransferService
Aaron Davidson <aaron@databricks.com>
2014-11-04 16:15:38 -0800
Commit: 5e73138, github.com/apache/spark/pull/3087
[SPARK-4060] [MLlib] exposing special rdd functions to the public
Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
2014-11-04 09:57:03 -0800
Commit: f90ad5d, github.com/apache/spark/pull/2907
fixed MLlib Naive-Bayes java example bug
Dariusz Kobylarz <darek.kobylarz@gmail.com>
2014-11-04 09:53:43 -0800
Commit: bcecd73, github.com/apache/spark/pull/3081
[SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by default.
Davies Liu <davies@databricks.com>
2014-11-03 23:56:14 -0800
Commit: e4f4263, github.com/apache/spark/pull/2920
[SPARK-4166][Core] Add a backward compatibility test for ExecutorLostFailure
zsxwing <zsxwing@gmail.com>
2014-11-03 22:47:45 -0800
Commit: b671ce0, github.com/apache/spark/pull/3085
[SPARK-4163][Core] Add a backward compatibility test for FetchFailed
zsxwing <zsxwing@gmail.com>
2014-11-03 22:40:43 -0800
Commit: 9bdc841, github.com/apache/spark/pull/3086
[SPARK-3573][MLLIB] Make MLlib's Vector compatible with SQL's SchemaRDD
Xiangrui Meng <meng@databricks.com>
2014-11-03 22:29:48 -0800
Commit: 1a9c6cd, github.com/apache/spark/pull/3070
[SPARK-4192][SQL] Internal API for Python UDT
Xiangrui Meng <meng@databricks.com>
2014-11-03 19:29:11 -0800
Commit: 04450d1, github.com/apache/spark/pull/3068
[FIX][MLLIB] fix seed in BaggedPointSuite
Xiangrui Meng <meng@databricks.com>
2014-11-03 18:50:37 -0800
Commit: c5912ec, github.com/apache/spark/pull/3084
[SPARK-611] Display executor thread dumps in web UI
Josh Rosen <joshrosen@databricks.com>
2014-11-03 18:18:47 -0800
Commit: 4f035dd, github.com/apache/spark/pull/2944
[SPARK-4168][WebUI] web stages number should show correctly when stages are more than 1000
Zhang, Liye <liye.zhang@intel.com>
2014-11-03 18:17:32 -0800
Commit: 97a466e, github.com/apache/spark/pull/3035
[SQL] Convert arguments to Scala UDFs
Michael Armbrust <michael@databricks.com>
2014-11-03 18:04:51 -0800
Commit: 15b58a2, github.com/apache/spark/pull/3077
SPARK-4178. Hadoop input metrics ignore bytes read in RecordReader insta...
Sandy Ryza <sandy@cloudera.com>
2014-11-03 15:19:01 -0800
Commit: 2812815, github.com/apache/spark/pull/3045
[SQL] More aggressive defaults
Michael Armbrust <michael@databricks.com>
2014-11-03 14:08:27 -0800
Commit: 25bef7e, github.com/apache/spark/pull/3064
[SPARK-4152] [SQL] Avoid data change in CTAS while table already existed
Cheng Hao <hao.cheng@intel.com>
2014-11-03 13:59:43 -0800
Commit: e83f13e, github.com/apache/spark/pull/3013
[SPARK-4202][SQL] Simple DSL support for Scala UDF
Cheng Lian <lian@databricks.com>
2014-11-03 13:20:33 -0800
Commit: c238fb4, github.com/apache/spark/pull/3067
[SPARK-3594] [PySpark] [SQL] take more rows to infer schema or sampling
Davies Liu <davies.liu@gmail.com>, Davies Liu <davies@databricks.com>
2014-11-03 13:17:09 -0800
Commit: 24544fb, github.com/apache/spark/pull/2716
[SPARK-4207][SQL] Query which has syntax like 'not like' is not working in Spark SQL
ravipesala <ravindra.pesala@huawei.com>
2014-11-03 13:07:41 -0800
Commit: 2b6e1ce, github.com/apache/spark/pull/3075
[SPARK-4211][Build] Fixes hive.version in Maven profile hive-0.13.1
fi <coderfi@gmail.com>
2014-11-03 12:56:56 -0800
Commit: df607da, github.com/apache/spark/pull/3072
[SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample
Xiangrui Meng <meng@databricks.com>
2014-11-03 12:24:24 -0800
Commit: 3cca196, github.com/apache/spark/pull/3010
[EC2] Factor out Mesos spark-ec2 branch
Nicholas Chammas <nicholas.chammas@gmail.com>
2014-11-03 09:02:35 -0800
Commit: 2aca97c, github.com/apache/spark/pull/3008