ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Matei Zaharia	0fa5809768	Updated docs for SparkConf and handled review comments	2013-12-30 22:17:28 -05:00
Matei Zaharia	994f080f8a	Properly show Spark properties on web UI, and change app name property	2013-12-29 22:19:33 -05:00
Matei Zaharia	eaa8a68ff0	Fix some Python docs and make sure to unset SPARK_TESTING in Python tests so we don't get the test spark.conf on the classpath.	2013-12-29 20:15:07 -05:00
Matei Zaharia	11540b798d	Added tests for SparkConf and fixed a bug Typesafe Config caches system properties the first time it's invoked by default, ignoring later changes unless you do something special	2013-12-29 18:44:06 -05:00
Matei Zaharia	1ee7f5aee4	Fix a change that was lost during merge	2013-12-29 18:15:46 -05:00
Matei Zaharia	0bd1900cbc	Fix a few settings that were being read as system properties after merge	2013-12-29 15:38:46 -05:00
Matei Zaharia	b4ceed40d6	Merge remote-tracking branch 'origin/master' into conf2 Conflicts: core/src/main/scala/org/apache/spark/SparkContext.scala core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala core/src/main/scala/org/apache/spark/scheduler/local/LocalScheduler.scala core/src/main/scala/org/apache/spark/util/MetadataCleaner.scala core/src/test/scala/org/apache/spark/scheduler/TaskResultGetterSuite.scala core/src/test/scala/org/apache/spark/scheduler/TaskSetManagerSuite.scala new-yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala streaming/src/test/scala/org/apache/spark/streaming/BasicOperationsSuite.scala streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala streaming/src/test/scala/org/apache/spark/streaming/InputStreamsSuite.scala streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala streaming/src/test/scala/org/apache/spark/streaming/WindowOperationsSuite.scala	2013-12-29 15:08:08 -05:00
Matei Zaharia	58c6fa2041	Add Python docs about SparkConf	2013-12-29 14:46:59 -05:00
Matei Zaharia	615fb649d6	Fix some other Python tests due to initializing JVM in a different way The test in context.py created two different instances of the SparkContext class by copying "globals", so that some tests can have a global "sc" object and others can try initializing their own contexts. This led to two JVM gateways being created since SparkConf also looked at pyspark.context.SparkContext to get the JVM.	2013-12-29 14:32:05 -05:00
Matei Zaharia	cd00225db9	Add SparkConf support in Python	2013-12-29 14:03:39 -05:00
Matei Zaharia	1c11f54a9b	Fix Python use of getLocalDir	2013-12-29 00:11:36 -05:00
Matei Zaharia	20631348d1	Fix other failing tests	2013-12-28 23:17:58 -05:00
Matei Zaharia	0900d5c72a	Add a StreamingContext constructor that takes a conf object	2013-12-28 21:38:07 -05:00
Matei Zaharia	a8f316386a	Fix CheckpointSuite test failures	2013-12-28 21:26:43 -05:00
Matei Zaharia	578bd1fc28	Fix test failures due to setting / clearing clock type in Streaming	2013-12-28 21:21:06 -05:00
Matei Zaharia	5bbe73864e	Fix Executor not getting properties in local mode	2013-12-28 17:31:58 -05:00
Matei Zaharia	a16c52ed1b	Check for SPARK_YARN_MODE through a system property too since it can sometimes be set that way (undoes a change in previous commit)	2013-12-28 17:24:21 -05:00
Matei Zaharia	642029e7f4	Various fixes to configuration code - Got rid of global SparkContext.globalConf - Pass SparkConf to serializers and compression codecs - Made SparkConf public instead of private[spark] - Improved API of SparkContext and SparkConf - Switched executor environment vars to be passed through SparkConf - Fixed some places that were still using system properties - Fixed some tests, though others are still failing This still fails several tests in core, repl and streaming, likely due to properties not being set or cleared correctly (some of the tests run fine in isolation).	2013-12-28 17:13:15 -05:00
Matei Zaharia	ad3dfd1531	Merge pull request #307 from kayousterhout/other_failure Removed unused OtherFailure TaskEndReason. The OtherFailure TaskEndReason was added by @mateiz 3 years ago in this commit: `24a1e7f838` Unless I am missing something, it doesn't seem to have been used then, and is not used now, so seems safe for deletion.	2013-12-27 22:10:14 -05:00
Matei Zaharia	b579b83277	Merge pull request #306 from kayousterhout/remove_pending Remove unused hasPendingTasks methods	2013-12-27 22:09:04 -05:00
Kay Ousterhout	e17d7518ab	Removed unused OtherFailure TaskEndReason.	2013-12-27 15:51:27 -08:00
Kay Ousterhout	8419148e5f	Remove unused hasPendingTasks methods	2013-12-27 15:19:42 -08:00
Patrick Wendell	19672dca32	Merge pull request #305 from kayousterhout/line_spacing Fixed >100char lines in DAGScheduler.scala There's no changed functionality here -- only line spacing and one grammatical fix in a comment.	2013-12-27 13:37:10 -08:00
Kay Ousterhout	0c71ffe924	Style fixes as per Reynold's review	2013-12-27 12:19:38 -08:00
Kay Ousterhout	8c81068e16	Fixed >100char lines in DAGScheduler.scala	2013-12-27 11:36:54 -08:00
Reynold Xin	7be1e57786	Merge pull request #298 from aarondav/minor Minor: Decrease margin of left side of Log page Before ![before](https://f.cloud.github.com/assets/1400247/1812647/1a4be53e-6e87-11e3-9d5b-f851274be0e9.png) After ![after](https://f.cloud.github.com/assets/1400247/1812648/1ca1ea2c-6e87-11e3-946c-31be9258f450.png) It's a start anyway...	2013-12-26 23:41:40 -10:00
Reynold Xin	7d811ba6f2	Merge pull request #302 from pwendell/SPARK-1007 SPARK-1007: spark-class2.cmd should change SCALA_VERSION to be 2.10 Reported by Qiuzhuang Lian	2013-12-26 23:39:58 -10:00
Patrick Wendell	0cc1e0d43d	SPARK-1007: spark-class2.cmd should change SCALA_VERSION to be 2.10	2013-12-26 23:21:08 -08:00
Matei Zaharia	5e69fc5bb4	Merge pull request #295 from markhamstra/JobProgressListenerNPE Avoid a lump of coal (NPE) in JobProgressListener's stocking.	2013-12-26 19:10:39 -05:00
Aaron Davidson	4f2fb761b0	Decrease margin of left side of log page	2013-12-26 15:38:45 -08:00
Matei Zaharia	e240bad03b	Merge pull request #296 from witgo/master Renamed ClusterScheduler to TaskSchedulerImpl for yarn and new-yarn package	2013-12-26 12:30:48 -05:00
liguoqiang	b662c88a24	fix this import order	2013-12-26 15:49:33 +08:00
Mark Hamstra	c529dceaff	Avoid a lump of coal (NPE) in JobProgressListener's stocking.	2013-12-25 23:10:02 -08:00
Matei Zaharia	c344ed04c7	Merge pull request #283 from tmyklebu/master Python bindings for mllib This pull request contains Python bindings for the regression, clustering, classification, and recommendation tools in mllib. For each 'train' frontend exposed, there is a Scala stub in PythonMLLibAPI.scala and a Python stub in mllib.py. The Python stub serialises the input RDD and any vector/matrix arguments into a mutually-understood format and calls the Scala stub. The Scala stub deserialises the RDD and the vector/matrix arguments, calls the appropriate 'train' function, serialises the resulting model, and returns the serialised model. ALSModel is slightly different since a MatrixFactorizationModel has RDDs inside. The Scala stub returns a handle to a Scala MatrixFactorizationModel; prediction is done by calling the Scala predict method. I have tested these bindings on an x86_64 machine running Linux. There is a risk that these bindings may fail on some choose-your-own-endian platform if Python's endian differs from java.nio.ByteBuffer's idea of the native byte order.	2013-12-26 01:31:06 -05:00
liguoqiang	2bd76f693d	Renamed ClusterScheduler to TaskSchedulerImpl for yarn and new-yarn	2013-12-26 11:10:35 +08:00
liguoqiang	14fcef72db	Renamed ClusterScheduler to TaskSchedulerImpl for yarn and new-yarn	2013-12-26 11:05:07 +08:00
Tor Myklebust	9cbcf81453	Remove commented code in __init__.py.	2013-12-25 14:12:42 -05:00
Tor Myklebust	5e71354cb7	Fix copypasta in __init__.py. Don't import anything directly into pyspark.mllib.	2013-12-25 14:10:55 -05:00
Matei Zaharia	56094bcd8d	Merge pull request #290 from ash211/patch-3 Typo: avaiable -> available	2013-12-25 13:14:33 -05:00
Reynold Xin	4842a07da8	Merge pull request #287 from azuryyu/master Fixed job name in the java streaming example.	2013-12-25 01:52:15 -08:00
Tor Myklebust	02208a175c	Initial weights in Scala are ones; do that too. Also fix some errors.	2013-12-25 00:53:48 -05:00
Tor Myklebust	4e821390bc	Scala stubs for updated Python bindings.	2013-12-25 00:09:00 -05:00
Tor Myklebust	05163057a1	Split the mllib bindings into a whole bunch of modules and rename some things.	2013-12-25 00:08:05 -05:00
Andrew Ash	3665c722b5	Typo: avaiable -> available	2013-12-24 17:25:04 -08:00
Patrick Wendell	85a344b4f0	Merge pull request #127 from kayousterhout/consolidate_schedulers Deduplicate Local and Cluster schedulers. The code in LocalScheduler/LocalTaskSetManager was nearly identical to the code in ClusterScheduler/ClusterTaskSetManager. The redundancy made making updating the schedulers unnecessarily painful and error- prone. This commit combines the two into a single TaskScheduler/ TaskSetManager. Unfortunately the diff makes this change look much more invasive than it is -- TaskScheduler.scala is only superficially changed (names updated, overrides removed) from the old ClusterScheduler.scala, and the same with TaskSetManager.scala. Thanks @rxin for suggesting this change!	2013-12-24 16:35:06 -08:00
Patrick Wendell	c2dd6bcd6e	Merge pull request #279 from aarondav/shuffle-cleanup0 Clean up shuffle files once their metadata is gone Previously, we would only clean the in-memory metadata for consolidated shuffle files. Additionally, fixes a bug where the Metadata Cleaner was ignoring type-specific TTLs.	2013-12-24 14:36:47 -08:00
Kay Ousterhout	1efe3adf56	Responded to Reynold's style comments	2013-12-24 14:18:39 -08:00
Tor Myklebust	86e38c4942	Remove useless line from test stub.	2013-12-24 16:49:31 -05:00
Tor Myklebust	4efec6eb94	Python change for move of PythonMLLibAPI.	2013-12-24 16:49:03 -05:00
Tor Myklebust	58e2a7d6d4	Move PythonMLLibAPI into its own package.	2013-12-24 16:48:40 -05:00

1 2 3 4 5 ...

4998 commits