ODIn/spark-instrumented-optimizer

Author	SHA1	Message	Date
Matei Zaharia	6601a6212b	Added a unit test for cross-partition balancing in sort, and changes to RangePartitioner to make it pass. It turns out that the first partition was always kind of small due to how we picked partition boundaries.	2012-08-03 16:40:45 -04:00
Harvey	1170de3757	Fix for partitioning when sorting in descending order	2012-08-03 16:40:38 -04:00
Paul Cavallaro	d05c0f97ca	Logging Throwables in Info and Debug Logging Throwables in logInfo and logDebug instead of swallowing them. Conflicts: core/src/main/scala/spark/Logging.scala	2012-08-03 16:40:21 -04:00
Matei Zaharia	71a958b0b7	Merge branch 'dev' of github.com:mesos/spark into dev Conflicts: project/SparkBuild.scala	2012-08-02 17:23:13 -04:00
Denny	7312a5c30f	Use spray's implicit Marshaller for Futures.	2012-08-02 14:11:27 -07:00
Denny	ba7e30fb5e	Mostly stlyistic changes.	2012-08-02 13:55:09 -07:00
Shivaram Venkataraman	1a07bb9ba4	Avoid an extra partition copy by passing an iterator to blockManager.put	2012-08-02 12:22:33 -07:00
Shivaram Venkataraman	6790908b11	Use maxMemory to better estimate memory available for BlockManager cache	2012-08-02 12:05:05 -07:00
Denny	863c31b7c1	Moved resources into static folder	2012-08-02 09:48:36 -07:00
Denny	6c670c37dd	Webui improvements.	2012-08-01 19:47:57 -07:00
Denny	1b29e90a79	merge dev branch	2012-08-01 14:06:09 -07:00
Denny	011220fa55	Compact job page.	2012-08-01 11:26:45 -07:00
Denny	7a295fee96	Spark WebUI Implementation.	2012-08-01 11:01:09 -07:00
Matei Zaharia	3ee2530c0c	Merge branch 'block-manager-fix' into dev	2012-07-30 13:58:46 -07:00
Matei Zaharia	400221f851	Merge branch 'dev' of git://github.com/tdas/spark into dev	2012-07-30 13:54:57 -07:00
Matei Zaharia	ed1b0f8388	Made BlockManagerMaster no longer be a singleton. Also cleaned up a few formatting things throughout block manager code.	2012-07-30 13:53:47 -07:00
Matei Zaharia	f471c82558	Various reorganization and formatting fixes	2012-07-30 11:24:01 -07:00
Imran Rashid	f7149c5e46	tasks cannot access value of accumulator	2012-07-28 20:16:17 -07:00
Imran Rashid	244cbbe33a	one more minor cleanup to scaladoc	2012-07-28 20:16:10 -07:00
Imran Rashid	3b392c67db	fix up scaladoc, naming of type parameters	2012-07-28 20:16:01 -07:00
Imran Rashid	f1face1ea9	rename addToAccum to addAccumulator	2012-07-28 20:16:01 -07:00
Imran Rashid	2d666b9d76	add some functionality to Vector, delete copy in AccumulatorSuite	2012-07-28 20:15:51 -07:00
Imran Rashid	edc6972f8e	move Vector class into core and spark.util package	2012-07-28 20:15:42 -07:00
Imran Rashid	83659af11c	Accumulator now inherits from Accumulable, whcih simplifies a bunch of other things (eg., no +:=) Conflicts: core/src/main/scala/spark/Accumulators.scala	2012-07-28 20:13:51 -07:00
Imran Rashid	79d58ed20a	improve scaladoc	2012-07-28 20:12:41 -07:00
Imran Rashid	ae07f3864c	add Accumulatable, add corresponding docs & tests for accumulators	2012-07-28 20:12:41 -07:00
Matei Zaharia	dee8ff1b9d	Added a second version of union() without varargs.	2012-07-27 16:27:52 -07:00
Tathagata Das	cf429699e1	Updated the new checkpoint RDD to remember partitioning of the original RDD.	2012-07-27 23:16:37 +00:00
Matei Zaharia	b51d733a57	Fixed Java union methods having same erasure. Changed union() methods on lists to take a separate "first element" argument in order to differentiate them to the compiler, because Java 7 considered it an error to have them all take Lists parameterized with different types.	2012-07-27 12:23:27 -07:00
Tathagata Das	3e271c3b61	Merge branch 'dev' of github.com:tdas/spark into dev	2012-07-27 12:01:04 -07:00
Tathagata Das	024905f682	Added BlockRDD and a first-cut version of checkpoint() to RDD class.	2012-07-27 12:00:49 -07:00
Tathagata Das	d1eee44a03	Fixed more stuff in BoundedMemoryCache.	2012-07-27 18:33:32 +00:00
Tathagata Das	d1b7f41671	Fixed bug in BoundedMemoryCache.	2012-07-27 09:00:45 -07:00
Tathagata Das	435d129bec	Fixed bugs in block dropping code of MemoryStore and changed synchronized HashMap to ConcurrentHashMap in BlockManager.	2012-07-27 10:02:26 +00:00
Tathagata Das	0426769f89	Modified the block dropping code for better performance.	2012-07-26 20:53:45 -07:00
Matei Zaharia	5c5aa2ff81	Merge pull request #153 from JoshRosen/new-java-api Java API	2012-07-26 17:20:52 -07:00
Josh Rosen	c5e2810dc7	Add persist(), splits(), glom(), and mapPartitions() to Java API.	2012-07-26 12:46:47 -07:00
Josh Rosen	6a78e88237	Minor cleanup and optimizations in Java API. - Add override keywords. - Cache RDDs and counts in TC example. - Clean up JavaRDDLike's abstract methods.	2012-07-24 09:47:00 -07:00
Denny	4f4a34c025	Stlystic changes Conflicts: core/src/test/scala/spark/MesosSchedulerSuite.scala	2012-07-23 16:32:20 -07:00
Matei Zaharia	600e99728d	Fix a bug where an input path was added to a Hadoop job configuration twice	2012-07-23 16:16:19 -07:00
Josh Rosen	042dcbde33	Add type annotations to Java API methods. Add missing Scala Map to java.util.Map conversions.	2012-07-22 17:35:29 -07:00
Josh Rosen	e23938c3be	Use mapValues() in JavaPairRDD.cogroupResultToJava().	2012-07-22 15:10:01 -07:00
Josh Rosen	01dce3f569	Add Java API Add distinct() method to RDD. Fix bug in DoubleRDDFunctions.	2012-07-18 17:34:29 -07:00
Matei Zaharia	628bb5ca7f	Allow null keys in Spark's reduce and group by	2012-07-12 18:36:02 -07:00
Matei Zaharia	e2a67a8024	Fixes to coarse-grained Mesos scheduler in dealing with failed nodes	2012-07-12 18:21:52 -07:00
Matei Zaharia	be622cf867	Formatting	2012-07-11 17:31:44 -07:00
Matei Zaharia	e8ae77df24	Added more methods for loading/saving with new Hadoop API	2012-07-11 17:31:33 -07:00
Matei Zaharia	0a47284003	More work to allow Spark to run on the standalone deploy cluster.	2012-07-08 14:00:04 -07:00
Matei Zaharia	1aa63f775b	Added back coarse-grained Mesos scheduler based on StandaloneScheduler.	2012-07-08 10:52:13 -07:00
Matei Zaharia	c5cc10cda3	More work on standalone scheduler	2012-07-06 20:17:44 -07:00

1 2 3 4 5 ...

322 commits