Removed TaskSchedulerListener interface.
The interface was used only by the DAG scheduler, so there was no need
to define a separate interface, and the naming made the code confusing
to read: "listener" described the DAG scheduler rather than
SparkListeners, which implement a nearly identical interface but serve
a different function.
@mateiz - is there a reason for this interface that I'm missing?
Previously, (vTableReplicated: IndexedRDD[Pid, VertexHashMap[VD]])
stored one hashmap per partition, taking Vid directly to VD.
To take advantage of rxin's new hashmaps (see
rxin/incubator-spark@32a79d6d13), this
commit splits that data structure into two RDDs:
(vTableReplicationMap: IndexedRDD[Pid, VertexIdToIndexMap]) stores a map
per partition from vertex ID to the index where that vertex's attribute
is stored. This index refers to an array in the same partition in
vTableReplicatedValues.
(vTableReplicatedValues: IndexedRDD[Pid, Array[VD]]) stores the vertex
data and is arranged as described above.
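For illustration, a minimal sketch of the two-level lookup this split enables. Plain Scala collections stand in for the IndexedRDD partitions, and the values are hypothetical; only the shape of the two structures mirrors the commit.

```scala
// Illustrative only: Pid/Vid/VD and the two per-partition structures
// mirror the commit's names, but plain Maps stand in for IndexedRDDs.
object ReplicatedVertexSketch {
  type Pid = Int
  type Vid = Long
  type VD  = String // example vertex attribute type

  // vTableReplicationMap analogue: per partition, vertex ID -> array index
  val replicationMap: Map[Pid, Map[Vid, Int]] =
    Map(0 -> Map(10L -> 0, 11L -> 1))

  // vTableReplicatedValues analogue: per partition, positionally indexed attributes
  val replicatedValues: Map[Pid, Array[VD]] =
    Map(0 -> Array("attrOf10", "attrOf11"))

  // A lookup resolves the vertex ID to an index, then reads the values array
  def lookup(pid: Pid, vid: Vid): Option[VD] =
    for {
      idxMap <- replicationMap.get(pid)
      idx    <- idxMap.get(vid)
      values <- replicatedValues.get(pid)
    } yield values(idx)

  def main(args: Array[String]): Unit =
    println(lookup(0, 11L)) // Some(attrOf11)
}
```

Presumably the split also lets the immutable ID-to-index map be reused across operations while only the value arrays change.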
Fixing the Spark Streaming example and a bug in the examples build.
- Examples assembly included a log4j.properties which clobbered Spark's
- Example had an error where some classes weren't serializable (see the sketch after this list)
- Did some other clean-up in this example
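On the serializability point, this is a classic Spark pitfall: a closure shipped to executors captures its enclosing object, which must then be serializable. A hedged sketch of the usual fix, with hypothetical class and variable names rather than the actual example code: either mark the helper class Serializable, or copy the captured field into a local val so the closure captures only that value.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical illustration: a closure passed to an RDD operation
// captures its enclosing object, so that object must be serializable
// (or the needed field copied into a local val first).
class Tokenizer(val sep: String) extends Serializable {
  def split(line: String): Array[String] = line.split(sep)
}

object SerializableExampleSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(
      new SparkConf().setAppName("sketch").setMaster("local[2]"))
    val tokenizer = new Tokenizer(" ")
    val sep = tokenizer.sep // local val: only the String is captured
    val words = sc.parallelize(Seq("a b", "c d")).flatMap(_.split(sep))
    println(words.count()) // 4
    sc.stop()
  }
}
```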
Serialize and restore spark.cleaner.ttl to checkpoint
In accordance with the conversation on the spark-dev mailing list, preserve the spark.cleaner.ttl parameter when serializing a checkpoint.
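A rough sketch of the idea, with illustrative field and method names (not Spark's actual Checkpoint API):

```scala
// Hypothetical sketch: capture spark.cleaner.ttl when the checkpoint is
// written, and merge it back into the configuration built on recovery.
class CheckpointTtlSketch(conf: Map[String, String]) extends Serializable {
  // preserved alongside the rest of the checkpoint data
  private val cleanerTtl: Option[String] = conf.get("spark.cleaner.ttl")

  // re-apply the saved TTL to the configuration assembled after restart
  def restore(newConf: Map[String, String]): Map[String, String] =
    cleanerTtl.fold(newConf)(ttl => newConf + ("spark.cleaner.ttl" -> ttl))
}
```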
Renamed StandaloneX to CoarseGrainedX.
(as suggested by @rxin here https://github.com/apache/incubator-spark/pull/14)
The previous names were confusing because the components weren't just
used in Standalone mode. The scheduler used for Standalone
mode is called SparkDeploySchedulerBackend, so referring to the base class
as StandaloneSchedulerBackend was misleading.
Unified daemon thread pools
As requested by @mateiz in an earlier pull request, this refactors the various daemon thread pools to use a shared set of helper methods in utils.scala. It also changes the thread-pool-creation methods in utils.scala to produce named thread pools, which makes debugging easier.
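A hedged sketch of the pattern (names are illustrative; the actual helpers live in utils.scala): one factory produces named daemon threads, and every pool-creation method goes through it, so threads are identifiable in stack dumps and never block JVM shutdown.

```scala
import java.util.concurrent.{Executors, ExecutorService, ThreadFactory}
import java.util.concurrent.atomic.AtomicInteger

// Illustrative sketch of unified, named daemon thread pools.
object DaemonThreadUtils {
  // All pools share this factory, so every thread is a named daemon
  def namedDaemonThreadFactory(prefix: String): ThreadFactory = new ThreadFactory {
    private val counter = new AtomicInteger(0)
    override def newThread(r: Runnable): Thread = {
      val t = new Thread(r, s"$prefix-${counter.incrementAndGet()}")
      t.setDaemon(true) // daemon threads never block JVM shutdown
      t
    }
  }

  // Fixed-size pool whose threads carry readable names in stack dumps
  def newDaemonFixedThreadPool(nThreads: Int, prefix: String): ExecutorService =
    Executors.newFixedThreadPool(nThreads, namedDaemonThreadFactory(prefix))
}
```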