Commit graph

9 commits

Author SHA1 Message Date
Matei Zaharia b84769a107 Modified project structure to work with buildr 2010-11-13 17:18:05 -08:00
Matei Zaharia 630a982b88 Added a getId method to split to force classes to specify a unique ID
for each split. This replaces the previous method of calling
split.toString, which would produce different results for the same split
each time it is deserialized (because the default implementation returns
the Java object's address).
2010-10-07 17:17:07 -07:00
Justin Ma 7a9ff1cc9a - Got rid of 'Split' type parameter in RDD
- Added SampledRDD, SplitRDD and CartesianRDD
- Made Split a class rather than a type parameter
- Added numCores() to Scheduler to help set default level of parallelism
2010-08-31 12:08:09 -07:00
Matei Zaharia b56ed67553 Updated code to work with Nexus->Mesos name change 2010-07-25 23:53:46 -04:00
Matei Zaharia 7d0eae17e3 Merge branch 'dev'
Conflicts:
	src/scala/spark/HdfsFile.scala
	src/scala/spark/NexusScheduler.scala
	src/test/spark/repl/ReplSuite.scala
2010-06-27 15:21:54 -07:00
Matei Zaharia b54198819e Added appropriate hashCode, equals and toString to ParallelArraySplit. 2010-06-17 13:19:02 -07:00
Matei Zaharia cd247b7d86 Created common RDD superclass for distributed files and parallel arrays.
This also means that parallel arrays now get all the functionality files
used to have (filter, map, reduce, cache, etc).
2010-06-17 12:49:42 -07:00
Matei Zaharia 92246c843b Initial work on 2.8 port 2010-06-10 21:50:55 -07:00
Matei Zaharia df29d0ea4c Initial commit 2010-03-29 16:17:55 -07:00