Justin Ma
156bccbe23
HdfsFile.scala: added a try/catch block to exit gracefully for correupted gzip files
...
MesosScheduler.scala: formatted the slaveOffer() output to include the serialized task size
RDD.scala: added support for aggregating RDDs on a per-split basis
(aggregateSplit()) as well as for sampling without replacement (sample())
2010-08-18 15:25:57 -07:00
Matei Zaharia
75b2ca10c3
Removed HOD from included Hadoop because it was making the project count
...
as Python on GitHub :|.
2010-08-16 23:16:35 -07:00
Matei Zaharia
1cbffaae6f
Modified Scala interpreter to have it avoid computing string versions of
...
all results when :silent is enabled, so that it is easier to work with
large arrays in Spark. (The string version of an array of numbers might
not fit in memory even though the array itself does.)
2010-08-15 18:33:27 -07:00
Matei Zaharia
1600c31554
Added latest mesos.jar
2010-08-13 19:03:46 -07:00
Matei Zaharia
0b195927b6
Improved README and added blank templates for config files.
2010-08-13 18:54:32 -07:00
Matei Zaharia
3d8d7fd557
Bug fix from Justin
2010-08-13 11:29:19 -07:00
root
a9481c3514
Update to work with latest Mesos API changes
2010-08-13 07:39:36 +00:00
Matei Zaharia
4488b3bc8a
Fixed a bug where we would incorrectly decide we've finished a parallel operation if Mesos tells us a task is finished twice
2010-08-09 16:46:14 -07:00
Matei Zaharia
f415b071af
Change shell framework's name to "Spark shell"
2010-08-06 12:07:26 -07:00
Matei Zaharia
0e6e577fdf
Add Mesos native library to .gitignore
2010-07-25 23:54:56 -04:00
Matei Zaharia
b56ed67553
Updated code to work with Nexus->Mesos name change
2010-07-25 23:53:46 -04:00
Matei Zaharia
4239f76997
Removed Matei's old start on broadcast code
2010-07-25 23:46:44 -04:00
Matei Zaharia
e240e38ee9
Updated a bunch of libraries, and increased the default memory in run so
...
that unit tests can run successfully.
2010-07-25 21:10:03 -04:00
Matei Zaharia
0435de9e87
Made it possible to set various Spark options and environment variables
...
in general through a conf/spark-env.sh script.
2010-07-19 18:00:30 -07:00
Justin Ma
edad598684
Updated Spark to run with latest Mesos build and Scala-2.8.0.final.
2010-07-19 15:03:49 -07:00
Matei Zaharia
7d0eae17e3
Merge branch 'dev'
...
Conflicts:
src/scala/spark/HdfsFile.scala
src/scala/spark/NexusScheduler.scala
src/test/spark/repl/ReplSuite.scala
2010-06-27 15:21:54 -07:00
root
6aacaa6870
Made Spark shell class directory configurable.
2010-06-18 23:24:18 +00:00
Matei Zaharia
323571a177
Initial work on union operation.
2010-06-18 12:54:33 -07:00
Matei Zaharia
b54198819e
Added appropriate hashCode, equals and toString to ParallelArraySplit.
2010-06-17 13:19:02 -07:00
Matei Zaharia
cd247b7d86
Created common RDD superclass for distributed files and parallel arrays.
...
This also means that parallel arrays now get all the functionality files
used to have (filter, map, reduce, cache, etc).
2010-06-17 12:49:42 -07:00
Matei Zaharia
77103eab2a
Fixed README
2010-06-11 14:55:23 -07:00
Matei Zaharia
0d9c51d473
Added back REPL tests
2010-06-11 10:03:01 -07:00
Matei Zaharia
e58fba279d
Fix junk stripper
2010-06-11 01:18:43 -07:00
Matei Zaharia
396f48e5a4
New interpreter port for Scala 2.8 interpreter
2010-06-11 01:10:03 -07:00
Matei Zaharia
4eb39e0c8a
New nexus.jar
2010-06-10 22:41:23 -07:00
Matei Zaharia
1473987fb7
Fixed classpath for tests
2010-06-10 22:36:45 -07:00
Matei Zaharia
359e84c585
Use new Nexus API
2010-06-10 22:09:13 -07:00
Matei Zaharia
92246c843b
Initial work on 2.8 port
2010-06-10 21:50:55 -07:00
Matei Zaharia
c177a546a5
Ignore .DS_Store
2010-06-10 18:08:59 -07:00
root
1c90a32621
Fix native build to use build directory
2010-04-30 22:41:21 +00:00
Matei Zaharia
06aac8a889
Imported changes from old repository (mostly Mosharaf's work,
...
plus some fault tolerance code).
2010-04-03 23:44:55 -07:00
Matei Zaharia
df29d0ea4c
Initial commit
2010-03-29 16:17:55 -07:00