Gavin Li
4508089fc3
refine comments and add sc.clean
2013-06-17 05:23:46 +00:00
Gavin Li
728665d34b
Merge branch 'enhance_pipe' of ssh://github.com/lyogavin/spark into enhance_pipe
2013-06-17 00:23:29 +00:00
Yu L Li
33c55ec713
Merge remote branch 'upstream/master' into enhance_pipe
2013-06-17 00:21:26 +00:00
Gavin Li
e6ae049283
Merge remote-tracking branch 'upstream1/master' into enhance_pipe
2013-06-16 22:53:39 +00:00
Gavin Li
fb6d733fa8
update according to comments
2013-06-16 22:32:55 +00:00
Christopher Nguyen
f91195cc15
Import just scala.math.abs rather than scala.math._
2013-06-16 01:29:53 -07:00
Christopher Nguyen
5c886194e4
Move zero-length partition testing from JavaAPISuite.java to PartitioningSuite.scala
2013-06-16 01:23:48 -07:00
Christopher Nguyen
479442a9b9
Add zeroLengthPartitions() test to make sure, e.g., StatCounter.scala can handle empty partitions without incorrectly returning NaN
2013-06-15 17:35:55 -07:00
Matei Zaharia
f961aac8b2
Merge pull request #649 from ryanlecompte/master
Add top K method to RDD using a bounded priority queue
2013-06-15 00:53:41 -07:00
ryanlecompte
e8801d4490
use delegation for BoundedPriorityQueue, add Java API
2013-06-14 23:39:05 -07:00
Andrew xia
53add598f2
Update LocalSchedulerSuite to avoid using sleep for task launch
2013-06-15 01:46:13 +08:00
Matei Zaharia
6602d94973
Merge pull request #651 from rxin/groupbykey
SPARK-772 / SPARK-774: groupByKey and cogroup should disable map side combine
2013-06-14 10:41:31 -07:00
Reynold Xin
2cc188fd54
SPARK-774: cogroup should also disable map side combine by default
2013-06-14 00:10:54 -07:00
Reynold Xin
6738178d0d
SPARK-772: groupByKey should disable map side combine.
2013-06-13 23:59:42 -07:00
ryanlecompte
93b3f5e535
drop unneeded ClassManifest implicit
2013-06-13 16:26:35 -07:00
ryanlecompte
44b8dbaede
use Iterator.single(elem) instead of Iterator(elem) for improved performance based on scaladocs
2013-06-13 16:23:15 -07:00
Matei Zaharia
d93851aedf
Merge pull request #645 from pwendell/compression
Adding compression to Hadoop save functions
2013-06-13 13:38:45 -07:00
Rohit Rai
b5b12823fa
Fixing the style as per feedback
2013-06-13 14:05:46 +05:30
Mingfei
967a6a699d
modify sparklister function interface according to comments
2013-06-13 14:36:07 +08:00
Matei Zaharia
f1da591a8a
Merge pull request #646 from markhamstra/jvmArgs
Fixed jvmArgs in maven build.
2013-06-12 17:55:08 -07:00
Mark Hamstra
3f96c6f27b
Fixed jvmArgs in maven build.
2013-06-12 17:24:22 -07:00
ryanlecompte
db5bca08ff
add a new top K method to RDD using a bounded priority queue
2013-06-12 10:54:16 -07:00
Matei Zaharia
dc4073654b
Revert "Fix start-slave not passing instance number to spark-daemon."
This reverts commit a674d67c0a.
2013-06-11 00:08:02 -04:00
Matei Zaharia
0e94b734be
Merge pull request #625 from stephenh/fix-start-slave
Fix start-slave not passing instance number to spark-daemon.
2013-06-10 13:00:31 -07:00
Patrick Wendell
fd6148c8b2
Removing print statement
2013-06-10 10:27:25 -07:00
Andrew xia
190ec61799
change code style and debug info
2013-06-10 15:27:02 +08:00
Patrick Wendell
ef14dc2e77
Adding Java-API version of compression codec
2013-06-09 18:09:46 -07:00
Patrick Wendell
df592192e7
Monads FTW
2013-06-09 18:09:24 -07:00
Patrick Wendell
083a3485ab
Clean extra whitespace
2013-06-09 11:49:33 -07:00
Patrick Wendell
d1bbcebae5
Adding compression to Hadoop save functions
2013-06-09 11:39:35 -07:00
Mingfei
ade822011d
not check return value of eventQueue.take
2013-06-08 16:26:45 +08:00
Matei Zaharia
74b91d53bc
Merge pull request #629 from c0s/master
Sometime Maven build runs out of PermGen space.
2013-06-08 01:19:40 -07:00
Mingfei
4fd86e0e10
delete test code for joblogger in SparkContext
2013-06-08 15:45:47 +08:00
Mingfei
362f0f93ac
Merge branch 'master' of https://github.com/mesos/spark
2013-06-08 15:20:13 +08:00
Mingfei
1a4d93c025
modify to pass job annotation by localProperties and use daemon thread to do joblogger's work
2013-06-08 14:23:39 +08:00
Matei Zaharia
b58a29295b
Small formatting and style fixes
2013-06-07 22:51:28 -07:00
Matei Zaharia
c8fc423bc2
Merge pull request #631 from jerryshao/master
Fix block manager UI display issue when enable spark.cleaner.ttl
2013-06-07 22:43:18 -07:00
Matei Zaharia
c9ca0a4a58
Small code style fix to SchedulingAlgorithm.scala
2013-06-07 22:40:44 -07:00
Matei Zaharia
1ae60bcb36
Merge pull request #634 from xiajunluan/master
[Spark-753] Fix ClusterSchedulSuite unit test failed
2013-06-07 22:39:06 -07:00
Gavin Li
e179ff8a32
update according to comments
2013-06-05 22:41:05 +00:00
Christopher Nguyen
9d35904357
In the current code, when both partitions happen to have zero-length, the return mean will be NaN.
Consequently, the result of mean after reducing over all partitions will also be NaN,
which is not correct if there are partitions with non-zero length. This patch fixes this issue.
2013-06-04 22:12:47 -07:00
Matei Zaharia
fff3728552
Merge pull request #640 from pwendell/timeout-update
Fixing bug in BlockManager timeout
2013-06-04 16:09:50 -07:00
Patrick Wendell
061fd3ae36
Fixing bug in BlockManager timeout
2013-06-04 19:02:44 -04:00
Matei Zaharia
f420d4f228
Merge pull request #639 from pwendell/timeout-update
Bump akka and blockmanager timeouts to 60 seconds
2013-06-04 15:25:58 -07:00
Patrick Wendell
8bd4e12104
Bump akka and blockmanager timeouts to 60 seconds
2013-06-04 18:14:24 -04:00
Konstantin Boudnik
d1286231e0
Sometime Maven build runs out of PermGen space.
2013-06-03 15:55:44 -07:00
Rohit Rai
b104c7f5c7
Example to write the output to cassandra
2013-06-03 15:15:52 +05:30
Rohit Rai
56c64c4033
A better way to read column value if you are sure the column exists in every row.
2013-06-03 12:48:35 +05:30
Andrew xia
606bb1b450
Fix schedulingAlgorithm bugs for unit test
2013-06-03 10:29:23 +08:00
Gavin Li
4a9913d66a
add ut for pipe enhancement
2013-06-02 23:21:09 +00:00