Reynold Xin
ded10ce5b0
Merge pull request #103 from amplab/optimizations
...
Optimizations cherry-picked from SIGMOD branches
2013-12-15 17:25:33 -08:00
Reynold Xin
bad85b051d
Use murmur3 hash for open hashset.
...
(cherry picked from commit 212ff6834515543163aa63a3f4f762ebe641f8ca)
Signed-off-by: Ankur Dave <ankurdave@gmail.com>
2013-12-15 17:23:15 -08:00
Reynold Xin
62bdc44a1e
Unrolled while loop in readVarLong.
...
(cherry picked from commit 45ffb1ae3c0527aae50502741a3585c411875b9a)
Signed-off-by: Ankur Dave <ankurdave@gmail.com>
2013-12-15 17:23:15 -08:00
Ankur Dave
0459747c1c
Fix typo
2013-12-14 17:08:04 -08:00
Ankur Dave
8a56c1ff67
Merge pull request #84 from amatsukawa/graphlab_enhancements
...
GraphLab bug fix & set start vertex
2013-12-14 16:29:24 -08:00
Ankur Dave
1ad3d240b0
Remove commented code
2013-12-14 15:49:00 -08:00
Ankur Dave
7dbd3bf825
Remove debug printing from PregelSuite
2013-12-14 15:45:23 -08:00
Ankur Dave
e9cd634d05
Merge remote-tracking branch 'upstream/master' into mrTriplets-active-set
...
Updates standalone PageRank to work with the new API.
2013-12-14 15:44:35 -08:00
Ankur Dave
0c3fc1c1b6
Avoid re-creating the view RDDs multiple times
...
Previously, successive operations that support incremental view
maintenance would inadvertently recreate previous view RDDs by calling
VTableReplicated.get(), which created the RDDs anew even though they
were already cached. This change memoizes the RDDs and makes shipping
active-set information to an existing view a separate step.
2013-12-14 15:28:24 -08:00
Ankur Dave
59f625b745
Revert "Add debug logging to Pregel"
...
This reverts commit e62013cdd92137507a96b6a5b347a1d239209587.
2013-12-14 15:28:24 -08:00
Ankur Dave
ee5c69e481
Fix bug in VertexPartition.isActive
...
This took me ~5 hours to find!
2013-12-14 15:28:23 -08:00
Ankur Dave
4d3bba3a13
Add debug logging to Pregel
2013-12-14 15:28:23 -08:00
Ankur Dave
cf6288b993
Add PregelSuite
2013-12-14 15:28:23 -08:00
Ankur Dave
9c8b6224ec
Remove unused bound variable
2013-12-14 15:28:23 -08:00
Ankur Dave
ef17ab58cb
Add another mrTriplets test
2013-12-14 15:28:23 -08:00
Ankur Dave
b40824bc2b
Allow innerJoining VertexPartitions with different indexes
2013-12-14 15:28:23 -08:00
Ankur Dave
7a8952e9bb
Replace skipStale with activeSetOpt in mrTriplets (fails Pregel)
2013-12-14 15:28:23 -08:00
Ankur Dave
77b92748ad
Replace update with innerJoin (has a bug)
...
There is a conflict between vertices that did not change, and so are
not moved but still need to run, and vertices that were deleted by the
innerJoin, and so should not run.
2013-12-14 15:28:23 -08:00
Ankur Dave
d161caa6eb
Expose srcStale and dstStale
2013-12-14 15:28:23 -08:00
Ankur Dave
4bbae5c39b
Remove printlns from test
2013-12-14 15:28:23 -08:00
Ankur Dave
b2f595f552
Test more graph ops and skipStale
2013-12-14 15:28:22 -08:00
Ankur Dave
45ea674e21
Handle diffing VertexPartitions with different indexes
2013-12-14 15:28:22 -08:00
Ankur Dave
8f4b8e9b95
Reuse previous localVidMap if available
2013-12-14 15:28:22 -08:00
Ankur Dave
5e20cbaf66
Define localVidMap once per VTableReplicated
2013-12-14 15:28:22 -08:00
Ankur Dave
dc72147900
Fix bug in interaction of incremental view maintenance, skipStale, and join rewrite
2013-12-14 15:28:22 -08:00
Ankur Dave
d076fe5c9e
Finish up 49f7fc5ed2edd42b118182438d84ed7d4e7d521f
2013-12-14 15:28:22 -08:00
Ankur Dave
4f4f0add24
Fix bug in mapVertices and outerJoinVertices
2013-12-14 15:28:22 -08:00
Ankur Dave
6bf51a5e15
Don't expose {src,dst}Mask in EdgeTriplet
...
Exposing this information is not necessary due to the addition of
skipStale.
2013-12-14 15:28:22 -08:00
Ankur Dave
87f2909561
Incremental view maintenance for all graph ops
...
All GraphImpl operations now support incremental maintenance of the
replicated vertex view by reusing vTableReplicated whenever
possible. This has two consequences:
- Incremental view maintenance is no longer confined to the
  deltaJoinVertices operator, so we rename that operator to the more
  intuitive "updateVertices".
- This enables mrTriplets to support skipStale.
2013-12-14 15:28:22 -08:00
Ankur Dave
c3e0f01ad9
Remove calls to ClosureCleaner in GraphX
...
All Spark RDD methods already clean closures, so GraphX does not need to.
2013-12-14 15:28:21 -08:00
Ankur Dave
016cabceca
Clean up imports in EdgeRDD
2013-12-14 15:03:57 -08:00
Ankur Dave
d00cc8092b
Fix argument bug and closure capture
2013-12-14 15:03:57 -08:00
Ankur Dave
a8c7ebf0ed
Don't partition edges by default; refactor
...
Instead, expose Graph.partitionBy(PartitionStrategy).
2013-12-14 15:03:57 -08:00
Ankur Dave
1e98840128
Load edges in columnar format
...
In GraphLoader.edgeListFile, load edges directly into EdgePartitions,
avoiding repartitioning.
2013-12-14 15:01:51 -08:00
Reynold Xin
9bf192b01c
Merge pull request #91 from amplab/standalone-pagerank
...
Standalone PageRank
2013-12-14 12:52:18 -08:00
Reynold Xin
840af5e8e4
Merge pull request #99 from ankurdave/only-dynamic-pregel
...
Remove static Pregel; take maxIterations in dynamic Pregel
2013-12-14 12:51:51 -08:00
Ankur Dave
5b0d6f0ad5
Remove static Pregel; take maxIters in dynamic Pregel
2013-12-12 18:03:19 -08:00
Ankur Dave
3f69cdc81b
Use standalone PageRank in Analytics
2013-12-12 15:43:12 -08:00
Ankur Dave
a0fb477726
Test standalone PageRank
2013-12-12 15:42:55 -08:00
Ankur Dave
e8ba51d644
Add standalone PageRank using only GraphX operators
2013-12-12 15:42:27 -08:00
Reynold Xin
ce6ca4ea61
Merge pull request #97 from dcrankshaw/fix-rddtop
...
Added BoundedPriorityQueue kryo registrator. Fixes top issue.
2013-12-11 22:30:54 -08:00
Dan Crankshaw
12483d4ae6
Added BoundedPriorityQueue kryo registrator. Fixes top issue.
2013-12-11 20:56:21 -08:00
Reynold Xin
3abfbfb104
Merge pull request #92 from ankurdave/rdd-names
...
Set RDD names for easy debugging
2013-12-07 11:24:19 -08:00
Ankur Dave
84d0e1a334
Set RDD names for easy debugging
2013-12-07 04:05:45 -08:00
Ankur Dave
31e8a14e17
Merge pull request #90 from amplab/pregel-replicate-changed
...
Replicate only changed vertices
2013-12-06 21:49:55 -08:00
Reynold Xin
41721b1494
Fixed a bug in VTableReplicated where only the first block was processed.
2013-12-06 00:51:12 -08:00
Reynold Xin
3b0ee53eda
Minor update.
2013-12-05 23:30:56 -08:00
Reynold Xin
15168d6c4d
Fixed a bug in VTableReplicated where all the vertices were always broadcast.
2013-12-05 23:25:53 -08:00
Reynold Xin
a6075ba11f
Merge branch 'pregel-replicate-changed' of github.com:ankurdave/graphx into pregel-replicate-changed
2013-12-05 22:35:16 -08:00
Ankur Dave
b707861ba0
Simplify GraphImpl.deltaJoinVertices
2013-12-05 20:01:32 -08:00