Reynold Xin
6fadff2b92
Converted for loops to while loops in EdgePartition.
2013-11-07 16:54:33 -08:00
Joseph E. Gonzalez
3e504938c2
merging upstream changes
2013-11-05 01:36:48 -08:00
Joseph E. Gonzalez
2dc9ec2387
Reverting to Array based (materialized) output of all VertexSetRDD operations.
2013-11-05 01:15:12 -08:00
Reynold Xin
551a43fd3d
Merge branch 'master' of github.com:apache/incubator-spark into mergemerge
...
Conflicts:
README.md
core/src/main/scala/org/apache/spark/util/collection/OpenHashMap.scala
core/src/main/scala/org/apache/spark/util/collection/OpenHashSet.scala
core/src/main/scala/org/apache/spark/util/collection/PrimitiveKeyOpenHashMap.scala
2013-11-04 21:02:36 -08:00
Joseph E. Gonzalez
e7d37472b8
After some testing I realized that the IndexedSeq is still instantiating the array (not maintaining a view) so I have replaced all IndexedSeq[V] with (Int => V)
2013-10-31 21:09:39 -07:00
Joseph E. Gonzalez
63311d9c72
renamed update to setMerge
2013-10-31 20:12:30 -07:00
Joseph E. Gonzalez
8381aeffb3
This commit introduces the OpenHashSet and OpenHashMap as indexing primitives.
...
Large parts of the VertexSetRDD were restructured to take advantage of:
1) the OpenHashSet as an index map
2) view based lazy mapValues and mapValuesWithVertices
3) the cogroup code is currently disabled (since it is not used in any of the tests)
The GraphImpl was updated to also use the OpenHashSet and PrimitiveOpenHashMap
wherever possible:
1) the LocalVidMaps (used to track replicated vertices) are now implemented
using the OpenHashSet
2) an OpenHashMap is temporarily constructed to combine the local OpenHashSet
with the local (replicated) vertex attribute arrays
3) because the OpenHashSet constructor grabs a class manifest all operations
that construct OpenHashSets have been moved to the GraphImpl Singleton to prevent
implicit variable capture within closures.
2013-10-31 18:13:02 -07:00
Joseph E. Gonzalez
aeb773fa47
Merging with upstream master.
2013-10-31 10:12:12 -07:00
Reynold Xin
3f3c727bc5
Merge pull request #41 from jegonzal/LineageTracking
...
Optimizing Graph Lineage
2013-10-31 09:52:25 -07:00
Joseph E. Gonzalez
d6b5122532
Switching to the @rxin BitSet implementation for VertexSet Value tables.
2013-10-31 01:44:24 -07:00
Dan Crankshaw
c430d2e21d
Added bitset to kryo register
2013-10-31 01:01:59 -07:00
Joseph E. Gonzalez
a3ce484a2c
Adding additional type constraints to VertexSetRDD to help diagnose issues with recent benchmarks.
2013-10-30 21:02:21 -07:00
Joseph E. Gonzalez
09ea661bbb
removing completely unnecessary map operation.
2013-10-30 20:07:26 -07:00
Joseph E. Gonzalez
003f8a505d
Removing potential additional shuffle dependency where an already partitioned RDD[(Vid, VD)] is repartitioned.
2013-10-30 20:06:54 -07:00
Joseph E. Gonzalez
d513addb77
added lineage tracking code
2013-10-30 20:05:29 -07:00
Joseph E. Gonzalez
a4b8ddf417
removing unused commented code
2013-10-30 16:07:05 -07:00
Joseph E. Gonzalez
38ec0baf5c
fixing a typo in the VertexSetRDD docs
2013-10-29 16:27:55 -07:00
Joseph E. Gonzalez
d8c8256e52
merging upstream changes
2013-10-29 16:23:26 -07:00
Joseph E. Gonzalez
08c7b040d6
Documented the VertexSetRDD
2013-10-29 15:03:13 -07:00
Joseph E. Gonzalez
ede329336d
Fixing a scaladoc bug in graph generators.
2013-10-29 14:50:12 -07:00
Joseph E. Gonzalez
15958ca65a
Reindenting documentation.
2013-10-29 14:01:24 -07:00
Joseph E. Gonzalez
d316cad9b1
Documented Graph.appy functions.
2013-10-29 13:58:04 -07:00
Joseph E. Gonzalez
19da8820fc
Minor modifications to documentation.
2013-10-29 11:06:06 -07:00
Joseph E. Gonzalez
77626d1507
Adding collect neighbors and documenting GraphOps.
2013-10-29 11:05:42 -07:00
Joseph E. Gonzalez
942de98433
Making suggested changes.
2013-10-29 10:19:49 -07:00
Joseph E. Gonzalez
d6a902f309
Finished updating connected components to used Pregel like abstraction and created a series of tests in the AnalyticsSuite.
2013-10-28 11:52:26 -07:00
Joseph E. Gonzalez
a2287ae138
Implementing connected components on top of pregel like abstraction.
2013-10-27 10:42:11 -07:00
Joseph E. Gonzalez
6a0fbc0374
Updating the GraphLab API to match the changes made to the Pregel API.
2013-10-26 15:44:19 -07:00
Joseph E. Gonzalez
08024c938c
Adding more documentation to the Pregel API as well as additional functionality including the ability to specify the edge direction along which messages are computed.
2013-10-26 15:42:51 -07:00
Joseph E. Gonzalez
00e73833cc
Fixing a bug in reverse edge direction.
2013-10-26 15:10:30 -07:00
Joseph E. Gonzalez
c30624dcbb
Adding dynamic pregel, fixing bugs in PageRank, and adding basic analytics unit tests.
2013-10-23 00:25:45 -07:00
Joseph E. Gonzalez
0bd92ed8d0
Fixing a bug in pregel where the initial vertex-program results are lost.
2013-10-22 19:10:51 -07:00
Joseph E. Gonzalez
be8269af07
Merge branch 'VertexSetRDD_Tests' into AnalyticsCleanup
2013-10-22 15:03:49 -07:00
Joseph E. Gonzalez
e3eb03d5b5
Starting analytics test suite.
2013-10-22 15:03:16 -07:00
Joseph E. Gonzalez
ba5c75692a
Updating analytics to reflect changes in the pregel interface and moving degree information into the edge attribute.
2013-10-22 15:03:00 -07:00
Joseph E. Gonzalez
46b195253e
Adding some additional graph generators to support unit testing of the analytics package.
2013-10-22 15:01:49 -07:00
Joseph E. Gonzalez
14a3329a11
Changing the Pregel interface slightly to better support type inference.
2013-10-22 15:01:20 -07:00
Joseph E. Gonzalez
ebdbedc3e9
Documenting VertexSetRDD and added some testing code for VertexSetRDD
2013-10-19 01:26:08 -07:00
Joseph E. Gonzalez
dbc8c9868a
Fixing bug in VertexSetRDD that breaks Graph tests.
2013-10-18 23:44:06 -07:00
Reynold Xin
9cf43cfeb7
Merge pull request #28 from jegonzal/VertexSetRDD
...
Refactoring IndexedRDD to VertexSetRDD.
2013-10-18 22:07:21 -07:00
Ankur Dave
2d3603930e
Add a unit test for GraphOps.joinVertices
2013-10-18 19:46:13 -07:00
Ankur Dave
d15db10831
Add a unit test for Graph.mapEdges
2013-10-18 19:46:13 -07:00
Ankur Dave
d429f015c0
Update GraphSuite aggregateNeighbors test
2013-10-18 19:46:13 -07:00
Joseph E. Gonzalez
5d01ebca3c
Specializing IndexedRDD as VertexSetRDD.
...
1) This allows the index map to be optimized for Vids
2) This makes the code more readable
2) The Graph API can now return VertexSetRDDs from operations that produce results for vertices
2013-10-18 19:03:59 -07:00
Joseph E. Gonzalez
bb58aa5330
Added some stub code to address the case where a vertex could occur multiple times in the vertex table or where a vertex in the edge list may not appear in the vertex table.
...
Moving IndexedRDD into the graphx source tree and removing dependencies in /core.
2013-10-18 18:15:32 -07:00
Ankur Dave
36a902e52d
Revert accidental removal of code in 3a40a5e
2013-10-18 16:19:40 -07:00
Dan Crankshaw
3a40a5eb30
Added some documentation.
2013-10-18 15:11:21 -07:00
Joseph E. Gonzalez
3f3d28c73f
Switching from Seq to IndexedSeq
2013-10-17 19:55:36 -07:00
Joseph E. Gonzalez
9a03c5fe28
This commit accomplishes three goals:
...
1) Further simplification of the IndexedRDD operations (eliminating some)
2) Aggressive reuse of HashMaps
3) Pipelining join operations within indexedrdd
2013-10-17 19:01:48 -07:00
Ankur Dave
bf19aac2b7
Use ArrayBuilder instead of ArrayBuffer
...
ArrayBuilder is specialized for holding primitive VD types.
2013-10-17 13:19:00 -07:00