[DOC] Minor modification to Streaming docs with regards to parallel data receiving

pwendell tdas

Author: Nishkam Ravi <nravi@cloudera.com>
Author: nishkamravi2 <nishkamravi@gmail.com>
Author: nravi <nravi@c1704.halxg.cloudera.com>

Closes #6544 from nishkamravi2/master_nravi and squashes the following commits:

46e8c03 [Nishkam Ravi] Slight modification to streaming docs

(cherry picked from commit e7c7e51f2e)
Signed-off-by: Sean Owen <sowen@cloudera.com>
This commit is contained in:
Nishkam Ravi 2015-06-01 21:34:41 +01:00 committed by Sean Owen
parent 78a6723e87
commit 2f41cf3e29

View file

@ -1946,10 +1946,10 @@ creates a single receiver (running on a worker machine) that receives a single s
Receiving multiple data streams can therefore be achieved by creating multiple input DStreams
and configuring them to receive different partitions of the data stream from the source(s).
For example, a single Kafka input DStream receiving two topics of data can be split into two
Kafka input streams, each receiving only one topic. This would run two receivers on two workers,
thus allowing data to be received in parallel, and increasing overall throughput. These multiple
DStream can be unioned together to create a single DStream. Then the transformations that was
being applied on the single input DStream can applied on the unified stream. This is done as follows.
Kafka input streams, each receiving only one topic. This would run two receivers,
allowing data to be received in parallel, and increasing overall throughput. These multiple
DStreams can be unioned together to create a single DStream. Then the transformations that were
being applied on a single input DStream can be applied on the unified stream. This is done as follows.
<div class="codetabs">
<div data-lang="scala" markdown="1">