[DOC] Minor modification to Streaming docs with regards to parallel data receiving
pwendell tdas
Author: Nishkam Ravi <nravi@cloudera.com>
Author: nishkamravi2 <nishkamravi@gmail.com>
Author: nravi <nravi@c1704.halxg.cloudera.com>
Closes #6544 from nishkamravi2/master_nravi and squashes the following commits:
46e8c03 [Nishkam Ravi] Slight modification to streaming docs
(cherry picked from commit e7c7e51f2e)
Signed-off-by: Sean Owen <sowen@cloudera.com>
parent 78a6723e87
commit 2f41cf3e29
@@ -1946,10 +1946,10 @@ creates a single receiver (running on a worker machine) that receives a single s
 Receiving multiple data streams can therefore be achieved by creating multiple input DStreams
 and configuring them to receive different partitions of the data stream from the source(s).
 For example, a single Kafka input DStream receiving two topics of data can be split into two
-Kafka input streams, each receiving only one topic. This would run two receivers on two workers,
-thus allowing data to be received in parallel, and increasing overall throughput. These multiple
-DStream can be unioned together to create a single DStream. Then the transformations that was
-being applied on the single input DStream can applied on the unified stream. This is done as follows.
+Kafka input streams, each receiving only one topic. This would run two receivers,
+allowing data to be received in parallel, and increasing overall throughput. These multiple
+DStreams can be unioned together to create a single DStream. Then the transformations that were
+being applied on a single input DStream can be applied on the unified stream. This is done as follows.
 
 <div class="codetabs">
 <div data-lang="scala" markdown="1">
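For reviewers, the pattern the changed paragraph describes (one Kafka input DStream per topic, then a union) might look roughly like the sketch below. It is not part of this commit. It assumes the receiver-based `KafkaUtils.createStream` from the Spark 1.x `spark-streaming-kafka` artifact; the ZooKeeper address, group id, topic names, and the `local[4]` master are hypothetical placeholders.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka.KafkaUtils

object UnionKafkaStreams {
  def main(args: Array[String]): Unit = {
    // Assumption: local mode with more threads than receivers, so that
    // some threads remain free to process the received data.
    val conf = new SparkConf().setAppName("UnionKafkaStreams").setMaster("local[4]")
    val ssc = new StreamingContext(conf, Seconds(2))

    // Hypothetical connection settings and topic names.
    val zkQuorum = "localhost:2181"
    val groupId = "example-group"
    val topics = Seq("topicA", "topicB")

    // One input DStream, and therefore one receiver, per topic.
    val kafkaStreams = topics.map { topic =>
      KafkaUtils.createStream(ssc, zkQuorum, groupId, Map(topic -> 1))
    }

    // Union the per-topic streams into a single DStream of (key, message) pairs.
    val unifiedStream = ssc.union(kafkaStreams)

    // Transformations that used to run on the single input DStream now run here.
    unifiedStream.map(_._2).print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```

Where the two receivers are scheduled is up to Spark, which is consistent with the new wording dropping "on two workers".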