spark-instrumented-optimizer/docs/streaming-kafka-integration.md
cody koeninger c9f2501af2 [SPARK-16312][STREAMING][KAFKA][DOC] Doc for Kafka 0.10 integration
## What changes were proposed in this pull request?
Doc for the Kafka 0.10 integration

## How was this patch tested?
Scala code examples were taken from my example repo, so hopefully they compile.

Author: cody koeninger <cody@koeninger.org>

Closes #14385 from koeninger/SPARK-16312.
2016-08-05 10:13:32 +01:00

54 lines
1.5 KiB
Markdown

---
layout: global
title: Spark Streaming + Kafka Integration Guide
---
[Apache Kafka](http://kafka.apache.org/) is publish-subscribe messaging rethought as a distributed, partitioned, replicated commit log service. Please read the [Kafka documentation](http://kafka.apache.org/documentation.html) thoroughly before starting an integration using Spark.
The Kafka project introduced a new consumer api between versions 0.8 and 0.10, so there are 2 separate corresponding Spark Streaming packages available. Please choose the correct package for your brokers and desired features; note that the 0.8 integration is compatible with later 0.9 and 0.10 brokers, but the 0.10 integration is not compatible with earlier brokers.
<table class="table">
<tr><th></th><th><a href="streaming-kafka-0-8-integration.html">spark-streaming-kafka-0-8</a></th><th><a href="streaming-kafka-0-10-integration.html">spark-streaming-kafka-0-10</a></th></tr>
<tr>
<td>Broker Version</td>
<td>0.8.2.1 or higher</td>
<td>0.10.0 or higher</td>
</tr>
<tr>
<td>Api Stability</td>
<td>Stable</td>
<td>Experimental</td>
</tr>
<tr>
<td>Language Support</td>
<td>Scala, Java, Python</td>
<td>Scala, Java</td>
</tr>
<tr>
<td>Receiver DStream</td>
<td>Yes</td>
<td>No</td>
</tr>
<tr>
<td>Direct DStream</td>
<td>Yes</td>
<td>Yes</td>
</tr>
<tr>
<td>SSL / TLS Support</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr>
<td>Offset Commit Api</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr>
<td>Dynamic Topic Subscription</td>
<td>No</td>
<td>Yes</td>
</tr>
</table>