public class DirectKafkaInputDStream<K,V,U extends kafka.serializer.Decoder<K>,T extends kafka.serializer.Decoder<V>,R> extends InputDStream<R> implements Logging
A stream of KafkaRDD where each given Kafka topic/partition corresponds to an RDD partition. The Spark configuration spark.streaming.kafka.maxRatePerPartition gives the maximum number of messages per second that each partition will accept. Starting offsets are specified in advance, and this DStream is not responsible for committing offsets, so that you can control exactly-once semantics. For an easy interface to Kafka-managed offsets, see KafkaCluster.
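In application code this stream is normally obtained through KafkaUtils.createDirectStream rather than by constructing the class directly. A minimal sketch, assuming String keys and values; the topic name, broker list, offsets, and rate limit below are illustrative:

```scala
import kafka.common.TopicAndPartition
import kafka.message.MessageAndMetadata
import kafka.serializer.StringDecoder
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka.KafkaUtils

val conf = new SparkConf()
  .setAppName("direct-kafka-example")
  // Cap ingestion at 10000 messages per second per Kafka partition.
  .set("spark.streaming.kafka.maxRatePerPartition", "10000")
val ssc = new StreamingContext(conf, Seconds(5))

val kafkaParams = Map("metadata.broker.list" -> "broker1:9092,broker2:9092")

// Starting offsets are specified in advance, e.g. restored from your own offset store.
val fromOffsets = Map(
  TopicAndPartition("events", 0) -> 0L,
  TopicAndPartition("events", 1) -> 0L)

// messageHandler maps each MessageAndMetadata[K, V] to the stream's record type R.
val messageHandler = (mmd: MessageAndMetadata[String, String]) => (mmd.key, mmd.message)

val stream = KafkaUtils.createDirectStream[
  String, String, StringDecoder, StringDecoder, (String, String)](
  ssc, kafkaParams, fromOffsets, messageHandler)
```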
Modifier and Type | Class and Description |
---|---|
class | DirectKafkaInputDStream.DirectKafkaInputDStreamCheckpointData |
Constructor and Description |
---|
DirectKafkaInputDStream(StreamingContext ssc_, scala.collection.immutable.Map<String,String> kafkaParams, scala.collection.immutable.Map<kafka.common.TopicAndPartition,Object> fromOffsets, scala.Function1<kafka.message.MessageAndMetadata<K,V>,R> messageHandler, scala.reflect.ClassTag<K> evidence$1, scala.reflect.ClassTag<V> evidence$2, scala.reflect.ClassTag<U> evidence$3, scala.reflect.ClassTag<T> evidence$4, scala.reflect.ClassTag<R> evidence$5) |
Modifier and Type | Method and Description |
---|---|
scala.Option<KafkaRDD<K,V,U,T,R>> | compute(Time validTime) |
scala.collection.immutable.Map<kafka.common.TopicAndPartition,Object> | fromOffsets() |
scala.collection.immutable.Map<String,String> | kafkaParams() |
int | maxRetries() |
void | start() Method called to start receiving data. |
void | stop() Method called to stop receiving data. |
Methods inherited from class org.apache.spark.streaming.dstream.InputDStream: dependencies, isTimeValid, lastValidTime, slideDuration

Methods inherited from class org.apache.spark.streaming.dstream.DStream: cache, checkpoint, checkpointDuration, clearCheckpointData, clearMetadata, context, count, countByValue, countByValueAndWindow, countByWindow, creationSite, filter, flatMap, foreach, foreach, foreachRDD, foreachRDD, generatedRDDs, generateJob, getCreationSite, getOrCompute, glom, graph, initialize, isInitialized, map, mapPartitions, mustCheckpoint, parentRememberDuration, persist, persist, print, print, reduce, reduceByWindow, reduceByWindow, register, remember, rememberDuration, repartition, restoreCheckpointData, saveAsObjectFiles, saveAsTextFiles, setContext, setGraph, slice, slice, ssc, storageLevel, toPairDStreamFunctions, transform, transform, transformWith, transformWith, union, updateCheckpointData, validate, window, window, zeroTime

Methods inherited from class java.lang.Object: equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface org.apache.spark.Logging: initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
public DirectKafkaInputDStream(StreamingContext ssc_, scala.collection.immutable.Map<String,String> kafkaParams, scala.collection.immutable.Map<kafka.common.TopicAndPartition,Object> fromOffsets, scala.Function1<kafka.message.MessageAndMetadata<K,V>,R> messageHandler, scala.reflect.ClassTag<K> evidence$1, scala.reflect.ClassTag<V> evidence$2, scala.reflect.ClassTag<U> evidence$3, scala.reflect.ClassTag<T> evidence$4, scala.reflect.ClassTag<R> evidence$5)
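The five evidence$ parameters are Scala implicit ClassTags that the compiler supplies automatically at the call site; only Java callers see them spelled out. A sketch of what direct construction looks like from Scala, reusing the ssc, kafkaParams, fromOffsets, and messageHandler values from the example above (the supported entry point remains KafkaUtils.createDirectStream):

```scala
// The ClassTag evidence parameters are inferred; none are passed explicitly.
val stream = new DirectKafkaInputDStream[
  String, String, StringDecoder, StringDecoder, (String, String)](
  ssc, kafkaParams, fromOffsets, messageHandler)
```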
public scala.collection.immutable.Map<String,String> kafkaParams()
public scala.collection.immutable.Map<kafka.common.TopicAndPartition,Object> fromOffsets()
public int maxRetries()
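In the Spark source this value is backed by the spark.streaming.kafka.maxRetries configuration (default 1), bounding how many times the stream retries fetching the latest leader offsets when computing a batch. A sketch of raising it, with 3 as an illustrative value:

```scala
// Assumption: spark.streaming.kafka.maxRetries backs maxRetries(); default is 1.
val conf = new SparkConf()
  .setAppName("direct-kafka-example")
  .set("spark.streaming.kafka.maxRetries", "3")
```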
public void start()
Description copied from class: InputDStream
Method called to start receiving data.
Specified by: start in class InputDStream<R>
public void stop()
Description copied from class: InputDStream
Method called to stop receiving data.
Specified by: stop in class InputDStream<R>
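start() and stop() are lifecycle hooks driven by the StreamingContext rather than methods you call yourself; a minimal sketch of the driving code (the 60-second timeout is illustrative):

```scala
ssc.start()                        // the framework invokes start() on each registered input stream
ssc.awaitTerminationOrTimeout(60 * 1000L)
ssc.stop(stopSparkContext = true)  // the framework invokes stop() on each input stream
```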