Getting Started with Spark Streaming

WARNING!

This presentation is now obsolete. The information in this talk has not been updated in a while and could be outdated. Use the information in this presentation at your own risk and be sure to double-check things to ensure that any information you use is accurate.

ABSTRACT

With the broader adoption of message brokers like Apache Kafka as well as distributed, message-sending architectures, the need for tools which can process vast amounts of data quickly became critical. To fill this need, we have several competing products, including Spark Streaming. In this talk, we will understand the use cases for stream processing and how Spark's concept of distributed batch processing reduces down to micro batches in the streaming case. We will understand the two streaming models for Spark, DStreams and Structured Streaming with DataFrames, and will see examples of streaming applications in Scala and F#.

ADDITIONAL MEDIA

No recordings or additional media are available for this talk.