Spark Streaming: Dealing with State, by François Garillot

Apr 28, 2016 · Lausanne, Switzerland

One of the first steps in adopting stream processing is understanding that little if any data should be kept around during processing. Yet having completely stateless transformations is often difficult. We'll take a couple of examples of stream processing tasks where state might make sense — a simple aggregative ETL job, and an anomaly detection task — and drive them through the features Spark Streaming offers to address the issue of transforming DStreams with memory. 

Audiences should come back from this talk with a better view when and where it's appropriate to collect some state in stream processing, and in the facilities available in Spark Streaming — now and in the future — to do so.

François is a Big Data Scientist at Swisscom and was previously part of the Typesafe (now Lightbend) crew.

NB: our friends from the Scala Romandie meeting is hosting a Spark meetup on April 19th in Geneva (http://www.meetup.com/Scala-Romandie/events/229599508/), and both talk should delightfully come together.

-----------------------

Many thanks to Lightbend (www.lightbend.com) and OCTO Technology (www.octo.ch) to host this meetup.


Event organizers
  • Big Data Romandie

    "Big Data Romandie": pour aller au delà du buzz.  "Big data", "fat data", "fast data", "rich data"... un écosystème croît chaque jour, offrant autant de moyens pour relever les nouveaux défis qui se dressent devant nous. Nous vous proposons de nous retrouver pour partager et découvrir de nouveaux outils, techniques ou cas d'utilisation, pour démystifier ensembles le "Big Data. Big Data Roman...

    Recent Events
    More

Are you organizing Spark Streaming: Dealing with State, by François Garillot?

Claim the event and start manage its content.

I am the organizer
Social
Rating

based on 0 reviews