Skip to content

Spark Streaming

  • Dstream
  • Break up the stream into distinct RDDs
  • It's not actually real streaming, but rather micro batches
  • Structured Streaming
  • Uses datasets instead of RDDs
  • More modern API
  • Real streaming on real time!
  • It's an ever expanding dataset