Spark Streaming
Dstream
- Break up the stream into distinct RDDs
- It's not actually real streaming, but rather micro batches
Structured Streaming
- Uses datasets instead of RDDs
- More modern API
- Real streaming on real time!
- It's an ever expanding dataset
Dstream
Structured Streaming