Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Services

In November 2016, I was at Big Data Spain in Madrid for the first time. A great conference with many awesome speakers and sessions about very hot topics such as Apache Hadoop, Spark, Stream Processing / Streaming Analytics and Machine Learning. If you are interested in big data, then this conference is for you! My two talks:

  • “How to Apply Machine Learning to Real Time Processing” (see slides and video recording from a similar conference talk)
  • “Comparison of Streaming Analytics Options” (the reason for this blog post; an updated version of my talk from JavaOne 2015)

Here I want to share the slides and a video recording of the latter…

Abstract: Comparison of Stream Processing Options

This session discusses the technical concepts of stream processing / streaming analytics and how they relate to big data, mobile, cloud and the Internet of Things. Different use cases such as predictive fault management or fraud detection are used to show and compare alternative frameworks and products for stream processing and streaming analytics.

The focus of the session lies on comparing

  • different open source frameworks such as Apache Apex, Apache Flink or Apache Spark Streaming
  • engines from software vendors such as IBM InfoSphere Streams or TIBCO StreamBase
  • cloud offerings such as AWS Kinesis
  • real time streaming UIs such as Striim, Zoomdata or TIBCO Live Datamart

Live demos will give the audience a good feeling about how to use these frameworks and tools.

The session will also discuss how stream processing is related to Apache Hadoop frameworks (such as MapReduce, Hive, Pig or Impala) and machine learning (such as R, Spark ML or H2O.ai).

Slides – Alternatives for Streaming Analytics

The following slide deck is a more extensive version of the talk at Big Data Spain (as the conference talks were only 30 minutes):


The video recording walks you through the above slide deck:

As always, I appreciate any comments, questions or other feedback.

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming, processing and analytics
