Spark

The Data Streaming Landscape 2023

Data streaming is a new software category for processing data in motion. Apache Kafka is the de facto standard used…

2 years ago

The Heart of the Data Mesh Beats Real-Time with Apache Kafka

If there were a buzzword of the hour, it would undoubtedly be "data mesh"! This new architectural paradigm unlocks analytic…

2 years ago

Kappa Architecture is Mainstream Replacing Lambda

Real-time data beats slow data. That's true for almost every use case. Nevertheless, enterprise architects build new infrastructures with the…

3 years ago

Can Apache Kafka Replace a Database?

Can and should Apache Kafka replace a database? How long can and should I store data in Kafka? How can…

5 years ago

Big Data Spain: Talk about KSQL – The Streaming SQL Engine for Apache Kafka

KSQL, the open source streaming SQL engine for Apache Kafka: slides from my talk at Big Data Spain…

6 years ago

Machine Learning Trends of 2018 combined with the Apache Kafka Ecosystem

At the OOP 2018 conference in Munich, I presented an updated version of my talk about building scalable, mission-critical microservices with…

7 years ago

Kafka Streams + H2O.ai + TensorFlow (Video Recording / Live Demo)

I do a lot of presentations these days at meetups and conferences about how to leverage Apache Kafka and Kafka…

7 years ago

Apache Kafka Streams + Machine Learning (Spark, TensorFlow, H2O.ai)

Use Apache Kafka Streams to build real-time streaming microservices. Apply machine learning / deep learning using Spark, TensorFlow, H2O.ai, etc.…

7 years ago

Why I Move (Back) to Open Source for Messaging, Integration and Stream Processing

After three great years at TIBCO Software, I am moving back to open source and joining Confluent, the company behind the…

8 years ago

Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Services

Streaming Analytics Comparison of Open Source Frameworks, Products and Cloud Services. Includes Apache Storm, Flink, Spark, TIBCO, IBM, AWS Kinesis,…

8 years ago