
“Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about Real Time?” – Slides (including TIBCO Examples) from JAX 2014 Online

Slides from my talk “Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about Real Time?” at JAX 2014 (Twitter #jaxcon) in Mainz are online. JAX is a great conference with interesting topics and many good speakers!

Content (Data Warehouse, Business Intelligence, Hadoop, Stream Processing)

Big data represents a significant paradigm shift in enterprise technology. It radically changes the nature of the data management profession, as it introduces new concerns about the volume, velocity and variety of corporate data. New business models based on predictive analytics, such as recommendation systems or fraud detection, are more relevant than ever before. Apache Hadoop appears to be becoming the de facto standard for implementing big data solutions. For that reason, solutions from many different vendors have emerged on top of Hadoop.

But hold on… Companies have spent a lot of money over the last decades to implement a data warehouse for the same reason. Both Apache Hadoop and the data warehouse were invented to store and analyze big data. This session explains the different architectural and technical concepts of Apache Hadoop and a data warehouse. The following questions will be answered: When should you use which alternative? Does a data warehouse even have a future at all? Or how can we combine both alternatives?

However, Hadoop and a Data Warehouse cannot solve every big data problem. Complex event processing and real-time analytics need a different approach. In-memory computing and streaming platforms are therefore good alternatives or complements to Hadoop for processing and analyzing big data. For that reason, an almost unimaginable number of big data solutions have emerged on the market. This session shows and compares the most important concepts and solutions for processing and analyzing big data, and discusses how they complement each other.
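To make the difference between batch and stream processing more concrete, here is a minimal Python sketch. It is not taken from the slides and does not use any TIBCO or Hadoop API. The names `batch_total` and `SlidingWindowSum` and the payment data are hypothetical and serve only as illustration. The batch function analyzes data after it has all been stored, which is roughly the Hadoop or data warehouse style. The sliding window updates its result with every incoming event, which is the basic idea behind stream processing.

```python
from collections import deque


def batch_total(events):
    """Batch style: all events are stored first, then analyzed in one pass."""
    return sum(amount for _, amount in events)


class SlidingWindowSum:
    """Streaming style: keep a running sum over the last `window_seconds`."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self.events = deque()  # (timestamp, amount) pairs still inside the window
        self.total = 0.0

    def add(self, timestamp, amount):
        # Accept the new event, then evict events that fell out of the window.
        self.events.append((timestamp, amount))
        self.total += amount
        while self.events and self.events[0][0] <= timestamp - self.window_seconds:
            _, old_amount = self.events.popleft()
            self.total -= old_amount
        return self.total


# Hypothetical card payments as (timestamp in seconds, amount).
payments = [(0, 20.0), (5, 30.0), (8, 500.0), (70, 10.0)]

window = SlidingWindowSum(window_seconds=60)
running = [window.add(ts, amount) for ts, amount in payments]

print(batch_total(payments))  # one answer, available only after all data is collected
print(running)                # one answer per event, available immediately
```

In a fraud detection scenario, the running value could be compared against a threshold as each payment arrives, while the batch total only becomes available after the fact. A real streaming platform adds distribution, fault tolerance and handling of out-of-order events on top of this idea.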

TIBCO Products (Spotfire, StreamBase, BusinessEvents, BusinessWorks) and Real World Examples

I discuss a good big data architecture which includes Data Warehouse / Business Intelligence + Apache Hadoop + Real Time / Stream Processing. Several real-world examples are shown. TIBCO offers some very nice products for realizing these use cases, e.g. Spotfire (Business Intelligence / BI), StreamBase (Stream Processing), BusinessEvents (Complex Event Processing / CEP) and BusinessWorks (Integration / ESB). TIBCO is also ready for Hadoop by offering connectors and plugins for many important Hadoop frameworks / interfaces such as HDFS, Pig, Hive, Impala, Apache Flume and more.

Slides

The slides are available on SlideShare (www.slideshare.net).

As always, I appreciate feedback and discussions.

Kai Wähner



Published by

Kai Waehner

11 years ago
