Categories: Uncategorized

You are not Facebook or Google? Why you should still care about Big Data and Apache Hadoop Ecosystem (Pig, Hive, Hortonworks, Cloudera, MapR, Informatica, Talend)

In March 2013, I was at 33rd Degree – “A Conference for Java Masters”. I had two talks, including a new one: “You are not Facebook or Google? Why you should still care about Big Data”. It is a great talk to give an overview about big data, especially from a business perspective (paradigm shift, business value, challenges). However, I also talk about alternatives for big data from a technology perspective, mainly about the defacto standard Apache Hadoop, its ecosystem (Hive, Pig, HBase, Oozie, Sqoop, etc.), distributions (Cloudera, Hortonworks, MapR), and tooling (i.e. big data suites, e.g. Talend, Informatica, Oracle, IBM).

I want to share the slides with you:

Click on the button to load the content from www.slideshare.net.

Load content

As always, I appreciate every feedback, questions and other comments…

 

Best regards,

Kai Wähner (Twitter: @KaiWaehner)

Kai Waehner

bridging the gap between technical innovation and business value for real-time data streaming, processing and analytics

Recent Posts

Virta’s Electric Vehicle (EV) Charging Platform with Real-Time Data Streaming: Scalability for Large Charging Businesses

The rise of Electric Vehicles (EVs) demands a scalable, efficient charging network—but challenges like fluctuating…

7 hours ago

Apache Kafka 4.0: The Business Case for Scaling Data Streaming Enterprise-Wide

Apache Kafka 4.0 represents a major milestone in the evolution of real-time data infrastructure. Used…

3 days ago

How Apache Kafka and Flink Power Event-Driven Agentic AI in Real Time

Agentic AI marks a major evolution in artificial intelligence—shifting from passive analytics to autonomous, goal-driven…

1 week ago

Shift Left Architecture at Siemens: Real-Time Innovation in Manufacturing and Logistics with Data Streaming

Industrial enterprises face increasing pressure to move faster, automate more, and adapt to constant change—without…

2 weeks ago

The Importance of Focus: Why Software Vendors Should Specialize Instead of Doing Everything (Example: Data Streaming)

As real-time technologies reshape IT architectures, software vendors face a critical decision: specialize deeply in…

2 weeks ago

The Top 20 Problems with Batch Processing (and How to Fix Them with Data Streaming)

Batch processing introduces delays, complexity, and data quality issues that modern businesses can no longer…

3 weeks ago