ApacheCon North America 2014 (April 7-9, Denver, CO) has ended
Tuesday, April 8 • 4:45pm - 5:35pm
Interoperability in the Apache Hive Ecosystem


Apache Hadoop has grown over time to spawn many other Apache projects, each of which enables crunching big data in one way or another. Because some of those projects need to talk to each other, a smaller ecosystem has developed among them, notably Hive, HCatalog, Pig and HBase.

In this presentation, we will begin with a baseline overview of Apache Hadoop and MapReduce. We will outline the other related Apache projects (Hive, Pig, and HBase) and their niches, and highlight their use cases and best practices. Then we will tie them all together via HCatalog APIs and common metadata, and look at some patterns for usage introspection and optimization.

We will approach the above from a historical perspective, tracing the evolution of the tools in this ecosystem, and also provide a sneak peek into recent developments and the future of Hadoop and these projects.

Speakers

Mithun Radhakrishnan

Programmer, Yahoo
Erstwhile firmware developer. Apache HCatalog committer. Author of DistCp for Hadoop-2. Has moderate to severe C++ withdrawal symptoms. Currently works on Hive and its ecosystem over at Yahoo!

Sushanth Sowmyan

Hortonworks
Sushanth Sowmyan is an Apache HCatalog committer and a longtime Apache Hive contributor who spends most of his time oscillating between worrying about backward compatibility and being worked up about doing it "the right way". He currently works at Hortonworks in their data query...


Tuesday April 8, 2014 4:45pm - 5:35pm PDT
Confluence C
