ApacheCon North America 2014 has ended
Register Now for ApacheCon North America 2014 - April 7-9 in Denver, CO. Registration fees increase on March 15th, so don’t delay!
Back To Schedule
Monday, April 7 • 2:00pm - 2:50pm
Apache Falcon – Simplifying managing data jobs on Hadoop

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Falcon is a framework for simplifying data management and pipeline processing in Apache Hadoop. It enables users to automate the movement and processing of datasets for ingest, pipelines, disaster recovery and data retention use cases. Instead of hard-coding complex dataset and pipeline processing logic, users can now rely on Apache Falcon for these functions, maximizing reuse and consistency across Hadoop applications.

Apache Falcon simplifies the development and management of data processing pipelines with introduction of higher layer of abstractions for users to work with such as Data Sets, Process and Infrastructure entities that are expressed using declarative language.

The presentation covers detailed design and architecture along with case studies on the usage of Falcon in production. We also look at how this compares against solutions if we took a silo-ed approach.


Shwetha GS

Staff Engineer, InMobi
Shwetha GS is a Staff Engineer at InMobi and has building data processing applications over Hadoop. She is a committer and PMC with Apache Falcon (incubating) and a contributor on Apache Oozie. Prior to InMobi, she was with Amazon.

Monday April 7, 2014 2:00pm - 2:50pm PDT
Confluence C

Attendees (0)