ApacheCon North America 2014 has ended
Register Now for ApacheCon North America 2014 - April 7-9 in Denver, CO. Registration fees increase on March 15th, so don’t delay!
Back To Schedule
Monday, April 7 • 3:00pm - 3:50pm
Sqoop 2 - New generation of Big Data Transfers

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Sqoop is a tool that was created to efficiently transfer big data between entire Hadoop ecosystem (components such as HDFS, Hive or HBase) and structured data stores (such as relational databases, data warehouses, or NoSQL systems). The popularity of Sqoop in enterprise systems confirms that Sqoop does bulk transfer admirably.

In the meantime, we have encountered many new challenges that have outgrown the abilities of the current infrastructure. To fulfill more data integration use cases, as well as become easier to manage and operate, a new generation of Sqoop has been created. With focus on ease of use, ease of extension, and security Sqoop 2 was born. This session will dive into Sqoop 2 architecture, describing differences between Sqoop 1, and the benefits that the new architecture brings.

avatar for Jaroslav Cecho

Jaroslav Cecho

Software Engineer, Cloudera
Jarek Jarcec Cecho is a software engineer at Cloudera, where he develops software to help customers better access and integrate with the Hadoop ecosystem. He has led the Sqoop community in the architecture of the next generation of Sqoop, known as Sqoop 2. He is also a co-auhor of... Read More →

Abraham Elmahrek

Software Engineer, Cloudera
Abe is a Software Engineer at Cloudera working on ingest systems. Prior to working on ingest systems, he helped develop and bring to market Hue 3. He is a member of the Apache Sqoop PMC and a committer on the Apache HTrace (incubating) project.

Monday April 7, 2014 3:00pm - 3:50pm PDT
Confluence C

Attendees (0)