ApacheCon North America 2014 has ended
Register Now for ApacheCon North America 2014 - April 7-9 in Denver, CO. Registration fees increase on March 15th, so don’t delay!
Back To Schedule
Monday, April 7 • 5:00pm - 5:50pm
Feeding the Elephant: Optimizing the Read Path of the Hadoop Distributed Filesystem

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

The Hadoop Distributed Filesystem (HDFS) is a key component of the Hadoop distributed computation framework.  I'd like to talk about some important optimizations we made to the read path of HDFS, such as direct reads, short-circuit local reads, zero-copy reads, and HDFS caching.  Along the way, I'll talk about lessons that I learned while working on HDFS, and emerging trends in data center hardware.  Finally, I'll talk about some interesting ongoing and planned approaches to optimizing Hadoop and HDFS.


Colin McCabe

Software Engineer, Cloudera
Colin McCabe is a Platform Software Engineer at Cloudera, where he works on HDFS and related technologies. He is a committer on HDFS. Prior to joining Cloudera, he worked on the Ceph Distributed Filesystem, and the Linux kernel, among other things. He studied Computer Science and... Read More →

Monday April 7, 2014 5:00pm - 5:50pm PDT
Confluence A

Attendees (0)