Loading…
This event has ended. Create your own event → Check it out
This event has ended. Create your own
Register Now for ApacheCon North America 2014 - April 7-9 in Denver, CO. Registration fees increase on March 15th, so don’t delay!
View analytic
Monday, April 7 • 5:00pm - 5:50pm
Feeding the Elephant: Optimizing the Read Path of the Hadoop Distributed Filesystem

Sign up or log in to save this to your schedule and see who's attending!

The Hadoop Distributed Filesystem (HDFS) is a key component of the Hadoop distributed computation framework.  I'd like to talk about some important optimizations we made to the read path of HDFS, such as direct reads, short-circuit local reads, zero-copy reads, and HDFS caching.  Along the way, I'll talk about lessons that I learned while working on HDFS, and emerging trends in data center hardware.  Finally, I'll talk about some interesting ongoing and planned approaches to optimizing Hadoop and HDFS.

Speakers
CM

Colin McCabe

Software Engineer, Cloudera
Colin McCabe is a Platform Software Engineer at Cloudera, where he works on HDFS and related technologies. He is a committer on HDFS. Prior to joining Cloudera, he worked on the Ceph Distributed Filesystem, and the Linux kernel, among other things. He studied Computer Science and Computer Engineering at Carnegie Mellon.


Monday April 7, 2014 5:00pm - 5:50pm
Confluence A

Attendees (14)