Cloudera Hadoop Supports Amazon Elastic Block Storage
It integrates the Hadoop File System with Amazon EBS.
This is a Press Release edited by StorageNewsletter.com on June 11, 2009 at 3:41 pmCloudera, the commercial Hadoop company, announced that the Cloudera Distribution for Hadoop now supports running Hadoop on Amazon Elastic Compute Cloud (EC2) clusters with Amazon Elastic Block Store (EBS) backed storage. Typically, Hadoop clusters on Amazon EC2 use local disks or Amazon’s Simple Storage Service (S3). The Cloudera Distribution for Hadoop directly integrates the Hadoop File System with Amazon EBS.
Schematic showing how the cluster is set up
with Cloudera Distribution for Hadoop
When not in use, the Cloudera Distribution for Hadoop allows a cluster to surrender unneeded EC2 instances, then restart later, carrying on where it left off. Users no longer need to copy large volumes of data from S3 to local disk on the EC2 instance – data persists reliably and independently in Amazon’s EBS, saving compute costs.
"This announcement marks a major advancement in the Cloudera Distribution for Hadoop, as EBS support brings significant cost savings in storage versus S3," said Christophe Bisciglia, Founder at Cloudera. "The Cloudera Distribution for Hadoop leverages both EC2 and EBS to directly align computation costs with usage, lower overall storage costs, and improve performance."
Cloudera’s EBS support removes the requirement to move data in and out of S3 for each job. Additionally, it improves performance by enabling more disks per node compared to EC2, which provides a fixed number of local disks per instance. EBS support allows the Cloudera Distribution for Hadoop to perform better than local EC2 storage or Amazon’s premium priced, S3 based, Elastic MapReduce.
About Cloudera
Cloudera develops and distributes Hadoop, the open source software that powers the data processing engines of the world’s largest and most popular web sites. Founded by experts on big data from Facebook, Google, Oracle and Yahoo, Cloudera’s mission is to bring the power of Hadoop, MapReduce, and distributed storage to companies of all sizes in the enterprise, Internet and government sectors. Headquartered in Silicon Valley, Cloudera has financial backing from Accel Partners and angel investors who include Diane Greene (former CEO of VMware), Marten Mickos (former CEO of MySQL), and Gideon Yu (former CFO of Facebook). Cloudera’s advisors include the founders of the Hadoop project, Doug Cutting and Mike Cafarella.
For more information about the Cloudera Distribution for Hadoop with EBS integration