What are you looking for ?
Advertise with us
RAIDON

Amazon Web Services Announces Limited Preview of Redshift

Low-cost petabyte-scale data warehouse service in cloud

Amazon.com, Inc., an Amazon.com company, announced the limited preview of Amazon Redshift, a fast and powerful, fully managed, petabyte-scale data warehouse service in the cloud.

Redshift enables customers to increase the speed of query performance when analyzing virtually any size data set, using the same SQL-based business intelligence tools they use today. With a few clicks in the AWS Management Console, customers can launch a Redshift cluster, starting with a few hundred gigabytes and scaling to a petabyte or more, for under $1,000 per terabyte per year – one tenth the price of most data warehousing solutions available to customers today.

Self-managed, on-premise data warehouses require significant time and resource to administer, especially for large datasets. Loading, monitoring, tuning, taking backups, and recovering from faults are complex and time-consuming tasks. And, the financial cost associated with building, maintaining, and growing traditional data warehouses is flat-out expensive. Larger companies have resigned themselves to paying such a high cost for data warehousing, while smaller companies often find the hardware and software costs prohibitively expensive, leaving most of these organizations without a data warehousing capability.

Redshift aims to change this quagmire. It manages all of the work needed to set up, operate, and scale a data warehouse, from provisioning capacity to monitoring and backing up the cluster, to applying patches and upgrades. Scaling a cluster to improve performance or increase capacity on Redshift is simple and incurs no downtime, while the service continuously monitors the health of the cluster and automatically replaces any component needed. Redshift is also priced cost-effectively (a fraction of existing data warehouses) to enable larger companies to substantially reduce their costs and smaller companies to take advantage of the analytic insights that come from using a powerful data warehouse.

"Over the past two years, one of the most frequent requests we’ve heard from customers is for AWS to build a data warehouse service," said Raju Gulabani, VP of Database Services, AWS. "Enterprises are tired of paying such high prices for their data warehouses and smaller companies can’t afford to analyze the vast amount of data they collect (often throwing away 95% of their data). This frustrates customers as they know the cloud has made it easier and less expensive than ever to collect, store, and analyze data. Amazon Redshift not only significantly lowers the cost of a data warehouse, but also makes it easy to analyze large amounts of data very quickly. While actual performance will vary based on each customers’ specific query requirements, our internal tests have shown over 10 times performance improvement when compared to standard relational data warehouses. Having the ability to quickly analyze petabytes of data at a low cost changes the game for our customers."

Redshift uses a number of techniques, including columnar storage, compression, and high performance IO and network, to achieve higher performance than traditional databases for data warehousing and analytics workloads. By distributing and parallelizing queries across a cluster of inexpensive nodes, Redshift makes it easy to obtain high performance without requiring customers to hand-tune queries, maintain indices, or pre-compute results. It is certified by business intelligence tools, including Jaspersoft and MicroStrategy. Over twenty customers, including Flipboard, NASA/JPL, Netflix, and Schumacher Group, are in the Amazon Redshift private beta program.

"At Netflix, we deliver personalized recommendations for our millions of subscribers by analyzing large volumes of data, and are always looking for ways to improve our service," said Kurt Brown, director, Data Science and Engineering Platform at Netflix, Inc. "We’re very excited about the cost-disruptive and cloud-based model of Amazon Redshift. It’s sure to shake up the data warehousing industry."

"We are excited about being able to use this new service to take our cloud usage even farther and run a large scale data warehouse in the cloud for our engineering, science, and IT data," said Tom Soderstrom, CTO, office of the CIO, NASA/JPL. "We’re delighted to have a new, fast and low-cost option for analyzing massive amounts of data. This new service will also allow us to create new types of Big Data analytics that will lead to new discoveries."

"The Amazon Enterprise Data Warehouse manages petabytes of data for every group at Amazon. We are seeing significant performance improvements leveraging Amazon Redshift over our current multi-million dollar data warehouse," said Erik Selberg, manager of the Amazon.com Data Warehouse team. "Some multi-hour queries finish in under an hour, and some queries that took 5-10 minutes on our current data warehouse are now returning in seconds with Amazon Redshift. Early estimates show the cost of Amazon Redshift will be well under 1/10th the cost of our existing solution. Amazon Redshift is providing us with a cost-effective way to scale with our growing data analysis needs."

Redshift includes technology components licensed from ParAccel, Inc. and is available with two underlying node types, including either 2TB or 16TB of compressed customer data per node. One cluster can scale up to 100 nodes and on-demand pricing starts at just $0.85 per hour for a 2TB data warehouse, scaling linearly up to a petabyte and more. Reserved instance pricing lowers the effective price to $0.228 per hour or under $1,000/TB/year – less than one tenth the price of comparable technology available to customers today.

Prices in US East (N. Virginia)
DW Node Class (On-Demand) / Hourly / Price/TB/Year
High Storage Extra Large (XL) DW Node (Per Node) / $0.85/hour / $3,723
High Storage Eight Extra Large (8XL) DW Node (Per Node) / $6.8/hour / $3,723

Articles_bottom
ExaGrid
AIC
ATTOtarget="_blank"
OPEN-E