Saturday, May 22, 2010

Energy and Space Efficient Storage: Multi-tier Strategies for Protecting and Retaining Data


Energy and Space Efficient Storage: Multi-tier Strategies for Protecting and Retaining Data


Data center power, cooling, and space requirements are becoming key IT challenges for organizations of all sizes. At the same time, IT managers are faced with ever-increasing demands for data protection, including better restore performance and longer retention times. Meeting these challenges requires a multi-tier storage architecture along with new technologies like data deduplication.


Power, Cooling, and Space Concerns in the Data center

Power, cooling, and space requirements are key issues faced by IT managers. Demand-side contributors to this situation include decreased hardware acquisition costs, sharp increases in server density, and an exponential rise in the volume of data being stored and managed.

The Environmental Protection Agency has predicted that data center power usage will double over the next five years. The EPA released a paper by Dr. Jonathan G. Koomey, professor at Stanford University, stating that 1.2% of all power purchased in the United States was consumed by data centers in order to operate and cool equipment1. He also warned that this phenomenon is on the rise and could impact data centers worldwide if we don’t focus our attention toward green computing.

On the supply side, electric utilities already struggle to keep up with demand. Brown-out restrictions can be commonplace during peak consumption periods, and many IT directors are finding that they cannot increase the amount of power they source from their local power grid. IT analysts believe that by the end of the decade the world’s data centers will have run out of power. Yet, building new, more efficient facilities can be cost prohibitive due to construction and cooling costs.

While data centers remain an essential part of most businesses, there will come a time very soon when growth may need to be limited because of power costs and the growing awareness at governmental, corporate, and personal levels in regards to the importance of reducing our environmental impact.


Impact of Trend on Disk Backup

The growing restrictions on power, cooling, and use of data center footprint can be seen as coming into conflict with another major IT trend namely a steady increase in the percentage of disk in the data protection mix. Using disk as part of a backup strategy is growing rapidly as IT departments take advantage of the faster backup and restore performance that disk delivers. In turn, disk provides higher service levels to their customers and reduces the use of removable media. But disk storage is fast becoming one of the primary power and cooling culprits in the data center, and when disk is used for backup, the requirements can increase dramatically.

The solution is not to stop data protection progress, but rather to take advantage of various storage technologies by combining different kinds of media for an optimal balance of performance and cost, including the ongoing energy consumption factor. One of the new technologies that is having a major impact in this area is data deduplication.


Data Deduplication Is A Key Enabler

Data deduplication technology is being vaunted by industry analysts as the next big IT advance, with the Taneja Group predicting 108% compound growth for this market sector through 2010. There are several reasons for this prediction. Data deduplication reduces the amount of disk required to protect a given amount of primary data by 90% or more by detecting and eliminating redundant blocks within files and, in some products, between different files and file types. It allows users to exploit the performance characteristics of disk for backup without incurring the costs of conventional disk systems-costs which include both capital expenses and the operating expenses that include power and cooling.

Disaster-recovery protection also benefits from some vendors’ implementations of data deduplication. Using the same technology that identifies duplicate segments inside datasets, data deduplication systems also reduce the bandwidth needed to transmit backup sets over a network. With data deduplication, once systems are synchronized, whole backup sets can be replicated while only changed blocks are actually moved. If a new backup is only 5% different from a previous one at a block level, then we can reduce the bandwidth needed for transmission by up to 95%. Replication allows multiple copies of backup data to be maintained on disk in different sites using WAN links. Its use can simplify backup and reduce costs by decreasing an IT department’s need to buy, manage, and move tapes. At the same time, it provides improved disaster recovery protection.


Meeting Requirements with Multiple Tiers of Storage

The optimal approach to data protection and retention is to match the various business requirements with the right storage technology. IT managers must consider the business requirements for data access and retention, and balance those against the requirements for power, space, and cooling. In most cases, this will result in a multi-tier storage architecture that includes a mix of disk, data deduplication, and tape technologies.

Datasets with high access requirements may need to use conventional disk systems without data deduplication. However, they are only practical for a subset of data where the recovery time objective overrides other considerations such as long-term retention and total cost. Datasets with normal access requirements can use disk with data deduplication technology both as first site for backup data and for medium term retention. Data deduplication technology reduces space, power, and cooling requirements enough to make it practical to use disk as a retention medium for weeks or months without breaking the operational budget. However, as data needs to be retained for multiple quarters and years and the demand for fast recovery diminishes, the most effective retention medium for most users is tape. Tape cartridges in a tape library consume power at lower rates than any disk system, and tape cartridges stored in a vault consume the least of all, as well as providing the lowest aggregated storage cost per GB. The following chart compares these various types of storage in terms of power, space, and cooling.


Summary

IT managers are faced with growing concerns around data center power, cooling, and space requirements. Costs are on the rise and supply is becoming increasingly scarce. At the same time, IT managers must deal with ever-growing demands for protecting and retaining valuable data. Meeting these various demands requires a multi-tier storage architecture with the appropriate mix of disk, tape, and enabling technologies like data deduplication.

Labels:

1 Comments:

Blogger Unknown said...

This is a really good discussion on different unified storage options. For those who don’t know EMC’s solutions, I think this paper will help you see that they’ve got some really great options. And check out the new guarantee: with EMC’s unified storage systems you’ll use 20% less raw storage capacity than with other products. That means efficiency and cost savings for your company. http://bit.ly/ao57rm -- Brian, EMC Social Outreach Team

Tuesday, July 20, 2010 9:00:00 PM  

Post a Comment

<< Home