April 28, 2009

Data deduplication for SMEs: what to look out for

Data redundancy is a primary contributor to the explosive growth in data. Initially deduplication focused on eliminating data redundancy in specific cases like full backups, e-mail attachments, and VMware images. Over time, however, customers have noticed the pervasiveness of duplicated data.

Test and development data multiplies across an organization: replication, backup, and archiving create multiple data copies scattered across your enterprise, and sometimes users simply copy data to multiple locations for their own convenience.

[ Get sage advice on IT careers and management from Bob Lewis in InfoWorld's Advice Line blog and newsletter. ]

Studies estimate that multiple copies of data now require organizations to buy, use, and administer two- to 50-times more storage than they would actually need with deduplication. Given that impact on the bottom line, organizations are recognizing that, far from being a niche technology, deduplication needs to become an integrated and mandatory element in their overall IT strategy.

Candidates for deduplication

The best candidates for deduplication solutions are mid-size or enterprise customers experiencing issues with:

  • Exponential growth of data, resulting in out-of-control storage costs.
  • Shrinking or inadequate backup windows.
  • Longer recovery times, especially for older data not on the primary backup media.
  • Cost, risk and complexity of sending tapes to disaster recovery (DR) sites.
  • Slow throughput on both backup and archiving systems.
  • eDiscovery, compliance and SLA requirements.
  • Bottlenecks in expensive LANs and WANs.

Features to look for in a deduplication solution

When evaluating deduplication solutions, IT decision-makers should look for the following essential features:

  • Ability to scale without expensive hardware upgrades.
  • More recovery points and with shorter recovery times.
  • Point-and-click deduplication management.
  • Built-in reporting of deduplication across vendors, data types, sources and platforms.
  • Tight integration with all necessary applications to minimize end-user downtime.
  • Single solution simplicity for ease of deployment and administration.
  • Ability to rapidly and securely recover business-critical data across all locations, applications, storage media and points-in-time.
  • D2D2T-optimized for backup performance and reliable data recovery.
  • Fast, comprehensive search to aid in recovery.
  • Data integrity and security features.
  • Built-in DR capabilities.
  • Data classification.
  • Cost-effective and timely eDiscovery.
  • Use of a common technology platform.
  • Single point of management.

Challenges in Deploying a deduplication solution

Like disk-to-disk backup or server virtualization, deduplication should not be evaluated as an isolated product or feature. Customers must consider the broader implications of deduplication within the context of their entire data management and storage strategy. Common challenges in deploying a deduplication solution are related to performance, increased complexity of management, and proliferation of deduplicated data silos.

Performance

Finding and eliminating redundant data can be extremely expensive for an appliance-based deduplication solution. Without contextual knowledge of the data that it deduplicates, it faces significant challenges scaling to the size of most enterprises.

additional resources
White Paper - How to Improve Delivery of Advanced Web Applications

White Paper

Virtual Workforce: The Key to Expanding The Business While Cutting Costs

Get the independent advice and expertise you need to support a virtual workforce.

Go inside:
The three-step approach to making a virtual workforce a reality.
The four flavors of client virtualization technologies.
The three key initiatives that solve IT challenges.
Download now »
White Paper: Successfully Secure Your Wireless LAN With Wi-Fi firewalls.

White Paper

Addressing Linux Threats Leveraging Fewer Resources

The increase in Linux popularity has increased the frequency and sophistication of malware attacks. Read this 2 page white paper now to learn how you can protect your Linux environment with real-time protection that is certified by all major Linux vendors.

Download now »
White Paper - The 2009 Handbook of Application Delivery

White Paper

The 2009 Handbook of Application Delivery

Ensuring acceptable application delivery will become even more difficult over the next few years. As a result, IT organizations need to ensure that the approach that they take to resolving the current application delivery challenges can scale to support the emerging challenges. This handbook elaborates on the key tasks associated with planning, optimization, management and control and provides decision criteria to help IT organizations choose appropriate solutions.

Download now »
White Paper - Is Your Backup System Outdated?

White Paper

Mid-range Storage Considerations

A common misconception is that mid-range storage requirements are dramatically different than that of a larger enterprise. Mid-range storage users may require less capacity, but they have similar functionality and management requirements. This ESG paper examines mid-range storage needs and reviews a new solution that adjusts size while retaining value, performance and functionality.

Download now »

Sign up to receive InfoWorld Resource Alerts

Subscribe to the Today's Headlines: First Look Newsletter

Find out what will be news for the day, with our first-thing-in-the-morning briefing.

©1994-2010 Infoworld, Inc.