April 28, 2009

Data deduplication for SMEs: what to look out for

Data redundancy is a primary contributor to the explosive growth in data. Initially, deduplication focused on eliminating redundancy in specific cases such as full backups, e-mail attachments, and VMware images. Over time, however, customers have noticed just how pervasive duplicated data is.

Test and development data multiplies across an organization; replication, backup, and archiving create multiple copies scattered across the enterprise; and sometimes users simply copy data to multiple locations for their own convenience.

Studies estimate that multiple copies of data now require organizations to buy, use, and administer two to 50 times more storage than they would need with deduplication. Given that impact on the bottom line, organizations are recognizing that, far from being a niche technology, deduplication needs to become an integrated and mandatory element of their overall IT strategy.
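To make the "times more storage" figure concrete, here is a minimal sketch of how such a ratio can be computed: total bytes held across all copies divided by the bytes needed to keep each distinct item once. The function name `dedup_ratio` and the sample data are hypothetical, chosen only for illustration; real products measure this in their own ways.

```python
import hashlib


def dedup_ratio(copies):
    """Return (logical_bytes, stored_bytes, ratio) for a list of byte strings.

    logical_bytes: space used when every copy is stored in full.
    stored_bytes:  space used when each distinct item is stored once.
    """
    logical = sum(len(c) for c in copies)
    unique = {}  # SHA-256 digest -> size of the item it identifies
    for c in copies:
        digest = hashlib.sha256(c).hexdigest()
        unique.setdefault(digest, len(c))
    stored = sum(unique.values())
    ratio = logical / stored if stored else 1.0
    return logical, stored, ratio


# Hypothetical data: one 1,000-byte attachment sitting in ten mailboxes,
# plus one unrelated 1,000-byte file.
attachment = b"A" * 1000
other = b"B" * 1000
print(dedup_ratio([attachment] * 10 + [other]))  # (11000, 2000, 5.5)
```

In this made-up case the organization holds 5.5 times more data than it would with whole-item deduplication, which falls inside the two-to-50 range cited above.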

Candidates for deduplication

The best candidates for deduplication solutions are mid-size or enterprise customers experiencing issues with:

  • Exponential growth of data, resulting in out-of-control storage costs.
  • Shrinking or inadequate backup windows.
  • Longer recovery times, especially for older data not on the primary backup media.
  • Cost, risk and complexity of sending tapes to disaster recovery (DR) sites.
  • Slow throughput on both backup and archiving systems.
  • eDiscovery, compliance and SLA requirements.
  • Bottlenecks in expensive LANs and WANs.

Features to look for in a deduplication solution

When evaluating deduplication solutions, IT decision-makers should look for the following essential features:

  • Ability to scale without expensive hardware upgrades.
  • More recovery points with shorter recovery times.
  • Point-and-click deduplication management.
  • Built-in reporting of deduplication across vendors, data types, sources and platforms.
  • Tight integration with all necessary applications to minimize end-user downtime.
  • Single-solution simplicity for ease of deployment and administration.
  • Ability to rapidly and securely recover business-critical data across all locations, applications, storage media and points-in-time.
  • Disk-to-disk-to-tape (D2D2T) optimization for backup performance and reliable data recovery.
  • Fast, comprehensive search to aid in recovery.
  • Data integrity and security features.
  • Built-in DR capabilities.
  • Data classification.
  • Cost-effective and timely eDiscovery.
  • Use of a common technology platform.
  • Single point of management.

Challenges in deploying a deduplication solution

Like disk-to-disk backup or server virtualization, deduplication should not be evaluated as an isolated product or feature. Customers must consider the broader implications of deduplication within the context of their entire data management and storage strategy. Common challenges in deploying a deduplication solution are related to performance, increased complexity of management, and proliferation of deduplicated data silos.

Performance

Finding and eliminating redundant data can be extremely expensive for an appliance-based deduplication solution. Without contextual knowledge of the data it deduplicates, such an appliance faces significant challenges scaling to the size of most enterprises.
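As a rough illustration of where that expense comes from, the sketch below deduplicates data in fixed-size blocks with no knowledge of what the data is. It is an assumption-laden toy, not a description of how any particular appliance works: the `BlockStore` class, its in-memory index, and the 4-byte block size are all hypothetical. Every block written costs one hash computation and one index lookup, and the index grows with every unique block.

```python
import hashlib


class BlockStore:
    """Toy content-agnostic deduplicating store (hypothetical, in-memory)."""

    def __init__(self, block_size=4096):
        self.block_size = block_size
        self.index = {}           # block digest -> block bytes, stored once
        self.hashes_computed = 0  # crude measure of work done

    def write(self, data):
        """Store data and return the list of digests needed to rebuild it."""
        recipe = []
        for off in range(0, len(data), self.block_size):
            block = data[off:off + self.block_size]
            digest = hashlib.sha256(block).digest()
            self.hashes_computed += 1
            if digest not in self.index:  # only unseen blocks use new space
                self.index[digest] = block
            recipe.append(digest)
        return recipe

    def read(self, recipe):
        """Rebuild the original bytes from a recipe returned by write()."""
        return b"".join(self.index[d] for d in recipe)


store = BlockStore(block_size=4)
first = store.write(b"aaaabbbbaaaa")     # 3 blocks, 2 of them unique
second = store.write(b"xaaaabbbbaaaa")   # same data shifted by one byte
print(store.hashes_computed, len(store.index))  # 7 6
```

The second write contains almost exactly the same bytes as the first, yet because the blocks no longer line up, none of them match and all four are stored as new. A system that understands the structure of the data it handles has more options for avoiding this kind of miss.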

©1994-2009 Infoworld, Inc.