February 17, 2006

Higher Availability Future: Autonomic Computing or Recovery Oriented Computing?

It is fascinating to me that so many smart people can disagree on the best future approach to higher availability infrastructure. The

autonomic computing crowd led by IBM is touting self-healing and self-regulating computing systems. On the other hand the recovery oriented computing (ROC) folks led by researchers at Berkeley and Stanford declare failures are inevitable. ROC proposes the key to higher availability is helping humans to recover infrastructure from failures faster.

I have written here previously about ROC, but its time to start a dialog on comparing and contrasting these two radically differing views on the future of better infrastructure availability.

You notice I am talking about infrastructure availability not individual system availability. As an industry we have focused for decades on building more reliable individual components and systems. But now the reliability problem has moved to a different level. Take all these highly reliable components and systems and put them together with software developed by multiple vendors or adopted from different open source projects and the reality of complex systems settles in.

Can we build autonomic computing infrastructure that is self-healing and self-regulating beyond simple problems and single systems? Or will humans always be an important part of repairing and recovering IT infrastructure?

Our friends from Berkeley and Stanford offer an interesting perspective dubbed the Ironies of Automation. Their argument goes something like this.

Automation does not remove human influence, but instead reduces IT personnel understanding and can actually make their job harder. Automation increases complexity, reduces visibility and provides no day-to-day interaction and learning. ROC argues for better tools to help, not replace people.

So what do you think? Autonomic Computing or Recovery Oriented Computing? Which will lead us to higher availability infrastructure? Send me your vote to thebaum@splunk.com,

Close

On Twitter now

Networking

Powered by Twitter

On Twitter now

White Paper

D2D Virtual Tape Library Replication Primer

This whitepaper explains the terminology and concepts behind Data Replication technologies and establishes some sizing rules through worked examples. Learn the new paradigm in disaster tolerance—protect data anywhere.

Download now »

White Paper

An Alternative to Virtualization for Datacenter Cost Savings

Server virtualization is a popular option for dealing with mounting datacenter costs. Another equally promising approach is the use of an Application Delivery Controller. Citrix NetScaler provides a low-cost way for organizations to reduce their server count and accrue cost savings from a reduction in space, cooling, power and personnel.

Download now »

White Paper

Why Your Firewall, VPN, and IEEE 802.11i Aren't Enough to Protect Your Network

The emergence of WLANs has created a new breed of security threats to enterprise networks.

Included in HP ProCurve WLAN solutions is security technology that alleviates threats from WLANs through:
* Monitoring wireless activity inside and out of the enterprise
* Classifying WLAN transmissions into harmful and harmless
* Preventing transmissions that pose a security threat to the enterprise network
* Locating participating devices for physical remediation

Download now »

White Paper

Bringing the Edge to the Data Center

Effectively address data protection challenges, implementing solutions that help store and protect business–critical data while cutting costs and improving efficiency and reliability.

Download now »

Sign up to receive Networking Resource Alerts

Subscribe to the Today's Headlines: First Look Newsletter

Find out what will be news for the day, with our first-thing-in-the-morning briefing.

©1994-2009 Infoworld, Inc.