Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register
STORAGE INSIDER  

Bringing CAS to the masses

The technology's inventor reveals a software-only content addressable storage solution that's sure to spark some buzz

By Mario Apicella
June 01, 2006
 

Back in 2001 when EMC bought Belgian startup FilePool, only a few people knew what CAS (content addressable storage) was. Even fewer people knew that FilePool's Paul Carpentier was the mind behind CAS.

Free IT resource

Virtualization Insights from Top Experts - Learn how virtualization gets real!

Sponsored by Dell

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

Nevertheless, CAS was (and still is) a major revolution in storage because it identifies, I believe for the first time ever, a separated data custodian role. In essence, it's a system that provides reliable, centralized archival services to a variety of clients.

Here's a quick recap of how CAS works: A client application delivers a file to a CAS system and receives in return a token that uniquely identifies the file but is also a verification key that certifies the integrity of the data content.

Because of its characteristics, a CAS system is a simple and effective repository for long-term archival. It's especially good for fixed-content files that either don't change at all or seldom change over their lifetime, such as the digital image of a contract.

Back to the news at hand: About a year after acquiring FilePool, EMC announced Centera, a new product line based on CAS. Since then, EMC has made that technology one of the cornerstones of its ILM (information lifecycle management) and compliance strategies, including eDiscovery.

If you've ever thought that -- however interesting it may be -- Centera is a bit too rich for your blood and your budget, you're not alone. In fact, CAS inventor Carpentier has founded a new company, Caringo, to prove that point.

Paul Carpentier is CTO of the startup Caringo, and President Jonathan Ring and CEO Mark Goros round out the management team. In case you missed it, the name of the company comes from the initial letters of the three managers' last names.

At the end of May, Caringo announced its first product, CAStor, a software implementation of CAS. "We deliver on a USB key a software product that runs on x86 commodity hardware," Goros says. "Sixty seconds after you boot from that USB key, you have a running CAStor node."

Adding a second node is as easy as booting a second machine: CAStor expands automatically as you add more servers and will redistribute its data content to take advantage of the new nodes.

CAStor can scale to hundreds or thousands of nodes linked by Gigabit Ethernet with little or no human manipulation. "It manages itself, it configures itself, and you can add nodes while it's running," Goros remarks.

In addition to the CAS management code, CAStor includes a customized version of Linux. "We have removed anything that was not needed, including the possibility of remote access," Carpentier explains. "Our application turns a general-purpose server into a single-minded appliance that works together with others to preserve the integrity of fixed content files."

When I ask to see a screenshot of its management GUI, Carpentier has a simple answer. "That would be a problem," he says. "We don't have a management GUI."

"There is no centralized access, and no gateway. We use a subset of HTTP 1.1 [to communicate with clients]," Carpentier continues, explaining that a client can contact any node and that the request to add a new file or to retrieve existing files will be broadcast to every node in the system. The node that wins the bid gets the job, Carpentier explains. 

CAStor is exciting because it could provide the benefits of CAS at a price that a variety of new users -- from entry level to large scale corporations -- may find attractive. In fact, CAStor, or future products based on that technology, could become as popular for data archival as NAS appliances are for storing live files.

Caringo is licensing CAStor at $500 per disk drive, but customers will have to develop their own applications to interact with the system. "From a Web browser, you can retrieve data but that's just for demonstration purposes," Carpentier adds.

I expect Caringo to attract the attention of numerous potential partners in the archival and data protection business, for example, or maybe vendors of server hardware with a vested interest in storage. As for the company's take on those scenarios, "We look forward to doing partnerships with ISVs and software integrators," Goros says. 

Judging from the envy surrounding EMC's success with Centera, I would be surprised if those partnerships take very long to form.





 


 
Mario Apicella is a senior analyst at the InfoWorld Test Center.

  More of Mario Apicella's column
  Mario Apicella's Weblog

Newsletter Check out all of our free newsletters!
Enter e-mail address:




 

TOP NEWS:


»  Four quick tips for choosing an IM security product
71 percent of businesses will invest in real-time messaging this year. If you're one of them, be sure to protect your enterprise

»  Forrester analysts ID hot IT jobs
Research group finds 16 IT roles with a promising future

»  Nvidia claims 10 hours of HD video on Tegra chip
The Tegra 600 and 650 can be used with hard disk drives and are designed partly for mobile Internet devices

»  Database vendors add Google's MapReduce
Greenplum and Aster Data Systems will support Google's programming technique, developed for parallel processing of large data sets across commodity hardware

»  Network management: Tips for managing costs
New technologies, changing requirements, and ongoing equipment maintenance and upgrades cost money, but there are ways to manage expenses

»  EMC targets SMBs, branch offices with new low-end storage
Celerra NX4 highlights include thin provisioning, snapshot technology for data recovery and backups, and Web-based console for management of storage volumes




Virtualization: A Step by Step Approach to Success
Your virtual machines can be up and running in a matter of minutes. HP and Citrix have integrated XenServer with HP ProLiant servers and management tools, powered by hardware-assisted Intel Virtualization Technology to enable high- performance, cost-savings solutions for server consolidation and disaster recovery. Sponsor: HP

»  Click here to view this Webcast
  Virtualization Solutions Guide
This comprehensive IT Strategy Guide covers Virtualization and puts you at the forefront of the discussion. You'll learn all you need to know from the cost of virtualization, how to implement it for your business, how to back it up safely and which products are best. Sponsored by Riverbed

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist