Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register
STRATEGIC DEVELOPER  

The two-way data web

Atom and RSS are evolving into tools for creating loosely coupled databases on Internet scale

By Jon Udell  
November 23, 2005
 

Two years ago, I gave the keynote address on the opening day of XML 2003. The next day, Adam Bosworth delivered a weirdly complementary keynote, in which he began to lay out an idea he’s been developing ever since, first at BEA and now at Google. The idea, in a nutshell, is that the truly scalable databases of the future will be more like the Web than like Oracle, DB2, or SQL Server.

Free IT resource

Open Source Business Conference (OSBC) May 22-23, 2007

Sponsored by OSBC

Free IT resource

Virtualization Insights from Top Experts - Learn how virtualization gets real!

Sponsored by Dell

In last month’s ACM Queue, Bosworth elaborated on some of the lessons the Web has taught us about simplicity, human accessibility, “sloppily extensible” formats, the social dimension of software, and loose coupling. But he also introduced a key technical point about RSS and Atom, the feed formats powering the blog revolution. These formats represent sets of items. Typically, the items contain Weblog postings, but they can also contain XML fragments that represent anything under the sun. What’s more, items can link to other items or collections. Bosworth argues that this architecture lends itself to aggressive scale-out, decentralized caching, and grassroots schema evolution, all of which tend to elude conventional databases.

There’s no free lunch, of course. When you query this RSS/Atom data web, you should expect more structural precision than full-text search affords, but you shouldn’t plan on fast execution of complex nested queries.

We’ve yet to colonize the middle ground between these extremes, and I don’t think anyone really knows what the sweet spot will turn out to be. I’ve gotten plenty of mileage out of XPath and XQuery, and my dream is that these XML-oriented query disciplines can be federated at large scale. But first things first: We need to create the data web. And recently, two leading figures have dropped major hints about how that’s going to happen.

The first was Bill Gates, who, in a September interview, told me, “the RSS data web is a natural development coming out of the acceptance of XML ... and we’ve got some ideas internally ... about making RSS work two-way.”

Historically, RSS has been a read-mostly affair. There are APIs through which blogging tools can inject content into publishing systems, which then reflect it back out as XML feeds. But while the blogosphere has at last realized the vision of a two-way Web, RSS as a data transport remains largely asymmetric. Microsoft evidently wants to change that.

The second and much more explicit hint appeared a month later in Adam Bosworth’s ACM Queue article. Atom is both a feed format and a publishing protocol. The latter, Bosworth noted, is “a simple HTTP-based way to INSERT, DELETE, and REPLACE” entries within a feed.

Microsoft developer/blogger Dare Obasanjo responded with a question. “Perhaps,” he asked, “this Atom store, accessible via Atom feeds and the Atom API, is [the rumored] Google Base?” I’d say that’s a good guess.

We’ll surely see more squabbling within the already fragmented world of lightweight XML syndication. But while the RSS feed format won the first round, and I suspect the Atom API will win the next one, don’t take your eye off the ball. This game isn’t about formats and APIs; it’s about the emergence of a data web made of loosely coupled sets of XML fragments that people and processes can easily read and write. Bring it on!





 


 
Jon Udell is lead analyst and blogger in chief at the InfoWorld Test Center.

  More of Jon Udell's column
  Jon Udell's Weblog

Newsletter Check out all of our free newsletters!
Enter e-mail address:




 

TOP NEWS:


»  Coverity buying software build management vendor Codefast
The acquisition, Coverity's first, is intended to help developers with build process

»  Google steals search share from rivals
Google's share of U.S. search queries went up to 61.6 percent in April, from 59.8 percent in March, as it took market share from rivals like Yahoo, Microsoft, and Ask.com

»  Update: Microsoft's embrace of ODF cautiously welcomed
Of concern is whether Microsoft's implementation of ODF within Office will handle documents with the same or better performance as competing suites

»  Study: 25 percent of Web projects over budget
Frequently changing requirements and too many people having a say in the matter are two reasons Web projects end up over budget and late, study says

»  Facebook provides redesign details
Latest plans in Facebook's redesign call for profile pages to evolve from a single repository of content and applications into a tabbed interface

»  Mozilla targets Firefox 3.1 release for later this year
Company exec reveals in blog that features held back from Firefox 3 are being hammered out for the next edition, which could come as soon as late 2008




Are you ready for event-driven business?
"Faster than a speeding bullet" doesn't just refer to superheroes anymore, it's the velocity your business needs to compete. In this webcast you will learn strategies you can implement today that will keep your systems ahead of the increased business velocity. Sponsor: Progress Sonic

»  Click here to view this Webcast
  The Data Protection You've Been Looking For
Enterprise data is of supreme importance. If you can't find it quickly, it's worthless. If you lose it, it's a crisis. This IT Strategy Guide explores how to keep your data safe.

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist