Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register
STRATEGIC DEVELOPER  

Building connection engines with metadata

Aggregated search results based on metadata create more compelling views of content

By Jon Udell  
June 14, 2006
 

In “Scan This Book!” -- a May 14 manifesto published in The New York Times Magazine -- Wired’s Kevin Kelly explores the copyright battle provoked by Google’s ambition to digitize millions of library books. It’s ultimately a clash of business models, he concludes. In a networked world, where copying is implicit in every transfer of information, copies lose their direct economic value but gain indirect value as “discovery tools” that attract attention, sponsorship, and subscription.

Free IT resource

Hear how top CIOs turn change into a competitive advantage.

Sponsored by HP

Free IT resource

Attend the SOA Executive Forum: Breaking SOA Bottlenecks SOAExecForum.com/may2007

Sponsored by InfoWorld

Search is the game-changer. “Things can be found by search,” Kelly adds, “only if they radiate potential connections.” Yes, but that leads to a more nuanced view of search than the one Google and its competitors have popularized. For the past few months I’ve been revamping search on InfoWorld.com. Full-text search works more effectively now and is augmented by streams of metadata and by RSS syndication. It’s all about making the site a better connection engine.

On my blog, I’ve chronicled the development of a pair of applications called InfoWorld Power Search and InfoWorld Metadata Explorer. Both exploit three kinds of metadata to turbocharge the discovery of InfoWorld articles: first, structured document titles that include key attributes, such as date, type, and author; second, tags assigned by way of del.icio.us; third, subtitles and lead paragraphs.

In InfoWorld Power Search, aka iws, I use these metadata streams to add value to the output of our Ultraseek search engine. The raw Ultraseek results are ordered by relevance, but that’s begging the question: Relevant to whom? For what purpose? Using iws, you can order results by date, type, and author, and you can evaluate the results in the context of their tags, subtitles, and lead paragraphs.

In InfoWorld Metadata Explorer, aka iwx, the same metadata streams add value to del.icio.us. Compare the results for the tag “vista,” for example, in del.icio.us and in iwx. It’s the same set of URLs, but in iwx they’re decorated with extra metadata. Those metadata elements aren’t just passively displayed; they’re active filters, too. Clicking a tag filters the view to include just items with that tag. Clicking an author’s name adds another filter for items by that author.

These applications blend search and navigation in interesting and powerful ways. Because every view is fully specified by a URL, they radiate a lot of connections for people to use. Under the covers, they also use RSS feeds to radiate connections that people and machines alike can use.

Like many sites, InfoWorld.com offers a set of topical RSS feeds. Now that every iwx view can be seen through an RSS lens, that set is vastly enlarged. Suddenly we have feeds for Vista, Cisco, and many other topics. Cool! But not in the obvious way. Before iwx offered an RSS feed of InfoWorld’s Vista articles, del.icio.us did. Frankly, neither version is very interesting in and of itself. If you want to syndicate articles about Vista, ours are only some of the ones you’ll want to see in that feed.

Aggregated views are the ticket. And the key point is that the iwx version of the feed encourages smarter aggregation. Metadata is what makes the interactive experience in iwx more compelling than its del.icio.us counterpart. By syndicating that metadata, I’m inviting others to more richly contextualize their aggregations of our stuff.

If other publishers will return the favor, I’ll gladly make better use of theirs. Any takers?





 


 
Jon Udell is lead analyst and blogger in chief at the InfoWorld Test Center.

  More of Jon Udell's column
  Jon Udell's Weblog

Newsletter Check out all of our free newsletters!
Enter e-mail address:




 

TOP NEWS:


»  Parts of San Francisco network still locked out
Administrators are still locked out of the city's VoIP system and LANs within the Sheriff's Department and the Recreation & Park Department

»  Intel says Moblin update coming soon
Open-source effort set for mobile Linux should have an alpha-level release in a few weeks

»  Are virtual firewalls a solution for VM security?
Virtual firewalls can be a useful security tool, but their efficacy depends heavily on how you have set up your networks

»  Ubuntu to unveil new version of Launchpad next week
Ubuntu's beta community still has a long way to go to achieve the popularity of competitors such as SourceForge.net

»  Oracle unveils access management suite
Oracle's suite includes a new server that provides controls to fine-tune user privileges

»  5 ways the iPhone 3G still lags in enterprise
Despite Apple's improvements, its iPhone 2.0 software remain less competent and less tested than its BlackBerry and Windows Mobile counterparts




Take control of your content- leverage Microsoft SharePoint
Microsoft Office SharePoint Server (MOSS) offers core content management designed for a broad user population. Attend this webcast to learn how to implement a strategy that allows for the coexistence of both MOSS and advanced ECM solution within the same IT environment. Sponsor: IBM

»  Click here to view this Webcast
  Zombie PCs Are Attacking Your LAN
A recent study showed that malware-infected zombie PCs are now a bigger threat to ISPs and Web infrastructure than DoS attacks. As this brand new IT Strategy Guide explains, an increased use of peer-to-peer techniques by the attackers has made it harder to fight back. Download now, compliments of Verio:

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist