Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register
STRATEGIC DEVELOPER  

Bootstrapping the semantic Web

Tim Berners-Lee's quest to give the Web meaning receives aid from unexpected quarters

By Jon Udell  
December 03, 2004
 

It's tempting to draw parallels between the careers of Albert Einstein and Tim Berners-Lee. Both men made world-transforming breakthroughs and then pursued even grander visions. Einstein, of course, never found the unified theory he sought for three decades. A lot of people think Berners-Lee's vision of a semantic Web will prove equally elusive.

Free IT resource

Hear how top CIOs turn change into a competitive advantage.

Sponsored by HP

Free IT resource

Attend the SOA Executive Forum: Breaking SOA Bottlenecks SOAExecForum.com/may2007

Sponsored by InfoWorld

We can all imagine the desired outcome: a version of the Web where items are related explicitly, not merely by co-occurrence of words. But skepticism has greeted the "semweb" technologies that Berners-Lee has been spearheading in the W3C. The approach is based on what's called "ontology," which the W3C defines as "a representation of terms and their interrelationships." Critics argue that we'll never agree on (or consistently apply) an ontology -- and they point to Google as proof that we don't need to.

Two companies I've encountered recently think there's a middle ground. One is Semagix, which offers an application toolkit, called Freedom, to help developers create, and then build on, a domain-specific ontology. Take the case of an anti-money-laundering application. The ontology is derived from authoritative information about individuals and companies provided by the likes of Dun & Bradstreet and Hoover's. Given such a framework, automatic classifiers can read unstructured documents -- e-mail, news feeds, Web pages -- and attach them to the framework. As a result, Semagix says, you can answer questions such as, "Which recent news reports mention companies that share directors with company X"?

Digital Harbor is taking a similar approach. The company's PiiE (Professional Interactive Information Environment), originally pitched as a rich Internet app-dev toolkit, has lately shifted toward "business ontology." Digital Harbor's Fusion Server helps developers define a set of terms and relationships, populate that framework with structured data, and then attach unstructured data to it. And as does Semagix, Digital Harbor emphasizes fusing ontology and data so that users can "connect the dots."

Of course, it's never as easy as we'd like to imagine. Consider Eliyon, a company that's gathered public information about more than 22 million people to support sales, recruiting, and other applications. As it turns out, I am several of those people. In addition to my current title, InfoWorld Test Center lead analyst, I show up as executive editor of Byte Magazine and contributor to Linux Magazine. And while those were once accurate descriptions of me, I have never been a member of Blue Titan's board of advisors, and I am not the inventor of RSS.

It's true I could register with the site, coalesce my correct identities, and purge the wrong ones. But authenticating with a credit card in order to update a profile that Eliyon owns is a nonstarter for me. Back in June, on my Weblog, I suggested the alternative that would suit me: I'll maintain my own profile on the Web and syndicate my data to anyone who needs it.

Semantic-Web naysayers think people and organizations can't be bothered to assert machine-readable facts about themselves. And, today, that is undoubtedly true. But when others assert facts about you -- as they increasingly will -- the tide could begin to turn. Individual acts of self-defense may ultimately combine to bootstrap the semantic Web.





 


 
Jon Udell is lead analyst and blogger in chief at the InfoWorld Test Center.

  More of Jon Udell's column
  Jon Udell's Weblog

Newsletter Check out all of our free newsletters!
Enter e-mail address:




 

TOP NEWS:


»  Update: HP in talks to buy EDS for up to $13 billion
Deal would strengthen HP's competitive position against IBM, but still would leave it about $10 billion short of IBM's global services revenue

»  SOA Software buys LogicLibrary
Combination links governance automation, repository technologies

»  Sun to clarify JavaFX open-source plan later this year
Sun had promised that JavaFX would be open source, but a FAQ on Sun's site indicates that only parts of the RIA development family will be open source

»  Microsoft readies service packs for dev tools
Improvements to Visual Studio 2008 include database and Office 2007 Ribbon support

»  Cisco's TelePresence gets personal
The high-definition virtual meeting system will be available at a less expensive entry price for midsized businesses later this year

»  Developers' role shifting from apps to platforms
Untrained workers are moving into app dev space, pushing career developers into the platform space, a Sun engineer noted at JavaOne




Virtualization: A Step by Step Approach to Success
Your virtual machines can be up and running in a matter of minutes. HP and Citrix have integrated XenServer with HP ProLiant servers and management tools, powered by hardware-assisted Intel Virtualization Technology to enable high- performance, cost-savings solutions for server consolidation and disaster recovery. Sponsor: HP

»  Click here to view this Webcast
  The Data Protection You've Been Looking For
Enterprise data is of supreme importance. If you can't find it quickly, it's worthless. If you lose it, it's a crisis. This IT Strategy Guide explores how to keep your data safe.

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS  IT EXEC-CONNECT   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist