Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register
Page 2 of 2  «  Previous Page

XML databases evolve

 

Unlike Xindice, DB XML is not a client/server system; it is a library that you link into -- and that runs in the process space of -- your application. Bindings are available for numerous languages, including Java, C++, Perl, Python, Tcl (Tool Command Language), and PHP (PHP: Hypertext Processor). There are also several third-party bindings available for other languages.

Free IT resource

Open Source Business Conference (OSBC) May 22-23, 2007

Sponsored by OSBC

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

Much of what’s in the new 2.0 release is the direct result of user feedback. The preceding release handled documents as single entities, imparting an upper limit on the size of the document that DB XML could handle (that upper limit was typically set by available memory, and any XML document exceeding that limit was probably a good candidate for factoring). In this release, DB XML allows you to store documents either wholesale (as before) or per node -- carved up, if you will.

When you choose per-node storage, documents are taken apart and their individual nodes are stored in separate records in the database. Consequently, available disk space is the only real upper limit on the size of a document handled. The Berkeley DB system can deal with databases ranging in sizes as large as 256TB, but only a few people will hit that ceiling.

Document options
As with Xindice, DB XML’s storage uses a collections paradigm. You associate whole-document or per-node storage for a given collection; all documents in that collection are stored similarly.

Whole-document storage is best if your documents are reasonably small (measuring 1MB or less), and you must process each document intact. Also, documents retrieved from whole-document storage are byte-for-byte identical to the document that was placed in storage. That’s important if you want to be able to verify that the content of the document has not been meddled with -- for example, if you’ve added a digital signature to the document.

Per-node storage provides faster queries and updates, because the entire document need not be read in to be processed. And, as already stated, it allows you to manage extremely large XML documents.

DB XML 2.0 also has a new command-line tool. Like Xindice’s command-line tool, it’s the perfect way to familiarize yourself with the database’s capabilities. The commands accepted by the tool have a one-to-one correspondence with the product’s API. 

Sleepycat was in the process of finishing a tutorial for the command-line tool at the time of my review. I saw an early version that was already polished enough to be useful, and can say that the tutorial promises to be a worthy guide to the neophyte DB XML user.

DB XML 2.0 supports XPath and XUpdate as well as the more robust XQuery. As with Xindice, you can use the command-line tool to familiarize yourself with the syntax of these queries and update dialects. And, like Xindice, DB XML 2.0 provides numerous examples to work through and explore.

Quite a pair
Both Apache Xindice and Sleepycat Berkeley DB XML allow you to attach indexes to your databases for the purpose of speeding queries. DB XML, however, gives you greater control over the index type, and thereby allows you to fine-tune an index for the sorts of queries likely to take place.

In addition, the DB XML command-line tool will return the amount of time taken by a query, so you can experiment with different index types and query strategies to optimize performance.

Xindice and DB XML 2.0 are top-notch database libraries, although DB XML provides a greater range of features and is polished to a more impressive sheen. Nevertheless, I expect to see the Xindice project’s feature list lengthen over time. Improvements in Xindice will only benefit the wide and growing XML database community.


»  Previous Page | 1 | 2 



Apache Xindice 1.0

Apache Software Foundation, apache.org/xindice

Very Good  8.6
criteria score weight
Interoperability 9 20%
Performance 8 20%
Scalability 9 20%
Setup 9 20%
Documentation 7 10%
Value 9 10%

Cost:
Free

Platforms:
Linux, Windows (NT or better), Solaris, Mac OS X

Bottom Line:
Another fine product from Apache. Xindice is most easily approached by Java programmers, though other languages can be used with some work. Not as versatile as Berkeley DB XML, since it doesn't yet support XQuery, and the documentation needs work, but Xindice should improve with time.

About our Reviews and Scoring Methodology



Sleepycat Berkeley DB XML 2.0

Sleepycat Software, sleepycat.com

Excellent  9.3
criteria score weight
Interoperability 10 20%
Performance 9 20%
Scalability 9 20%
Setup 9 20%
Documentation 9 10%
Value 10 10%

Cost:
Free

Platforms:
Supports all major OSes

Bottom Line:
Berkeley DB XML is a killer XML database library running a layer above the bulletproof Berkeley DB. Sleepycat has improved this latest version greatly, adding XQuery support, the ability to manage large files with per-node storage, and new documentation to flatten the learning curve.

About our Reviews and Scoring Methodology



 


 
Rick Grehan is a contributing editor at InfoWorld. Contact him at rick_grehan@infoworld.com.
 

TOP NEWS:


»  You don't know tech: The InfoWorld news quiz
Match your weekly tech news wits against our snarky quiz master

»  Spinning off fabs would be risky for AMD, analysts say
AMD has expressed a desire to control chip-manufacturing costs, which has created speculation that the company might sell off its chip fabrication plants

»  Hackers find a new place to hide rootkits
A pair of security researchers has developed a new kind of rootkit, called an SSM, that hides in an obscure part of the processor that is invisible to antivirus apps

»  Top 10: Microsoft-Yahoo, XP SP3 woes, Sprint-Clearwire WiMax deal
This week's roundup of the top tech stories of the week include the demise of the Microsoft-Yahoo deal, Sun's JavaOne announcements, the Intel-OLPC beef, and more

»  Easing network congestion caused by virtual servers
Better I/O capability is an important and often overlooked aspect of getting the most out of server virtualization

»  Sun exec ponders OpenSolaris, Linux
In an interview, Ian Murdock, formerly with the Linux Foundation and now with Sun, discusses the company's open-source efforts and how to monetize them




BRINGING PERFORMANCE VALIDATION "INTO THE LIFECYCLE"
Today's enterprise apps are complex and ever-changing, which makes delivering high performance difficult. By virtualizing the behavior of application services and data in a VSE, teams can answer this challenge with validation best practices and test tools to ensure solid performance throughout the lifecycle. Register now to attend this webcast! Sponsor: ITKO

»  Click here to view this Webcast
  Storage is big, and getting bigger
The only certainty is that your requirement for storage will never be satisfied. While you clean out space and authorize POs, you might consider another alternative: outsourcing. The best way to deal with storage might be to let someone else deal with it. Sponsored by SGI

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 
  • Protect Your Data with SSL - Discover how to increase customer confidence in your site with the latest solution in SSL, Extended Validation (EV) SSL ...
  • Need simple, low cost server virtualization? - Do more with less. Support fewer servers. Simplify disaster recovery. Implement proven, easy-to-use server virtualization...
  • Virtually Limitless Virtual Storage - Do you need virtualization space savings of 50% or more with virtually no performance impact? You might be able to get storage...
  • Invisible IT? - The goal of IT is to become an invisible entity within a larger organization. Eliminating visibility and road blocks IT ...
  • It Really Is Easy to be Green - "Green IT" is a popular concept. And IT organizations are learning the influence that IT purchase decisions have on data...
  • Key Strategies For SOA Testing - SOA requires a unique approach to testing. Unless you're willing to reorient your testing procedures and technology now,...

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS  IT EXEC-CONNECT   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist