Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

Vivísimo Velocity races ahead of the search pack

Souped-up Version 5 combines high speed with killer control

By Mike Heck
September 08, 2006
 

Choosing an enterprise search product is often fraught with compromise. If you, say, pick something with a simple search interface to appease users, administrators will likely be restricted in indexing databases or customizing results. From the start, Vivísimo's Velocity hasn’t asked organizations to make such concessions. Its search engine indexes content as is, so you don’t have to preprocess or reformat documents. The content integrator allows users to perform simultaneous searches through diverse sources — internal documents, intranets, Web, or syndicated news feeds. Last, the clustering engine organizes all these search results into categories intelligently made from words and phrases in the results, not some arbitrary popularity ranking.

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

Free IT resource

Attend the SOA Executive Forum: Breaking SOA Bottlenecks SOAExecForum.com/may2007

Sponsored by InfoWorld

TEST CENTER DAILY BLOG

Track the latest product reviews and news from the InfoWorld Test Center.




Vivísimo Velocity 5

Vivísimo , http://vivisimo.com/

Excellent  8.9
criteria score weight
Ease-of-use 9 20%
Integration 9 20%
Management 9 20%
Performance 9 20%
Scalability 8 10%
Value 9 10%

Cost:
Starts at $25,000 annually

Platforms:
Windows, Solaris, and Linux

Bottom Line:
Vivísimo Velocity, which is based on an SOA, melds a search engine, content integrator, and clustering engine. It’s also one of the few solutions that searches and indexes enterprise content without requiring pre-built taxonomies and then presents categorized results. Version 5 reaches inside databases, SharePoint, and Documentum — and restricts results based on users’ authentication. Furthermore, this update should please IT staff with its simplified administration.

About our Reviews and Scoring Methodology

Still, Version 5 has something new for both end-users and system managers, making it an even stronger search solution for Fortune 1000 companies and government agencies. To improve the user experience, Vivísimo added role-based search so that enterprises can target results, for example, to employees in sales or to human resources.

Yet Velocity 5’s biggest changes are in the less visible underlying technology and administration. Topping the list are connectors for databases, Microsoft SharePoint portals, and EMC Documentum Docbases. Because these sources usually require authentication, Vivísimo Velocity 5 easily connects to LDAP or Active Directory servers and restricts viewing documents (or content sections) based on a user’s rights. Moreover, administration is more straightforward compared with Version 4.5.

Gentlemen, start your engine

In approximately 30 minutes I’d installed Velocity 5 under Red Hat Enterprise Linux 4. Vivísimo consolidated some of the management functions and reorganized parts of the Web administration interface, which compressed down to a few hours the process of creating my test scenarios — searching an intranet, external Web sites, two SharePoint portals, and several Microsoft SQL databases. More elaborate customizations required a few days; Vivísimo also offers professional services and commits to completing most complex projects within 90 days.

Crawling and indexing sites or documents is as simple as selecting the type of resource (such as a database) and pointing to the server. The control that IT staff has over content extraction and normalization — without much effort — is significant.

Using a simple form, for example, I adjusted the HTML converter so that the crawler ignored common navigation that appeared on each page but gave more weight to link and tag density. I also boosted the priority of certain pages that I wanted to appear at the top of results. These tweaks, along with Velocity’s own relevance-ranking algorithms (freshness, term proximity, link analysis), generated results that were more accurate than other products I’ve tested.

Similarly, I adjusted XSL templates to change the appearance and behavior of the search interface and results page. Things got even more interesting when I used “formula-based sorting” to retrieve very specific results, a feature that truly improves the search experience. For instance, based on metadata, I created graphical sliders that allow users to quickly search a Web site’s product section and pick servers that employed specific processors. Or, for a real estate site, sliders could allow users to easily select homes in specific price ranges, number of rooms, or land size.

Mashups extraordinaire

Federation is another area where Velocity shows usability and creativity. A few clicks from the administration interface bundled my internal search sources — so a single query contained relevant results from all my indexes. Furthermore, the software’s SOA allows you to just as easily federate searches from more than 60 external sources, such as the BBC, CNN, New York Times, Washington Post, and the National Library of Medicine — along with results from the popular consumer search engines, such as Google, MSN, Yahoo. It’s pretty easy to adjust the built-in setups — or to create your own — to include most other external sources, such as InfoWorld’s own Verity UltraSeek search.

On the flip side, Velocity returns all its data, including the clustering tree and search results, as an XML feed. This permits you to integrate search into existing applications.


Click for larger view.
Velocity’s search engine breaks tradition because it supports various relationships — one-to-many, many-to-one, and many-to-many — between result documents and the original source. With most products, each search result corresponds to a single URL. But Velocity can take a single PDF document that discusses several ideas and break it into separate results.

Conversely, the system will combine several URLs that cover a common thought into a single result — a virtual document — without any special document preprocessing. Even more significant is that Version 5 supports security at each of these content-block levels within a virtual document. That is, Vivísimo could create a virtual document about a product your enterprise produces — containing marketing literature, white paper information, and pricing. But because the pricing part is sensitive, that piece shows only in results presented to authorized managers.

Vivísimo didn’t fool with its successful clustering interface in this version, but I discovered a few improvements. Search results can be saved and exported as plain text, HTML, or RIS citations — used by products such as EndNote, ProCite, or Reference Manager. This helps users revisit searches and collaborate by sharing results. Additionally, results now include a preview button, which allows you to examine a Web page or document before opening it.

Vivísimo Velcoity 5 delivers what other search products generally lack: a pleasant experience, characteristic of consumer search, that’s supported by enterprise security and customization options. The system federates searches of structured and unstructured content, which are organized into clusters. Therefore, this retrieval platform is appropriate for any organization needing to understand large — and disparate — information repositories. And with the pressure on IT projects to deliver results, Velocity can be deployed rapidly.

Although enterprise search products such as exalead and Siderean continue to get better, Velocity edges ahead with its collection of excellent functionality and administration tools.





 


 
Mike Heck is a contributing editor for the InfoWorld Test Center.
 

TOP NEWS:


»  Troubleshooting tool for Java offered
Sun's Java VisualVM open-source technology views apps while they run on a JVM and is billed as all-in-one solution

»  Python backing eyed for NetBeans
Scripting language capabilities of the open-source IDE continue to expand

»  Microsoft sets Windows XP SP3 automatic download for Thursday
The latest service pack for Windows XP will be pushed to Automatic Update at 7a.m. EDT on July 10

»  Real Software, Veryant bolster dev tools
RealBasic, Cobol apps platforms get improvements

»  Microsoft sets hosted-services pricing, irks partners
By offering 38 percent discount to customers who buy entire hosted business productivity suite, Microsoft undercuts partners selling similar services

»  Adobe readying new mashup tool for business users
Mashup interface code-named 'Genesis' will open up desktop 'workspace' combining business application data, documents, analytics, and instant messaging




5 Things You Need to Know About Storage Virtualization
This Webcast feature insights from various InfoWorld articles, as well as primary research conducted by InfoWorld and sister company IDC to better understand demand drivers, challenges and opportunities provided by storage virtualization, as well as other flavors or approaches to virtualization Sponsor: HP

»  Click here to view this Webcast
  The Silver Lining: Cloud Computing
This IT Strategy Guide digs deep into cloud computing helping put you ahead of the curve on this hot topic. It explores the differences between cloud computing, grid computing and utility computing and then helps you see where and how each applies to your business. Sponsored by Box.net

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 
SEE ALSO
• Desktop search gets down to business
• Windows Vista offers view of integrated desktop search
• exalead and Siderean guide users down differing paths to data troves


FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist