Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

Thunderstone Search Appliance finds the needle

Enterprise Web searcher combines ease of use, powerful searching, and blazing speed

By Wayne Rash
October 24, 2003
 

The Thunderstone Search Appliance is designed to be a nearly plug-and-play solution to enterprise Web searching. You’re supposed to be able to install the Linux-based 1U appliance in a rack, plug in a network cable and a power cable, and begin indexing. While the reality is a little more involved, Thunderstone is clearly onto something. I found that the process of getting the Search Appliance up and running was quick and easy, and that once running, it was a very powerful search appliance indeed.

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

Free IT resource

Attend the SOA Executive Forum: Breaking SOA Bottlenecks SOAExecForum.com/may2007

Sponsored by InfoWorld



Thunderstone Search Appliance V. 4.3

Thunderstone Software, thunderstone.com

Excellent  8.8
criteria score weight
Performance 9 30%
Ease-of-use 9 20%
Management 9 15%
Setup 8 15%
Interoperability 9 10%
Value 8 10%

Cost:
$10,000 for 250,000 indexed pages; $15,000 for 500,000 indexed pages; $20,000 for 1 million indexed pages

Bottom Line:
The Thunderstone Search Appliance is easy to implement and administer -while the admin interface isn't sexy, it is clear and easy to use. Searches are blisteringly fast, advanced searches very flexible, and the device handles some tasks, such as importing Web site banners and frames, that would otherwise take hours to implement. The product does have a couple weaknesses, most notably its date-sorting scheme.

About our Reviews and Scoring Methodology

Getting the Thunderstone appliance up and running is not complicated. You install it. You hook it up. You attach a keyboard and monitor to ports on the back to make the basic settings, such as the IP address and host name. After that, you do the rest using the Web-based management console. The primary settings are the URL of the Web site you want the appliance to index and the file types you want it to retrieve. You can also choose whether to exclude material such as CGI scripts and inline frames if you wish.

Once you’ve accomplished the basic setup tasks, you tell the appliance to “walk” through the Web site, and off it goes, periodically reporting progress as it proceeds. And that’s all there is to it. You can tweak the process using the management interface if you desire, and fire off new walks when you feel like it. You can also set the appliance to perform walks at set times (daily at 1:00 a.m., for example).

There’s little to say about the administrative interface. It’s plain, consisting of boxes you can fill in, boxes you can check, menus you can pull down, and buttons you can push. Functions are separated by tabs and links. If you’re even minimally experienced using Web software (a near certainty) you’ll prefer this interface over the busy wizards and user-friendly excesses of the competition.

Superior Searching

Once the management is complete, your users will find Thunderstone easy and flexible. Basic searches operate just like everyone else’s engine. Enter what you want, and Thunderstone goes and looks for it. The advanced searches, however, are where Thunderstone really shines.

As is the case with most search engines, you can perform Boolean searches and you can match phrases, but Thunderstone goes beyond that. You can also specify the proximity of the entries, how close the match has to be, and what word forms are acceptable. You can rank results by word order, proximity, frequency, and where the search subject appears in the text. Once you’ve performed a search, you can search within those results for additional terms.

After you submit a query, Thunderstone searches its database and provides the responses almost instantly. The company claims the appliance can handle 1,000 queries a minute, and I found no reason to dispute this. Normally, the responses from Thunderstone are sorted by relevance and ranked according to your choices of ranking factors. You may also sort by date.

Unfortunately, sorting by date may not deliver the results you expect. Thunderstone uses the last modified date in the HTML file (usually the same as the file date) to determine the date for sorting. This means that if you post an article on a certain day, then make changes to the article, Thunderstone will find the date for sorting to be when you last modified it. The InfoWorld Web site includes date meta tags for the date an article was published. While Thunderstone can search for those meta tags, it can’t use them as the source for the date sort. A Thunderstone spokesperson said that date sorting will be improved in a future release.

While the price for the Thunderstone Search Appliance seems a little steep, the produce performs exactly as intended. It is easy to administer, easy to use, and very fast. More importantly, it excels at searching, especially when you use its advanced search capabilities. While it won’t figure out badly spelled words as Google does, and while the date sorting works differently from what you might expect, these are hardly major indictments for what I found to be an excellent product. This one is worth looking at.

InfoWorld CTO Chad Dickerson contributed to this review.





 


 
Wayne Rash is an InfoWorld senior contributing editor.
 

TOP NEWS:


»  Software piracy hurts the open-source community too
Many nations are beginning to see stolen proprietary software as a lost opportunity for open source software, whose development can encourage innovation and job growth

»  Intel readies slew of embedded chips based on Atom core
Intel is trying to increase performance and drop power consumption in more than 15 system-on-chips that use the Atom core

»  Microsoft surprise reorganization aimed at online woes
Microsoft's online troubles hint at larger vulnerability; the company is facing challenges in areas that have been a lock for many years

»  Attack code released for DNS bug
Security experts warn that this attack code may give cybercriminals a way to launch virtually undetectable phishing attacks

»  Parts of San Francisco network still locked out
Administrators are still locked out of the city's VoIP system and LANs within the Sheriff's Department and the Recreation & Park Department

»  Intel says Moblin update coming soon
Open-source effort set for mobile Linux should have an alpha-level release in a few weeks




Solutions to the Toughest IT Challenges in Remote Offices
Though small in size, remote offices face many of the same IT challenges as larger central offices. This Webcast zeroes in on the top line challenges to deliver information that can provide immediate benefits to your business. Sponsor: AMD and Dell

»  Click here to view this Webcast
  Zombie PCs Are Attacking Your LAN
A recent study showed that malware-infected zombie PCs are now a bigger threat to ISPs and Web infrastructure than DoS attacks. As this brand new IT Strategy Guide explains, an increased use of peer-to-peer techniques by the attackers has made it harder to fight back. Download now, compliments of Verio:

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 
SEE ALSO
• SIDEBAR: How I tested


FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist