March 17, 2003

Search engines target Weblogs

Feedster indexes RSS feeds to make blogs searchable

A new crop of search engines tuned specifically to unearth information from Weblogs and other news feeds is emerging to simplify the process of searching and navigating them. To that end, the FuzzyGroup recently launched Feedster, a search engine designed to monitor and index the RSS (Really Simple Syndication) feeds that Weblogs publish. RSS is a Web content syndication format based on XML.

With the number of Weblogs multiplying rapidly, the need to search and index RSS feeds is clear. Other search engines focused on RSS feeds include Feeder, RSS Search, and Snarf.

Feedster employs a traditional decentralized crawling model for Web search, but the technology includes a spider capable of indexing RSS feeds, according to Scott Johnson, president of Belmont, Mass.-based FuzzyGroup. The search engine also employs additional technology for relevance ranking and data exploration, he said. Users can search via a URL-based query or a keyword search.

"Feedster goes out and grabs RSS feeds on a very regular basis, indexes them, and makes them easily searchable and navigable," Johnson said.

Unlike typical Web search tools such as Google that index Web content at the page level, RSS-focused search engines index RSS feeds and do so more frequently and at a finer level of granularity. "There is a huge need for this capability in blogs. Blogs are heavily nested and interlinked. We take that data stream and go all the way with it. We are getting the whole information feed, storing it, and indexing it," Johnson said.
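
For readers who want a concrete picture of item-level indexing, here is a minimal sketch in Python. It is not Feedster's code, and nothing in it comes from FuzzyGroup: the sample feed, the example.com URLs, and the function names parse_items, build_index, and search are all hypothetical. It assumes an RSS 2.0-style feed and a plain inverted index with no relevance ranking, and it reads the feed from a string rather than fetching it over the network.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample feed; a real crawler would download this from a blog's RSS URL.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Weblog</title>
    <link>http://example.com/</link>
    <item>
      <title>Searching RSS feeds</title>
      <link>http://example.com/posts/1</link>
      <description>Notes on indexing syndication feeds.</description>
    </item>
    <item>
      <title>Weekend reading</title>
      <link>http://example.com/posts/2</link>
      <description>Links about XML and search engines.</description>
    </item>
  </channel>
</rss>"""


def parse_items(feed_xml):
    """Return one dict per <item>, so each post is indexed on its own."""
    root = ET.fromstring(feed_xml)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "description": item.findtext("description", default=""),
        })
    return items


def build_index(items):
    """Map each lowercase word to the set of item links that contain it."""
    index = defaultdict(set)
    for item in items:
        text = item["title"] + " " + item["description"]
        for token in re.findall(r"[a-z0-9]+", text.lower()):
            index[token].add(item["link"])
    return index


def search(index, query):
    """Return links of items containing every word in the query (no ranking)."""
    tokens = re.findall(r"[a-z0-9]+", query.lower())
    if not tokens:
        return []
    matches = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        matches &= index.get(token, set())
    return sorted(matches)


if __name__ == "__main__":
    index = build_index(parse_items(SAMPLE_FEED))
    print(search(index, "feeds"))
```

Because the index is keyed to individual feed items rather than to the page that hosts them, a query returns the specific post that matched. Re-running the parse and index steps on a schedule would approximate the frequent refresh Johnson describes.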

One application for RSS search in enterprises is to accurately monitor the burgeoning number of news feeds and Weblog postings, which could be vital for competitive intelligence or product research. A new feature currently under development in Feedster is designed to provide a personalized aggregate search capability that can build a view of information tailored to a particular theme or personal preference, according to Johnson.


©1994-2009 Infoworld, Inc.