May 15, 2006

Splunk combs log files for hidden problems

Plumbing the depths of log files and other metadata, Splunk helps IT find the telltale patterns that reveal what's really going on

More and more IT systems generate a glut of metadata -- log files, systems states, and so on. “And more and more of the IT budget is spent trying to keep systems up and running,” says Michael Baum, CEO of Splunk. Much of that cost can be attributed to the growing complexity of systems and the labor it takes to sift through metadata for troubleshooting purposes.

Current technology can help IT monitor how a specific system is doing, but the data captured is tied to a specific data structure. The result? “You’re going to miss things that change and break,” Baum says.

Splunk -- short for spelunk, the sport of cave diving -- applies Google-like technology to log-file analysis. “Think of it as a search engine for free-form indexing for the Web applied to IT logs and data,” Baum says.

But as opposed to Web search engines, which basically crawl HTML pages and index them according to keywords, Splunk’s indexing engine must apply multiple techniques to parse common types of IT data. According to Baum, “You need to look at syslogs, multiline JFLAGs, UDP traps, and more.”

The result is a single search engine that helps IT staff troubleshoot the disparate log and state data. The tool can also be used to generate alerts based on specified data patterns, and even for capacity planning and trending analysis by IT analysts.

The company’s newest tool, Splunkbase, is a wiki-like service that allows IT staff at different companies to compare notes on the unique “fingerprints” that Splunk creates for each pattern. Say hello to communal troubleshooting.


Click for larger view.


Galen Gruman is executive editor of InfoWorld for features and news.
Close

On Twitter now

Data management

Powered by Twitter

On Twitter now

White Paper

D2D Virtual Tape Library Replication Primer

This whitepaper explains the terminology and concepts behind Data Replication technologies and establishes some sizing rules through worked examples. Learn the new paradigm in disaster tolerance—protect data anywhere.

Download now »

White Paper

An Alternative to Virtualization for Datacenter Cost Savings

Server virtualization is a popular option for dealing with mounting datacenter costs. Another equally promising approach is the use of an Application Delivery Controller. Citrix NetScaler provides a low-cost way for organizations to reduce their server count and accrue cost savings from a reduction in space, cooling, power and personnel.

Download now »

White Paper

Why Your Firewall, VPN, and IEEE 802.11i Aren't Enough to Protect Your Network

The emergence of WLANs has created a new breed of security threats to enterprise networks.

Included in HP ProCurve WLAN solutions is security technology that alleviates threats from WLANs through:
* Monitoring wireless activity inside and out of the enterprise
* Classifying WLAN transmissions into harmful and harmless
* Preventing transmissions that pose a security threat to the enterprise network
* Locating participating devices for physical remediation

Download now »

White Paper

Bringing the Edge to the Data Center

Effectively address data protection challenges, implementing solutions that help store and protect business–critical data while cutting costs and improving efficiency and reliability.

Download now »

Sign up to receive Data Management Resource Alerts

Subscribe to the Today's Headlines: First Look Newsletter

Find out what will be news for the day, with our first-thing-in-the-morning briefing.

©1994-2009 Infoworld, Inc.