Free Newsletters
Technology & Business Daily

InfoWorld
Log-in | Register

Look deep into log files

Omnisight offers powerful, flexible analysis if you have the skills

By Paul Venezia
August 29, 2003
 

As every server administrator knows, log files are the pulse of a network infrastructure. They tell us what has occurred in an application or service, and if they stop growing, something is wrong. Log files can tell us who is using our services, how many users are using a particular resource, how often, and for how long. Logs can also be extremely valuable as forensic evidence in computer crime investigation and litigation. The trick is to be able to use log files to analyze trends in resource utilization, identify and remove security threats, and provide a useful audit trail of user action, without being buried by the sheer volume or resorting to the onerous task of manual inspection.

Free IT resource

TechNet: More ways to know it, share it, and keep it running.

Sponsored by Microsoft

Free IT resource

Attend the SOA Executive Forum: Breaking SOA Bottlenecks SOAExecForum.com/may2007

Sponsored by InfoWorld



Addamark Omnisight 2.5

Addamark Technologies, addamark.com

Very Good  8.3
criteria score weight
Performance 9 30%
Implementation 8 20%
Management 7 20%
Scalability 9 20%
Value 8 10%

Cost:
Starts at $124,000

Platforms:
Red Hat Linux 7.3, Red Hat Linux Advanced Server 2.1

Bottom Line:
Omnisight provies a powerful clustering and querying framework to store and analyze massive log files, but it has a high cost of entry and requires significant time and skill to implement.

About our Reviews and Scoring Methodology

A large infrastructure can generate many gigabytes of logs from several services and applications every day. These logs are usually archived for a period of time, analyzed, and then discarded on a predetermined schedule. The data contained in these logs may be extremely important or completely irrelevant. Either way, the logs need to be perused to determine which is which, and what is worth flagging for further investigation. For instance, the only way to really know how a Web site is performing is to generate reports based on the Web server log-file data, and use those reports to determine if there are problems with the servers or with the site itself.

Addamark Technologies addresses these issues with Omnisight 2.5. Currently implemented at Lehman Brothers, Yahoo, and Agilent Technologies, among others, Omnisight allows systems managers to extract meaningful data from truly massive log files generated by services and applications by providing a means to import, store, and perform deep analysis on that data.

Heavy Parsing

Addamark’s Omnisight is best described as a programming framework for log-file analysis. Relying on open source packages such as Apache, with a heart written in C and a nervous system written in Perl, Omnisight is not a tool for the faint of heart or light of wallet. Omnisight runs on Red Hat Linux 7.3 or Red Hat Enterprise Linux AS 2.1, with support for Red Hat 8 nearing completion. Omnisight is designed to be implemented in a distributed environment and installed on a cluster. Exchanging SSH (Secure Shell) public keys for the root user between the cluster servers permits seamless installation of the cluster, but could be viewed as a minimal security risk. In my testing, the cluster installation was simple, however, and controlled from a single installer script. When complete, three Red Hat Linux servers were ready to handle log files.

Omnisight isn’t designed to handle live log files, but to import large, static log files into a database. To import a log file requires first creating a parsing file that describes the data to be indexed. For instance, to parse an Apache Web server log, you’ll need to create a file containing the specific log format, parsing rules, variable declarations, and potentially embedded Perl code to handle special-case log files and varying reporting formats found in many applications. These files must be written with care and tested thoroughly, as any deviations will result in parsing errors and lost data. Once the parsing file is complete, it is referenced by the indexing engine, which then imports the log file.

Addamark provides a few sample files, but it would be great to see more included with the package. Omnisight is very powerful and flexible, but it’s also very complex. You can use it to analyze log files from any application, in any format, whether supported by Addamark or not, but it requires significant skill to do so. Most of the customization of the import tools is done in a mixture of Perl and SQL. To correctly implement Omnisight will require a high level of Perl and SQL experience, although Addamark does include seven days of assistance in the cost.

Once a log file has been imported it can then be analyzed by the engine. Querying and reporting is done via a CLI and Web front-end, both of which are minimalist interfaces. The Web interface provides a central interface for reporting, query construction, and maintenance, while the CLI tools are broken into separate functions.

Deep Analysis

Reports rely on queries to the database, and SQL queries must be written for the specific log file to be analyzed. Addamark provides a handful of queries that highlight the sample log files, but again, significant skill is required to write queries for anything beyond that. Once queries have been written, they can be collected into reports to be run by an administrator, or by an authorized user. No reports are provided with Omnisight; all reports must be developed in-house.

With its format-agnostic approach, Omnisight can be adapted to handle just about any log file analysis task or objective. An obvious use is to collect and analyze data from a variety of network devices to investigate a suspected employee or an external break-in. Another adaptation could be cross-analyzing log files generated by security-card access devices and the PBX to determine if anyone who wasn’t logged entering the building was making phone calls.

Addamark is aiming Omnisight at large infrastructures with heavy-duty log-file storage, maintenance, and analysis requirements, hence the built-in clustering. Log files imported into Omnisight are meant to stay there ad infinitum, not discarded after a period of time. To achieve this, every log file is compressed during import. On a five-node cluster of dual Xeon servers, a 500MB Check Point firewall log in LEA (Log Export API) format was imported, mirrored, and compressed in 125 seconds, with a nearly 10:1 compression ratio. On the same cluster, a fairly complex query of 6 million records returned in 23 seconds — truly impressive feats.

Omnisight is a powerful and flexible tool for log-file analysis. The scope of its reach is almost limitless, given its open architecture and highly customizable parsing functions. This is not a tool to be installed by administrators and driven by nontechnical management; it’s a tool to be carefully implemented and maintained by skilled programmers. If you have the need to store and analyze massive log files from a wide variety of services and devices — and if you have the skill to handle the implementation — Omnisight can handle the load.





 


 
Paul Venezia is a contributing editor at InfoWorld.
 

TOP NEWS:


»  Four quick tips for choosing an IM security product
71 percent of businesses will invest in real-time messaging this year. If you're one of them, be sure to protect your enterprise

»  Forrester analysts ID hot IT jobs
Research group finds 16 IT roles with a promising future

»  Nvidia claims 10 hours of HD video on Tegra chip
The Tegra 600 and 650 can be used with hard disk drives and are designed partly for mobile Internet devices

»  Database vendors add Google's MapReduce
Greenplum and Aster Data Systems will support Google's programming technique, developed for parallel processing of large data sets across commodity hardware

»  Network management: Tips for managing costs
New technologies, changing requirements, and ongoing equipment maintenance and upgrades cost money, but there are ways to manage expenses

»  EMC targets SMBs, branch offices with new low-end storage
Celerra NX4 highlights include thin provisioning, snapshot technology for data recovery and backups, and Web-based console for management of storage volumes




Take control of your content- leverage Microsoft SharePoint
Microsoft Office SharePoint Server (MOSS) offers core content management designed for a broad user population. Attend this webcast to learn how to implement a strategy that allows for the coexistence of both MOSS and advanced ECM solution within the same IT environment. Sponsor: IBM

»  Click here to view this Webcast
  Virtualization Solutions Guide
This comprehensive IT Strategy Guide covers Virtualization and puts you at the forefront of the discussion. You'll learn all you need to know from the cost of virtualization, how to implement it for your business, how to back it up safely and which products are best. Sponsored by Riverbed

»  Click here to download now

- Special Advertising Partners -
WHITE PAPERS
 

» Technology White Papers Library

Technology White Papers by Topic

Technology White Papers E-mail Alert

Find out when the latest white paper is available:
 
 
INFOWORLD MARKETPLACE
 
» BUY A LINK NOW
 

FIND PRODUCTS AND COMPANIES
» COMPLETE PRODUCT GUIDE



TECHNOLOGY INDEX
• Applications
• Application Development
• Security
• Networking
• Wireless
• Platforms
• Hardware
• Data Management
• Storage
• Web Services
• Business
• Telecom
• Professional Services
• Standards

TECH WATCH 


What's the 411 on GOOG-411?
Just as Google has become synonymous with "performing a Web search," 411 is understood to mean "information" -- as in "what's the 411?" I was thus surprised to discover, from a billboard, no less, that the king of search is taking on the ...

Apple HTML source reveals 'iPhone Extreme'
"This one's a stretch..." reports AppleInsider. Um, yeah. Reporting on HTML code sightings of product names could be called a stretch, but iPhone Extreme has a ring to it. Now, that sounds like the product Apple should have released first, rather ...

COLUMNISTS

Unified under law
Ephraim Schwartz's Column and Blog (InfoWorld) - In the litigious world we live in, deploying a unified communications platform in your enterprise could...
» MORE COLUMNISTS

MORE INFOWORLD BLOGS


Open Sources 
Product Management
When I joined MySQL four years ago, there was quite a lot of debate about product management. We didn't actually have ...

Zero Day 
Botnet herders tending smaller flocks
New research backs up the theory that botnet operators are keeping their networks smaller in a continued effort to keep ...



• Advice Line
• Database Underground
• The Deep End
• Enterprise Mac
• Geeks in Paradise
• Grid Meter
• The Gripe Line
• InfoWorld Daily
• Inside IT
• IT Troubleshooter
• ITXtreme
• Open Sources
• ProdBlog
• Real World SOA
• Reality Check
• Security Adviser
• SMB IT
• The Storage Network
• Tech Watch
• Virtualization Report
• Zero Day

ADVERTISEMENT


RESOURCE CENTERadvertisement 

GOVERNMENT IT & POLICY
'If you don't go after the network, you're never going to stop these guys. Never.'
From the State Department, All the News for Inquiring Minds
TechPresident, the Internet Citizenry's New Consensus Taker



Sponsored Technology Links

 
 
 HOME  NEWS  BLOGS  PODCASTS  VIDEOS  TECHNOLOGIES  TEST CENTER  EVENTS  CAREERS   About | Advertise | Awards | RSS | Contact Us 

Copyright © 2008, Reprints, Permissions, Licensing, IDG Network, Privacy Policy, Terms of Service.
All Rights reserved. InfoWorld is a leading publisher of technology information and product reviews on topics including viruses,
phishing, worms, firewalls, security, servers, storage, networking, wireless, databases, and web services.

CIO :: ComputerWorld :: CSO :: Demo :: GamePro :: Games.net :: IDG Connect :: IDG World Expo
Industry Standard :: IT World :: JavaWorld :: LinuxWorld :: MacUser :: Macworld :: Network World :: PC World :: Playlist