Olson would not discuss how much Cloudera has made from subscription and consulting fees thus far, but notes that the first quarter of 2010, the company made as much as what it earned through half of 2009. Among different industries, financial services, telecommunications, retail, government and Web commerce companies have shown an interest in the technology, Olson said.
"The things that companies are doing with Hadoop vary. In general, these people are catching lots of data from lots of places and need to subject it to sophisticated analytics," Olson said. "Financial services are interested in using Hadoop for fraud detection. In telecommunications, there is a real need to optimize networks and reduce churn of customers."
In addition to offering these packages, Cloudera has been rallying support for Hadoop from providers of business intelligence (BI) and data management software.
Olson plans to announce, during his keynote at the Hadoop Summit 2010, taking place in Santa Clara, California on Tuesday, that BI vendor MicroStrategy will support Hadoop use.
Another new partner is Talend, a vendor of open source data integration software. The company has extended its Talend Integration Suite to interface with Hadoop databases. Its suite allows administrators to manage and aggregate multiple data sources from a single console. With Hadoop, the software "can natively insert or retrieve data, and process the data within the Hadoop architecture," said Talend vice president of marketing, Yves de Montcheuil.
Microstrategy and Talend join a growing number of companies are prepping open source or commercial management tools for Hadoop. Last week, Cloudera and Quest embarked on a project to build software that can link Hadoop with Oracle databases. In May, open-source, business-intelligence company Pentaho announced that its BI suite would work with Hadoop databases.
In a separate interview with IDG News Service, Yahoo CTO Raymie Stata pointed out that Hadoop could reduce the need for building supercomputers to analyze large data sets. Traditionally, large data sets have been moved from storage into the supercomputer, which is a pooled set of servers, to be analyzed. In contrast, Hadoop moves the analytic computation to where the data resides, eliminating the need for a cental, giant number-crunching machine. Yahoo was an early leading contributor to Hadoop.
In addition to Cloudera's offering, Hadoop is also being commercialized by IBM, which recently started offering a set of analytic services that use the technology.