Streaming engines such as those offered by Nodeable and Metamarkets differ from technologies like Dremel in one important respect: they are designed to analyze raw data before it reaches a database. Dremel and similar technologies are designed for ad hoc querying of data that is already in a data store, such as a Hadoop environment.
Meanwhile, major Hadoop players are not standing idly by. Cloudera, the biggest vendor of commercial Hadoop technology, last week rolled out Cloudera Impala, a real-time query engine for data stored in the Hadoop Distributed File System. According to Cloudera, Impala will allow companies to perform batch and real-time operations on structured and unstructured data within one system.
Jaikumar Vijayan covers data security and privacy issues, financial services security and e-voting for Computerworld. Follow Jaikumar on Twitter at @jaivijayan, or subscribe to Jaikumar's RSS feed. His email address is email@example.com.