The new Greenplum HD Data Computing Appliance is built on top of Intel X86 servers and it uses both a structured database built by Greenplum, which EMC acquired last year, and the Apache open source version of Hadoop. The older version of the appliance is based on Sun Fire x64-based servers.
According to Yara, administrators can read and write files in parallel from Greenplum to HDFS, enabling rapid data sharing. Cross-platform analysis can be performed using Greenplum SQL and advanced analytic functions accessing data on HDFS.
"We're here to build a big data analytics stack," Yara said. "It's a unified stack whether it's for structured data in Greenplum's database, or through a data computing appliance."
The new Hadoop appliance is expected to be able to scale to a large number of nodes, but EMC did not disclose details.
The appliance is due out in the third quarter of this year, Yara said.
Lucas Mearian covers storage, disaster recovery and business continuity, financial services infrastructure and health care IT for Computerworld. Follow Lucas on Twitter at @lucasmearian, or subscribe to Lucas's RSS feed. His email address is firstname.lastname@example.org. Read more about BI and analytics in Computerworld's BI and Analytics Topic Center.