Amazon Web Services hopes to entice more Hadoop users to its Elastic MapReduce service with new virtual servers, one of which has 262GB of memory and 6.4TB of storage for big-data analytics.
On Tuesday, the company launched 12 new virtual servers, or instances, that organizations can use to run applications on Elastic MapReduce clusters. Potential applications include Web indexing, data mining, log file analysis, financial analysis, scientific simulation and bioinformatics research.
Hadoop is an open-source platform that allows for the distributed processing of large data sets across clusters of computers. The MapReduce framework assigns work to nodes in the cluster.
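To make the division of labor concrete, here is a minimal single-process sketch of the map, shuffle and reduce steps, using a word count as the job. It is an illustration only and does not use Hadoop's actual API; the function names (map_phase, shuffle, reduce_phase, word_count) are assumptions chosen for this example. On a real cluster the framework would hand different slices of the input to different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, the step that sits between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    # Everything runs in one process here; a cluster would spread the
    # map and reduce calls across its nodes.
    emitted = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(emitted).items())


if __name__ == "__main__":
    sample = ["big data big clusters", "data on clusters"]
    print(word_count(sample))
```

Because each map call depends only on its own line and each reduce call only on its own key, the work can be split across as many machines as are available.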
Amazon's compute-optimized c3.8xlarge virtual server is aimed at tasks such as image processing. It has 32 vCPUs (virtual CPUs), 64GB of memory, two 320GB SSDs and 10Gbps network connectivity. The Elastic MapReduce charge is $0.270 per hour, on top of the price of the underlying EC2 (Elastic Compute Cloud) server, which starts at $1.680 per hour.
The storage-optimized i2.8xlarge instance type is a good fit for the analytics applications Impala, Spark and HBase, Amazon said. It has 32 vCPUs, 262GB of memory, eight 800GB SSDs and 10Gbps network connectivity. The Elastic MapReduce charge is $0.270 per hour, and the EC2 capacity costs from $6.820 per hour.
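Since the Elastic MapReduce fee is charged on top of the EC2 price, the figure that matters is the sum of the two. The sketch below adds them up using the rates quoted above; the RATES table and the hourly_cost function are names invented for this example, and because the EC2 figures are "from" prices, the results are best read as lower bounds.

```python
from decimal import Decimal

# Hourly rates in US dollars as quoted in the article. The EC2 figures are
# starting prices, so the totals below are lower bounds.
RATES = {
    "c3.8xlarge": {"emr": Decimal("0.270"), "ec2": Decimal("1.680")},
    "i2.8xlarge": {"emr": Decimal("0.270"), "ec2": Decimal("6.820")},
}


def hourly_cost(instance_type, instance_count=1):
    """Return the combined EMR + EC2 charge per hour for a group of servers."""
    rate = RATES[instance_type]
    return (rate["emr"] + rate["ec2"]) * instance_count


if __name__ == "__main__":
    for name in RATES:
        # One server, then 20 servers of the same type.
        print(name, hourly_cost(name), hourly_cost(name, instance_count=20))
```

At these rates a single c3.8xlarge comes to at least $1.950 per hour and a single i2.8xlarge to at least $7.090 per hour.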
One effective way to determine the most appropriate instance type is to launch several small clusters and benchmark them, according to Amazon.
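One way to compare such benchmark runs is by the cost of completing the same test job on each cluster. The sketch below assumes the runtimes have already been measured; the BenchmarkRun class and the cheapest function are hypothetical names, and the runtimes in the usage example are placeholders rather than real measurements. It also bills the exact runtime, ignoring any rounding a provider may apply.

```python
from dataclasses import dataclass


@dataclass
class BenchmarkRun:
    instance_type: str
    node_count: int
    hourly_rate: float    # combined EMR + EC2 rate per node, in dollars
    runtime_hours: float  # measured wall-clock time of the test job

    @property
    def job_cost(self):
        # Simplification: charges for the exact runtime with no rounding.
        return self.node_count * self.hourly_rate * self.runtime_hours


def cheapest(runs):
    """Return the run with the lowest total cost for the test job."""
    return min(runs, key=lambda run: run.job_cost)


if __name__ == "__main__":
    runs = [
        BenchmarkRun("c3.8xlarge", 4, 1.950, 2.0),  # placeholder runtime
        BenchmarkRun("i2.8xlarge", 4, 7.090, 0.5),  # placeholder runtime
    ]
    best = cheapest(runs)
    print(best.instance_type, round(best.job_cost, 2))
```

A more expensive instance type can still win on cost per job if it finishes the work quickly enough, which is why measuring beats guessing from the price list.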
In total, Amazon now offers 25 Elastic MapReduce server types for users to choose from, at $0.011 to $0.270 per hour plus the charge for EC2. In the standard configuration, users are limited to 20 servers across all their clusters. Those who want more need to ask Amazon for permission.
On Tuesday, Amazon also lowered the cost of existing virtual Elastic MapReduce servers by 27 percent to 61 percent. The price change is part of a general price drop that Amazon announced last week after Google cut the cost of its services.
The price war between public cloud providers shows no signs of abating, as Microsoft cut Azure pricing on Monday and also introduced a new basic service configuration.
Users who want to run Hadoop in a hosted environment have alternatives to Amazon's Elastic MapReduce, including Microsoft's HDInsight, which runs on the company's Azure cloud, and Rackspace's Cloud Big Data Platform.