Which freaking Hadoop engine should I use?

These four truths will help you determine which Hadoop technology to use for the types of workloads you anticipate

Which freaking Hadoop engine should I use?

In 2015, Hadoop no longer means MapReduce on HDFS. Instead, it refers to a whole ecosystem of technologies for working with “unstructured,” semi-structured, and structured data for complex processing at scale.

This also now includes streaming use cases, which can be massively parallelized or happen in “real time” (which today means many different things ... other than traditional RTOS-style “real time”). The streaming Spark crowd now likes to contrast itself from the Hadoop -- or more specifically, the YARN -- crowd.

Copyright © 2015 IDG Communications, Inc.

How to choose a low-code development platform