Hadoop news, information, and how-to advice


Hortonworks buys better Hadoop data flow management

Hortonworks' newest acquisition is a prelude to creating an open-source-based data flow management product

Mesosphere's new big data solution: Add Spark, hold the Hadoop

A data-processing solution from Mesosphere leverages Spark, Kafka, and Cassandra -- but eschews Hadoop -- for enterprise-level, real-time big data needs

How Apache Ranger and Chuck Norris help secure Hadoop

The Hadoop ecosystem has always been a bag of parts, each of which needed to be secured separately -- at least until Apache Ranger came to town

The 7 most common Hadoop and Spark projects

Think you're breaking new ground with your Hadoop project? Odds are it fits neatly into one of these seven common types of projects


Who will be the Ubuntu of Hadoop?

While Hadoop and its innumerable satellite projects make for a confusing market, there's more healthy unity there than may be obvious

9 big data pain points

Do enough Hadoop and NoSQL deployments, and the same problems crop up again and again. It's time for the industry to nail them sooner rather than later


4 cases where the ancient skill of data storytelling will come in handy

The data interpreter is one of the 5 hottest jobs in data. Data interpreters are, in fact, data storytellers who help executives make sense of the data they get

Why streaming analytics is such a big deal

Analytics drive decisions, but some decisions shouldn't wait until batch processes complete -- which is why, eventually, we'll all analyze data as it streams in


Why Spark is spiking in the cloud

Interest and investment in Apache Spark have increased dramatically in recent months, to the benefit of cloud customers

Yahoo struts its Hadoop stuff

A peek under the hood of Yahoo's Hadoop deployment illustrates how vast the ecosystem has become -- and how the company that invented it is still leading the way

Streaming analytics enter the fast lane

Already we've moved on to a new phase in analytics where data never rests

Which freaking Hadoop engine should I use?

These four truths will help you determine which Hadoop technology to use for the types of workloads you anticipate

Hadoop keeps marching on, somehow

Deployments of Hadoop in production have been slower to arrive than many thought, but Hadoop job growth data shows that enterprises are keeping the faith

Big data, big challenges: Hadoop in the enterprise

Fresh from the front lines: Common problems encountered when putting Hadoop to work -- and the best tools to make Hadoop less burdensome

Kyvos serves up Hadoop on easy-to-parse data cubes

New big data software from startup Kyvos Insights can format Hadoop data into OLAP repositories


Debunked! 9 myths about big data and Hadoop

These unfounded beliefs about budget, skills, technology, and technology fit can lead you astray

IBM fires up Spark with Bluemix, machine learning contributions

IBM doubles up on Spark, adding it to Bluemix and contributing its SystemML machine-learning code to the Apache project

Spark 1.4 adds support for R, Python 3, cluster management

Spark data processing framework adds languages used by many data crunchers, as well as container-based cluster management features
