Hadoop news, information, and how-to advice

Briefcases marching
Navigating a field of uncertainty and doubt questions

Kyvos serves up Hadoop on easy-to-parse data cubes

New big data software from startup Kyvos Insights can format Hadoop data into OLAP repositories


Debunked! 9 myths about big data and Hadoop

These unfounded beliefs about budget skills, technology, and technology fit can lead you astray

flying sparks fire

IBM fires up Spark with Bluemix, machine learning contributions

IBM doubles up on Spark, adding it to Bluemix and contributing its SystemML machine-learning code to the Apache project

big data

Spark 1.4 adds support for R, Python 3, cluster management

Spark data processing framework adds languages used by many data crunchers, as well as container-based cluster management features

Data and analytics

LinkedIn fills another SQL-on-Hadoop niche

LinkedIn's open source, home-brew OLAP project is a new way for Hadoop users (and others) to query both real-time and historical data

maze simplify easy arrow easier

Hortonworks eases path to Hadoop

With new setup, management, and data-governance features, Hortonworks' latest Hadoop distribution wants to be an enterprise darling -- if enterprises will let it

6 real time

Spark and Storm face new competition for real-time Hadoop processing

DataTorrent is releasing its real-time data processing engine for Hadoop and beyond as the open source Project Apex

tahiti wave 000007128186

Salesforce wants enterprise big-data users to catch its Wave

With Wave for Big Data, Salesforce is determined to keep a foothold in enterprises with growing interests in Hadoop -- assuming existing self-service analytics outfits haven't gotten there first

the great wall of china

4 strategies to distribute your data between front end and back end

Where you store and process your data has a significant impact on issues such as privacy or performance, but also on the ability for apps to access and deliver relevant data

mind the gap london metro tube

The mythical Hadoop skills gap

Oh no! Big data is failing because we can't find enough people who know the technology! Relax, they're out there -- but don't fall for the buzzwords


Apache Drill 1.0 tears into data, with or without Hadoop

Drill 1.0 queries Hadoop data via SQL, but may have a life of its own outside of the framework

graph trend down

Hadoop demand falls as other big data tech rises

Hadoop isn’t living up to its hype -- which means that both Hadoop vendors and their customers need to widen their array of big data technologies

Oracle zeroes in on Hadoop data with new analytics tool

Oracle Big Data Spatial and Graph focuses on spatial and graph capabilities

data visualization

Sick of ETL? Database virtualization can help

Database virtualization, a seemingly bad idea from the past, turns out to be a good idea in the present

deathmatch 4 arm wrestle battle fight contest

Apache Flink: New Hadoop contender squares off against Spark

A flexible replacement for Hadoop MapReduce that supports real-time and batch processing, Flink offers advantages over Spark

machine learning robot touch screen

Adatao enhances Hadoop with natural-language queries and machine learning

By leveraging machine learning, the Data Intelligence Platform hopes to make querying Hadoop as easy as typing questions

big data 17

Microsoft's Azure moves indicate its big data ambitions

Azure SQL Database and Data Lake allow big data work to be done locally or remotely -- and break down barriers between the two

Hadoop door label

Apache Parquet paves the way for better Hadoop data storage

Newly graduated from the Apache Incubator, the Parquet project allows column-stored data to be handled at high speed

Load More