Hadoop

Hadoop news, information, and how-to advice

security 2016 big data
stupid factory

beautiful green farmland with blue sky and clouds

Redis plants the seeds for an open source ecosystem

Redis Modules help the caching and in-memory storage system work with new data structures and database behaviors

train leaving

HBase: The database big data left behind

As the default database for Hadoop, you'd expect HBase to be more popular than it is, but its time may already have passed

Spark Java microframework

Apache Spark powers live SQL analytics in SnappyData

The same team that created GemFire builds on Spark in a new open source database that can analyze OLTP and OLAP workloads side-by-side

Data lakes 101: Come on in, the water's fine

How to plan for and build a central hub for data analytics with the ever-evolving Hadoop ecosystem

analytics big data stats statistics charts

Apache Beam wants to be uber-API for big data

New, useful Apache big data projects seem to arrive daily. Rather than relearn your way every time, what if you could go through a unified API?

streaming river water creek flow

What Spark's Structured Streaming really means

Thanks to an impressive grab bag of improvements in version 2.0, Spark's quasi-streaming solution has become more powerful and easier to manage

big data rescue

Review: Databricks makes big data dreams come true

Cloud-based Spark machine learning and analytics platform is an excellent, full-featured product for data scientists

Elephant dog rain tint

Hadoop project ODP regroups under Linux Foundation's umbrella

The Open Data Platform's reorg aims to assuage criticism about vendor control over the initiative to create a consistent baseline Hadoop distribution

data integration

SAP's HANA Vora bridges divide between enterprise and Hadoop data

The SAP HANA Vora software is designed to allow companies to analyze data stored in Hadoop, enterprise systems and other distributed data sources

Hadoop sign door

Apache Flink 1.0 takes on Spark in Hadoop processing

Hadoop needs fast and easy-to-use stream processing, and Flink provides that -- but it'll compete with Spark and Storm

data abstract

MapR invites Docker and Mesos to its big data party

MapR's updated Hadoop distribution provides persistent storage for Dockerized apps, enables Hadoop jobs governance via Mesos

Hortonworks sign

Use Hortonworks Hadoop? Now you can rely on a more stable core

Hortonworks Data Platform updates will happen continuously for services like Spark and Hive but just annually for core components

money flying away loosing broke bankrupt

Hortonworks seeks salvation in proprietary software

Once Hortonworks seemed bound and determined to pursue the 'pure' open source model. Now reality is finally setting in

bi business intelligence ts

How different SQL-on-Hadoop engines satisfy BI workloads

Benchmark of SQL-on-Hadoop engines Impala, Spark, and Hive finds they each have their strengths and weaknesses when it comes to BI workloads

Spark Java microframework

Databricks offers a glimpse of Spark 2.0

Spark has taken big data by storm. What's next for the in-memory engine of choice? Spark's primary commercial backer, Databricks, offers a clue

Female executive shooting bow and arrow against green background

Apache Arrow aims to speed access to big data

Apache's new project leverages columnar storage to speed data access not only for Hadoop but potentially for every language and project with big data needs

sparkler blaze fire

Hadoop co-creator: Spark is great -- but people want more

Doug Cutting anticipates growth ahead and opportunities all around for the Hadoop ecosystem

Big Data analytics machine learning

Why open source is the 'new normal' for big data

Openness is driving the Hadoop ecosystem, Talend's CEO says

Load More