Hadoop

Hadoop news, information, and how-to advice

Nerves can get to you worried anxious fret nervous anxiety
Family in silhouette waving goodbye at airport

lion tamer woman whip zoo

Tame unruly big data flows with StreamSets

See how the free open source StreamSets Data Collector brings visibility and control to real-time streaming data

fire match ignite flame

Fire up big data processing with Apache Ignite

Apache Ignite brings RDBMS, NoSQL, and Hadoop data sets into memory to deliver huge performance gains

faceoff face-off hocket

Big data face-off: Spark vs. Impala vs. Hive vs. Presto

AtScale, a maker of big data reporting tools, has published speed tests on the latest versions of the top four big data SQL engines. Conclusion: Time to upgrade!

big data group

7 big data tools to ditch in 2017

You think you got the hang of big data analytics? There's no time to be smug. To deliver real value, you'd better keep your stack up to date

privacy eye peek look secret

8 'new' enterprise products we don't want to see

So many press releases, so little time -- here are the product announcement emails that get deleted based on the subject line alone

data lakes

How to answer the top three objections to a data lake

There is usually a set of stakeholders out there who are unfamiliar with Hadoop or the concept of a data lake or perhaps just not interested in changing the status quo of their organizations

messy cio desk worker office frustration

Big data security is a big mess

No one questions that the Hadoop/Spark ecosystem can yield business-changing insights. Yet few seem willing to face up to the sorry state of big data security

internet of things data

Big data problem? Don't forget search

While Hadoop, Spark, and NoSQL databases make more noise, search is the original -- and one of the most useful -- big data technologies

big data enter key 000034547914 medium

Spark 2.0 takes an all-in-one approach to big data

With a new streaming system, performance enhancements, and API refinements, Apache Spark 2.0 offers a big umbrella to data users

open source keyboard

Spark-powered Splice Machine goes open source

An open source version of the Hadoop-based and Spark-accelerated RDBMS is now available sans a few enterprise features

waste basket ideas trash

With big data, CEOs find garbage in is still garbage out

BI has always topped of the list of enterprise priorities -- and execs are always the least satisfied with BI initiatives. Why should big data be any different?

Network room and mainframes with virtual city in the cloud

How to get your mainframe's data for Hadoop analytics

IT's mainframe managers don't want to give you access but do want the mainframe's data used. Here's how to square that circle

Chain held together by string

HDFS: Big data analytics' weakest link

Hadoop's distributed file system isn't as fast, efficient, or easy to operate as it should be

Hadoop

Prioritize predictable performance in Hadoop

Organizations running Hadoop in production can ensure that high-priority jobs complete on time, every time

Two hands reaching and creating a spark of electricity

The next steps for Spark in the cloud

Simply having Spark in the cloud isn't enough. What matters is what it can connect to and how easy it is to use

great blue heron bird flight feathers

Had it with Apache Storm? Heron swoops to the rescue

Heron, Twitter's brand-new streaming replacement for Apache Storm, offers easier scaling and higher throughput while maintaining Storm code compatibility

security 2016 big data

Businesses harbor big data desires, but lack know-how

A fresh survey shows that while more companies are investing in big data, putting the results of all that processing to use remains dicey

stupid factory

Dear Silicon Valley: Stop saying stupid stuff

Silicon Valley has its head so far in the future it can't hear the laughter in response to its over-the-top pronouncements

Load More