Big Data

Big Data news, analysis, research, how-to, opinion, and video.

listen phonograph dog hear

Google Cloud Dataflow vs. Apache Spark: Benchmarks are in

In a simple batch processing test, Google Cloud Dataflow beat Apache Spark by a factor of two or more, depending on cluster size


Virtual machine

Review: HPE’s machine learning cloud overpromises, underdelivers

Haven OnDemand’s enterprise search and format conversions are the strongest services, while more interesting capabilities are not fully cooked

larry ellison executive keynote 02

Oracle is paying $532 million to snatch up another cloud service provider

Move follows the purchase of Textura last week for $663 million, as cloud holdout Oracle makes up for lost time

questionable signs 87489914

SQL Server 2016 heads for release, but Linux version is still under wraps

SQL Server 2016, sporting a full-fledged free edition for developer use, will have a June 1 release. But there's still no word about the promised Linux version

Big Data (4)

Microsoft SQL Server 2016 finally gets a release date

Microsoft's popular database software will have its latest major release on June 1

log wood chipper

6 Splunk alternatives for log analysis

Splunk may be the most famous way to make sense of mass quantities of log data, but it's far from the only player around

DNA fingerprint

Microsoft is making big data really small using DNA

A gram of DNA could store close to a trillion gigabytes of data

Spark Java microframework

Apache Spark powers live SQL analytics in SnappyData

The same team that created GemFire builds on Spark in a new open source database that can analyze OLTP and OLAP workloads side-by-side

fingers keyboard code hands programming

New functional programming language can generate C, Python code for apps

The open source Futhark makes it easier to program for GPUs that speed up machine learning and other math-intensive apps

house displayed on a hand 000005812122

Hot property: How Zillow became the real estate data hub

The R language, open source analytics software, and a migration to AWS are helping Zillow cement its position as the leading real estate data provider

Legal

How do you stop patent trolls? This algorithm just might do the trick

With 4.2 million listings so far, All Prior Art aims to 'democratize ideas'

Chair race    187891029

Look out, Spark and Storm, here comes Apache Apex

A new open source streaming analytics solution derived from DataTorrent's RTS platform, Apex offers blazing speed and simplified programmability. Let's give it a spin

Data lakes 101: Come on in, the water's fine

How to plan for and build a central hub for data analytics with the ever-evolving Hadoop ecosystem

machine learning

Why machine learning is the new BI

Get ready for artificial intelligence and automation that helps you make business decisions rather than just understanding what happened in the past

Abstract background texture with bright clouds in windows

Cloud review: Amazon, Microsoft, Google, IBM, and Joyent

The top five public clouds pile on the services and options, while adding unique twists

nosql

NoSQL chips away at Oracle, IBM, and Microsoft dominance

MongoDB, Cassandra, Basho, Couchbase, and MarkLogic are quietly insinuating themselves into enterprise data centers and cloud deployments

analytics big data stats statistics charts

Apache Beam wants to be uber-API for big data

New, useful Apache big data projects seem to arrive daily. Rather than relearn your way every time, what if you could go through a unified API?

5 steps to a modern data architecture

Becoming a true data-driven organization requires adopting a more centralized approach to data architecture and analysis

global network

Review: Amazon Web Services is eating the world

Amazon continues to define the cloud with an unrivaled set of services for developers, IT, and data crunchers

Load More