Ian Pointer

Ian Pointer is a senior big data and deep learning architect, working with Apache Spark and PyTorch. He has more than 15 years of development and operations experience.

Spark 2.0 prepares to catch fire

Spark 2.0 prepares to catch fire

Today, Databricks subscribers can get a technical preview of Spark 2.0. Improved performance, SparkSessions, and streaming lead a parade of enhancements

Look out, Spark and Storm, here comes Apache Apex

Look out, Spark and Storm, here comes Apache Apex

A new open source streaming analytics solution derived from DataTorrent's RTS platform, Apex offers blazing speed and simplified programmability. Let's give it a spin

Apache Beam wants to be uber-API for big data

Apache Beam wants to be uber-API for big data

New, useful Apache big data projects seem to arrive daily. Rather than relearn your way every time, what if you could go through a unified API?

What Spark's Structured Streaming really means

What Spark's Structured Streaming really means

Thanks to an impressive grab bag of improvements in version 2.0, Spark's quasi-streaming solution has become more powerful and easier to manage

Get started with Apache Spark

Reap the performance and developer productivity advantages of Spark for batch processing, streaming analysis, machine learning, and structured queries

Which freaking big data programming language should I use?

Which freaking big data programming language should I use?

When it comes to wrangling data at scale, R, Python, Scala, and Java have you covered -- mostly

Why Spark 1.6 is a big deal for big data

Why Spark 1.6 is a big deal for big data

Already the hottest thing in big data, Spark 1.6 turns up the heat. Here are the high points, including improved streaming and memory management

5 things we hate about Spark

5 things we hate about Spark

Spark has dethroned MapReduce and changed big data forever, but that rapid ascent has been accompanied by persistent frustrations

First look: Couchbase’s new SQL for NoSQL

First look: Couchbase’s new SQL for NoSQL

Couchbase Server 4.0 addresses NoSQL’s biggest pain point with SQL-like query language for its document datastore

Apache Flink: New Hadoop contender squares off against Spark

Apache Flink: New Hadoop contender squares off against Spark

A flexible replacement for Hadoop MapReduce that supports real-time and batch processing, Flink offers advantages over Spark

Load More