Data Science

Data Science | News, how-tos, features, reviews, and videos

Artificial intelligence computer brain circuits electronics grid
abstract data analytics

sparkler / firework / sparks / celebration / hands

Microsoft revamps machine learning tools for Apache Spark

The new open source release integrates Spark with Cognitive Toolkit and other Microsoft machine learning offerings

Do More With R [video teaser/video series] - R Programming Guide - Tips & Tricks

Reshape data in R with the tidyr package

See how the tidyr R package’s gather and spread functions work. Plus a bonus look at labeling in ggplot2 with the directlabels package

bos 2018 machine learning

The best open source software for machine learning

InfoWorld’s 2018 Best of Open Source Software Award winners in machine learning and deep learning

teamwork - collaboration

Why data science teams should operate like startups

Successful startups can teach data science teams a lesson or two.

stats big data stocks analytics

Microsoft adds Python support to Power BI

A preview feature allows Python scripts to be used as data sources and as a way to create visualizations in Power BI Desktop

watering a flower 115579466

What’s new in TensorFlow machine learning

Google’s TensorFlow 2.0 beta is expected later this year, with a focus on improving performance and correcting mistakes in compatibility and continuity

shortcut through a maze

Why there are no shortcuts to machine learning

As long as companies understand that good data science takes time in an enterprise, and give these people room to learn and grow, they won’t need shortcuts

weird science 2017universe

What’s new in Julia: Version 1.0 is here

New package manager, better optimization debut in the first production release

jw calculate

Julia tutorial: Get started with the Julia language

Want the convenience of a dynamic language and the performance of a compiled statically typed language? Try Julia

computer waste junk pile

Data is inherently messy. Is that really such a bad thing?

In an imperfect world, consider shifting your data quality mindset from “how do I clean all this up?” to “how do I make the most of this state of affairs?”

abstract data statistics

Practical use cases for real-time decisioning

Real-time decisioning has its place in almost every transaction and interaction you have with data and technology today

data science certification eye with raining binary numbers

What’s new in the Anaconda distribution for Python

Anaconda 5.2 adds job scheduling, support for GPUs, and integration with version control systems including Git and GitHub

newspaper headlines

What’s new in Python 3.7

Python 3.7 adds new classes for data handling, optimizations for script compilation and garbage collection, and faster asynchronous I/O

Digital explosion of data and numbers

What is Julia? A fresh approach to numerical computing

A “no compromises” programming language for data scientists, Julia combines the ease of a dynamic language with the speed of a compiled language

brain-shaped thought bubble showing flow of alphabetic characters

In an age of fake news, is there really such a thing as fake data?

The pitfalls and benefits of using synthetic data to train AI algorithms

data visualization

Why dynamically visualizing relationships in data matters

The ability to visualize data, their relationships to one another and connections to business objectives gives organizations the power to uncover insights that would otherwise elude them

statistics stats big data analytics

It’s time we tapped APIs for business analytics

With so much information flowing through APIs, the API management system offers a central hub for business insight

2 test exam vision

How to test developer, data science, and devops job candidates

Even if they can show you code on GitHub, you want your finalists to do a trial project as a real, objective test. Here are the test projects you should use

Load More