— In the News —
In 2011, Marc Andreessen wrote the prescient essay, “Why Software is Eating the World.” These days, software may be eating the world but yesterday’s advantage is today’s table stakes. This Wall Street Journal reprint explores how model-driven businesses are the next big thing.
Digital tape storage systems have been around since the 1950s and if you didn't know any better, you might assume that tape is either already dead or on it's way out. But unlike other technologies that eventually become obsolete, tape just keeps getting better and better.
— Sponsored Link —
Mode Studio combines a SQL editor, Python & R notebooks, and a visualization builder in one platform. And it's free forever. Connect data from anywhere and analyze with the best language for the job, without having to jump between tools. Build custom visualizations or use our out-of-the-box charts. Share your analysis with a click—every report lives at a URL.
— Tools and Techniques —
Building a REST API for your models provides an easy way to collaborate with engineers and web developers. The API enables a clear division of labor: you work on the models and they work on building applications that use the models. This tutorial is easy to follow and is intended especially for data scientists.
This interactive book aims to be the best place on the internet for learning SQL. This is an awesome introduction and even if you've been working with SQL for awhile, you're likely to learn something here.
Nice collection of tips and tricks for Python users.
Great post that shows how to create an R package, step-by-step. This builds on Hilary Parker's well-known post from 2014 and walks through some of the latest developments in the package development space.
Databot is a Python framework for data-driven programming. What that means is that functions are connected by pipes and they communicate via data. When data comes in, the function is called. That lets you decouple data and functionality. For details, check out the docs and/or get the repo.
— Data Viz —
In his latest post, Thomas Lin Pedersen shows how to create an animated time series plot using gganimate. Plots like this might not show more than a static plot of the full dataset but animation can be useful for emphasizing the time-dimension of the data. A working example is embedded in the post as a gif.
Data-Driven Storytelling is a new book that offers insights into how to organize, create, and tell stories with data. Chapters are written by different practitioners at the intersection of data visualization and data journalism, which gives this book a broad perspective about communicating with data. For the next two weeks, it's available for free as a PDF or if you prefer print, you can order it online.