Tools and Techniques
Shirin Glander's latest tutorial shows how to run machine learning applications on a Spark cluster. It's a well-organized walk-through.
Google Spreadsheets probably wouldn't be your first choice for production use, but they're flexible and make good quick, online data stores for non-critical work. This post offers an easy way to interact with your spreadsheets programmatically using Python.
Tips and tricks for working with data using Python pandas and visualizing it with seaborn.
This tutorial introduces Spotify’s Web API for accessing detailed information about artists, albums, and tracks using R. It then combines that data with song lyrics to create a gloom index for analyzing sentiment. It's not perfect, but it's a fun project that could easily be extended to other applications.
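To give a feel for what such an index might look like, here is a minimal sketch in Python (the tutorial itself uses R). It is not the tutorial's formula: the word list, the equal weighting, and the function names are all hypothetical, and it only assumes that each track comes with a valence score between 0 and 1 plus its lyrics as plain text.

```python
import re

# Hypothetical, deliberately tiny list of "sad" words.
# A real project would use a proper sentiment lexicon instead.
SAD_WORDS = {"alone", "cry", "lost", "pain", "sad", "tears"}


def sad_word_share(lyrics: str) -> float:
    """Return the fraction of words in the lyrics that appear in SAD_WORDS."""
    words = re.findall(r"[a-z']+", lyrics.lower())
    if not words:
        return 0.0
    return sum(word in SAD_WORDS for word in words) / len(words)


def gloom_index(valence: float, lyrics: str) -> float:
    """Score a track from 0 (cheerful) to 100 (gloomy).

    Assumed formula: the equal-weight average of musical gloom
    (1 - valence) and lyrical gloom (share of sad words).
    """
    if not 0.0 <= valence <= 1.0:
        raise ValueError("valence must be between 0 and 1")
    return 100 * (0.5 * (1 - valence) + 0.5 * sad_word_share(lyrics))


if __name__ == "__main__":
    print(gloom_index(0.5, "I cry alone tonight"))
```

Swapping in a different lexicon or different weights changes the scores but not the structure, which is what makes this kind of index easy to adapt to other applications.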