— In the News —
AI Nationalism isn't here yet but machine learning is already becoming a nationalized pursuit and could become a big differentiator between countries. There are the obvious technological benefits and that leads right into economic and military advantages, as well. This longread explores the issues and what we should be thinking about now.
— Sponsored Link —
Data is the key to solving some of the world’s most challenging problems, and the need for professionals who can understand and manage that data is growing every day. The UC Berkeley School of Information is meeting that need with [email protected], a Master of Information and Data Science degree program delivered online.
— Tools and Techniques —
Here's the long-awaited second part of the Visual Introduction to Machine Learning. In Part 2, the focus is on model tuning and the bias-variance trade-off. This is an elegant, must-read tutorial.
Here's how to take advantage of spaCy and a bit of Cython for blazing fast NLP. This is a great tutorial that also includes a Jupyter notebook with examples.
It's not uncommon for data to be locked up in pdf files and sometimes, there may even be good reasons for that. pdftools is an R package that makes it easy to extract data from pdf files and get it into a useful format. This tutorial is super clear and makes it easy to get started.
In this short and insightful post, Roger Peng explores the question, "Which is more important? The person doing the analysis? Or the institutional knowledge behind the analysis?"
— Resources —
This free, online text offers a comprehensive introduction to forecasting methods using R. This newly released version is the second edition and it's also available for purchase if you prefer print.
— Datasets —
Great collection of high-quality datasets.
— Data Viz —
There are many ways to compare values and, depending on what you're trying to understand, some ways are better than others. In this post, Mike Bostock walks through some common techniques and shows how they're useful.