— In the News —
Comical look at data storage, backups, and interoperability in the Star Wars universe. There's nothing important here but it's a fun read!
— Tools and Techniques —
This new series by Tony Ojeda provides a framework for exploring data with Python. This first part introduces the framework and ways to think about data preparation.
Machine Learning Crash Course
Two-part tutorial from the Machine Learning at Berkeley organization. This is very visual and easy to follow:
- Part 1 - Introduction, Regression/Classification, Cost Functions, and Gradient Descent
- Part 2 - Perceptrons, Logistic Regression, and SVMs
Here's the latest installment of the very popular, Machine Learning is Fun series. You'll need more than this to recreate Siri but it's an interesting read and is a good start for a weekend project.
Must-read article for anyone who communicates using data to non-expert audiences.
— Resources —
This new Twitter account is a curation of the best open (and preferably machine readable) datasets you can find on the web.
Interested in learning about Deep Learning but not sure where to start? Here's a very well-organized guide to tutorials, videos, and book chapters around the web.
— Data Viz —
This choropleth was made purely in rstats – using only ggplot2. Well, mostly. This step-by-step tutorial shows how to create this beautiful, publication-ready choropleth, starting with a basic default map.
There are a lot of Best-Of lists around this time of year. For data visualization, this list by Nathan Yau is one of the best.