— Insight —
Nice introduction to recent NLP advances for smart, non-experts.
The US criminal system uses algorithms to help determine who should be jailed and who should be released and it's messy. In this interactive article, Karen Hao and Jonathan Stray walk through the algorithm known as COMPAS and show how hard it is to quantify fairness.
Over the past 150 years, ideas have shifted drastically as to what counts as data, which data are reliable and who owns them. Here's how the idea of "data" has evolved over that time and gotten to be so big.
— Tools and Techniques —
"When this SQL query runs, here's how I think of what happens... " Great thread!
In this introduction to working with time-series data, Thomas Ebermann compares prominent techniques and shows how to implement them. This is a well-organized post that's easy to follow with plots and code snippets along the way. Covers Winter-Holt models, SARIMA, Neural Networks and Facebook's Prophet library.
In this post, Andreu Mora describes the design considerations and implementation decisions for a machine learning system that scores in real-time and is deployed at scale. This is a real-world design that's used by the Adyen Payments platform.
— Career —
Stack Overflow's 2019 Salary Calculator supports a wide variety of regions and technical job titles around the world. In this post on the Stack Overflow blog, Julia Silge offers a nice introduction to the data with insights into how the model works. Check out the post first and then head over to the calculator to see how your salary compares.
Great tips and insights here for figuring out good projects to include in a portfolio of ML work. Includes specific resources for generating ideas, things to watch out for, and practical ways to attract interest. Also, be sure to check out the suggested readings in the final bullet.