Tools and Techniques
Great overview of how R is used at Airbnb. Includes insights regarding their daily workflow, predictive modeling, experimentation, scaling, data visualization, the tools they use, and practical insights for incorporating R into your own organization.
xgboostExplainer makes your XGBoost model as transparent and 'white-box' as a single decision tree. Here's a very clear description of why you might want to use xgboostExplainer and how it works.
If you've ever taken a taxi in New York City, you may have wondered if you would have been better off riding a bike. That turns out to be a smart question. This is a great post by Todd Schneider that shows how to think through the issues with data.
K-Means is a very simple algorithm which clusters data into K number of clusters. It's a strategy for unsupervised learning, which is often used when data isn't labeled. Here's how it works, how to implement it from scratch, common issues, and links to useful resources.
Here are some practical considerations for maintaining your machine learning models in a production environment. This is far from complete but is a good entry point for thinking through some of the issues.