— In the News —
This article from the latest McKinsey Journal takes a look at the importance of organizational culture on the success of data analytics efforts. Leaders throughout industry were interviewed for the article and snippets of those interviews are organized into 7 key principles that underpin a healthy data culture.
Dataset Search is a new tool from Google that aims to make it easy to find data from the thousands of data repositories around the web. This looks like the start of an awesome resource. Check out the post for details, including how to make sure your own datasets get indexed.
— Sponsored Link —
Vettery specializes in tech roles and is completely free for job seekers. Interested? Submit your profile, and if accepted onto the platform, you can receive interview requests directly from top companies growing their data science teams.
— Tools and Techniques —
An anonymous op-ed in The New York Times got a lot of attention in Washington last week. Key people were calling for the author's identity to be revealed, which The New York Times refused to do. In this post, David Robinson offers a great walk-through of how to quickly scope out the problem using a document similarity approach. It's not definitive but it's a fantastic tutorial.
deon is a command line tool that allows you to easily add an ethics checklist to your data science projects. A lot of thought has gone into this tool and there are a lot of examples to get you thinking. This is a project from DrivenData, which hosts data science competitions to save the world.
Insightful post on Jupyter notebooks, IDEs, and R. This is a long read that will get you thinking.
StitchFix sells clothes but fundamentally, it's a data science organization. Using personalized recommendations at scale, they sold nearly a billion dollars worth of clothes last year and they're growing fast. In this post, Elizabeth Bennett peeks into Stitch Fix’s Data Science culture and explores how that culture drove their infrastructure design.
Nice introduction to knowledge graphs with insights for practical uses at an organization like Airbnb.
A friendly introduction to matrix factorization and how it's used to recommend movies in Netflix. This is a very clear half-hour video tutorial with an accompanying notebook.