— Insight —
This special report by Katie Malone and Michelangelo D'Agostino explores ways to build, manage, and retain a data science team. Covers things like recruiting, interviewing, career paths, etc. There are a lot of great insights here.
The algorithms that underlie much of the modern world have grown so complex that we can’t always predict what they’ll do. Iyad Rahwan’s radical idea: observe their behavior in the wild.
— Tools and Techniques —
Great exploration of the technologies used for storing and processing data. This article covers a lot of ground but it's fairly high-level and is very well written. Covers things like relational databases, NoSQL, data warehouses, data lakes, distributed & parallel processing and cloud services. Includes discussion of the problems being solved, specific products and how the technologies have evolved.
And speaking of data management, what happened with Hadoop?? In this article, Derrick Harris, creator of ARCHITECHT, tells the story of Hadoop: the problems it solved, the unexpected challenges, and how it should serve as a "valuable lesson for anyone trying to make sense of the next big thing."
This step-by-step guide shows how to build a collaborative filtering recommender engine for research papers. The problem in this case is more complex than a typical recommendation system because the papers are part of user-curated collections - like Spotify playlists but for papers.
Large-scale language models have significantly improved the state-of-the-art on pretty much every NLP task but these large models have become cumbersome and expensive to deploy. So the team at Huggingface bucked the trend and significantly decreased the size. Here's what they gained and how you can get the code.
This introduction to graph algorithms covers key algorithms and how to implement them using Python. Each algorithm is clearly described and includes code snippets, diagrams, and real-world examples.
Vettery is an online hiring marketplace that's changing the way people hire and get hired. Ready for a bold career move? Make a free profile, name your salary, and connect with hiring managers from top employers today.
— Data Viz —
This is an awesome visualization that's gotten a lot of attention around the web this week. "The Enigma Machine is an especially neat thing to visualize because it was electromechanical. As you used it, it moved. Instead of circuit traces, it had beautiful real wires connecting its pieces..."
If you've been interested in learning D3 but not sure where to start, start here! This interactive guide to learning D3 breaks down the framework into its component parts and makes it easy to see how everything fits together. This tutorial was created by Amelia Wattenberger whose new book, Fullstack D3, has been getting rave reviews around the web.
Jessica Hullman's new article in Scientific American will be available for free for a few days so read it while you can! The article covers a variety of visualization techniques for showing uncertainty, including use-cases, examples, and strengths & weaknesses of each approach.