— In the News —
Ben Thompson writes at the intersection of technology and business strategy in his popular Stratechery blog. In this post, he explores how big online advertising businesses are essentially "data factories" and what that means for everyday users, businesses, and regulators.
Everyone wants to be "data-driven" these days, but that's not always the best approach. This article takes a look at the risks of a data-driven culture and offers a well-reasoned alternative.
— Tools and Techniques —
Being able to quantify uncertainty is key to really understanding your data. This is a great walk-through of a variety of methods, including bootstrapping, confidence intervals, regression, and Monte Carlo methods.
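To give a flavor of the first method on that list, here is a minimal sketch of a percentile bootstrap confidence interval in Python. It is not taken from the linked walk-through: the function name `bootstrap_ci`, its parameters, and the sample values are all made up for illustration, and the percentile method is only one of several ways to build a bootstrap interval.

```python
import random
import statistics


def bootstrap_ci(data, stat=statistics.mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for `stat` (hypothetical helper, illustration only)."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(data)
    # Resample the data with replacement and recompute the statistic each time.
    estimates = sorted(stat(rng.choices(data, k=n)) for _ in range(n_resamples))
    # Take the alpha/2 and 1 - alpha/2 percentiles of the resampled statistics.
    lo_idx = max(0, int((alpha / 2) * n_resamples))
    hi_idx = min(n_resamples - 1, int((1 - alpha / 2) * n_resamples) - 1)
    return estimates[lo_idx], estimates[hi_idx]


# Made-up measurements, purely for demonstration.
sample = [2.1, 2.5, 1.9, 3.2, 2.8, 2.4, 3.0, 2.2, 2.7, 2.6]
low, high = bootstrap_ci(sample)
print(f"mean = {statistics.mean(sample):.2f}, 95% CI roughly ({low:.2f}, {high:.2f})")
```

The appeal of the approach is that it makes no assumption about the shape of the underlying distribution. The trade-off is that small samples can give intervals that are too narrow.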
The process of developing machine learning models is very different from what most engineers are accustomed to. In this post, Emmanuel Ameisen describes the differences and introduces an approach he calls the "ML Engineering Loop." It's an iterative approach that enables rapid discovery and development of the best models.
Sebastian Ruder's latest post offers high-level overviews of recent NLP advancements with a focus on neural network-based methods. The post is organized around eight key milestones and includes lots of linked references.
dbdiagram.io is a database diagram designer for analysts and developers. Create and visualize database schemas using just your keyboard.
— Data Viz —
av is a new package for working with audio and video directly from R. It's built on the FFmpeg AV libraries and makes it easy to create and edit videos. Here are the highlights, along with code snippets and embedded video examples.
— Career —
As the field of Data Product Management matures, it's dividing into multiple sub-areas. This article takes a look at the evolving role of Data PMs and where things are going.
— In Case You Missed It —
Be sure to catch the most popular articles from last week's Data Elixir...