— Insight —
20+ experts, 1 architecture, 3 blueprints. This is a great article from Andreessen Horowitz that explores how data architectures are evolving as systems become increasingly based on data, rather than code. The article came out of discussions with dozens of practitioners.
Here's a twist on algorithmic bias. Recent research shows that it's not just the machines that are biased. People hold biases towards machines. Sometimes they trust the machines. Sometimes not. Understanding "why" is essential for developing effective Recommenders.
— Tools and Techniques —
Why use one machine learning algorithm when you could use all of them?! This post contains a step-by-step walk-through of how to build a superlearner prediction algorithm in R.
Structural time series display common characteristics, such as a general upwards or downwards trend; repeating, and potentially nested, patterns; or sudden spikes or drops. This applied research report shows how to work with and model structural time series.
- VisiData 2.0 is a major new release of this popular multi-tool for tabular data. "Combines the clarity of a spreadsheet, the efficiency of the terminal, and the power of Python, into a lightweight utility."
- tldrstory is a python framework for AI-powered understanding of headlines and text content related to stories.
- stacks is an R package for model stacking that aligns with the tidymodels. Model stacking is an ensembling method that takes the outputs of many models and combines them to generate a new model
- waldo is designed to find and concisely describe the difference between a pair of R objects.
— Resources —
Very well organized, choose-your-own learning path for data engineering topics. Covers things like SQL, database design, noSQL, data warehouses, data processing, pipelines, security, and more.
This new seminar series explores the frontier of machine learning systems and how machine learning changes the modern programming stack. The goal is to help curate a curriculum of awesome work in ML systems. Lectures start in the Fall and will be streamed for free online.
— Data Viz —
As part of a major new series, the Financial Times has compiled chronological chapters of the Covid-19 crisis using data from around the world. It's free to access and includes lots of data viz for key trends.
— Conferences & Events —
The IEEE VIS conference is one of the best data visualization conferences in the world and this year, it will be online and free. This is a technical conference that explores advances in visualization and visual analytics and there's a LOT to explore here. In particular, check these out: