Data Elixir logo

ISSUE 323 ·   February 16, 2021        

 

Insight

How to build data literacy in your company

A recent Gartner survey found that poor data literacy in an organization is one of the top three barriers to building a strong data and analytics team. Meanwhile, a survey by Accenture showed that most employees aren't confident in their data skills. This post explores how companies should think about data literacy and develop it as an asset.
MIT Sloan School of Management

 

Sponsored Link

Patterns: The Science of Data

Publish your data science research in Patterns

Patterns is a new gold open access journal that promotes all types of cross- disciplinary data science research and outputs. We bring together researchers to solve key scientific problems that cross domain boundaries. Publish your work with us today!

 

Reach Data Elixir readers by sponsoring an issue. Click here for details.

 
 

Tutorials, Projects & Opinions

Building the Modern Data Stack

For organizations that are leveling-up their data stack, figuring out which combination of tools to use is complicated. This post explores the issues and offers a framework with specific suggestions to simplify the process. Includes insights from interviews with hundreds of data teams.
Raghu Murthy, Founder and CEO of Datacoral

 
 
 

Supercharging Apache Superset

Apache Superset is an open-source data exploration platform that's designed to be visual and easy to use. It started as a hackathon project at Airbnb, joined Apache as an Incubator project, and has since grown to be a mature BI tool for the enterprise. This is an awesome introduction to the project and how Airbnb has customized it for BI at scale.
Airbnb Engineering & Data Science

 
 
 

Data Observability, Part II: How to Build Your Own Data Quality Monitors Using SQL

In this SQL tutorial, Barr Moses and Ryan Kearns show how data teams can use metadata gleaned from schema and lineage to understand the root cause of data anomalies. In other words, this tutorial will help you contextualize the "why" and "how" behind broken data pipelines.
Barr Moses, CEO of Monte Carlo and Ryan Kearns

 

Code & Tools

WTF Python! 😱

Strain your brain with this collection of code snippets that demonstrate some counter-intuitive and lesser-known features of Python.
GitHub | Satwik Kansal

 
 
 

Ploomber

Ploomber makes it easy to turn your notebooks, scripts or Python functions into a reproducible data pipeline for data science or ML.
GitHub | ploomber

 
 
 

Math Inspector

Math Inspector is a visual programming environment for scientific computing based on numpy and scipy. Imagine web inspector, except for math and with block coding.
Math Inspector

 

Career

How Can We Fix the Data Science Talent Shortage?

New grads with data science skills are getting a cool reception from a job market that's supposedly hot for data scientists. This post explores the reasons why, what employers are really looking for, and how people new to the field should approach it.
Springboard | Kindra Cooper

 

Resources

Patterns, Predictions, and Actions

Here's a new machine learning gem that's worth downloading. This is a graduate level text that covers the foundations of prediction and pattern recognition, moving from decision theory to supervised learning to causality and reinforcement learning. Intended for readers who have some experience with probability, calculus, and linear algebra.
arXiv | Moritz Hardt, Benjamin Recht

 
 
 

Modern Data Science with R

The second edition of Modern Data Science with R is coming out in March but the Bookdown version is already available for free online.
Benjamin S. Baumer, Daniel T. Kaplan, and Nicholas J. Horton

 
 

Data Elixir is curated and maintained by Lon Riesberg. If you have questions or suggestions for the newsletter, just reply back to this email.

 
 
The Data Elixir Library

To find specific content from prior issues or to research topics, check out the catalogued archives on Data Elixir's new Search Page >>

 
 
 
 

Sign up to get Data Elixir's  data science newsletter in your Inbox >>

 
FacebookTwitterLinkedInWebsite
Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308
Unsubscribe