Data Elixir

ISSUE 311 · November 10, 2020

 

Insight

Data Feminism

This interview with Catherine D’Ignazio and Lauren Klein is a great introduction to their recent book, Data Feminism. Don't be fooled by the title. Their book covers broad questions about power and ethics in data science and has been getting rave reviews around the web.
PAIR Blog

 
 
 

A hypothesis is a liability

About 20 years ago, a psychology experiment showed that people tend to miss obvious things, famously a person in a gorilla suit, if they're specifically looking for something else. It was called a "selective attention test" and it's become a psychology classic. Now, a similar phenomenon has been shown to occur in data analysis. When given a specific hypothesis to test, students tended to miss the gorilla in the data.
Genome Biology

 
 

Sponsored Link

Smith School of Business at Queen’s University

Future-Proof Your Career, While You Work

The Global Master of Management Analytics from Smith School of Business at Queen’s University is a 12-month program that can be taken from anywhere in the world. Master the essential strategies for applying analytics to business needs in this ground-breaking program.

 

Reach Data Elixir readers by sponsoring an issue. Click here for details.

 
 

Tutorials, Projects & Opinions

Data Quality at Airbnb - Part 1: Rebuilding at Scale

Airbnb is widely known as a data-driven organization, but it wasn't always that way. Just two years ago, the company made a deep commitment to data quality and embarked on a major rebuilding effort. This first post in a series explores the issues and the key components of what became Airbnb's Data Quality initiative.
Airbnb Engineering & Data Science Blog

 
 
 

Hierarchical Time Series With Prophet and PyMC3

When doing time-series modeling, it's not uncommon to want to make long-term predictions for multiple related time series. This is a great talk that shows how to build a hierarchical version of Facebook’s Prophet package to do exactly that.
PyMCon | Matthijs Brouns

 
 
 

Free Webinar: The Power of Spreadsheets for Achieving a Data Driven Culture

Now that companies understand the importance of a data-driven culture, leaders are searching for practical ways to reinforce this new mindset. Join Metis to learn how spreadsheets can help employees get comfortable with data and empower them to perform their own analysis without hand-holding from your advanced analytics team.
// sponsored

 

Code & Tools

Deepnote - A better data science notebook

Deepnote is a new type of data science notebook that's Jupyter-compatible, supports real-time collaboration, and runs in the cloud. This introduction shows how it can help you work more efficiently.
Deepnote

 
 
 

Pointblank

The pointblank R package makes it easy to methodically validate your data, whether it's in data frames or database tables. In addition to the validation tools, the package gives you the ability to stay up to date with the information that defines your tables.
GitHub | rich-iannone

 

Resources

Intro to Linear Algebra for Applied ML with Python

Great introduction to linear algebra, especially with regard to machine learning. This is very well-organized and includes a detailed list of linked topics, lots of diagrams, and links to useful resources around the web.
Pablo Caceres

 
 
 

List of ML and DL Conferences in 2020 and 2021

Looking for a conference to attend, sponsor or submit a talk to? This latest list from Tryolabs includes 40 machine learning conferences around the world. Most have gone virtual and offer good discounts.
Tryolabs

 

And Finally...

earth

Cameron Beccario's live weather application just got a major upgrade. What started as a small project to refresh the menu turned into a rewrite of nearly the entire site. This is one of the best applications you'll find for visualizing wind, waves, temperatures and more using weather data from around the world.
nullschool | Cameron Beccario

 

Data Elixir is curated and maintained by Lon Riesberg. If you need help on a data project or have a suggestion for the newsletter, reply to this email or grab a spot on my calendar.

 


 
Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308