Data Elixir logo

ISSUE 350  ·   August 24, 2021

 

In the News

How big data carried graph theory to new dimensions

Researchers are turning to the mathematics of higher-order interactions to better model the complex connections within their data.
Quanta Magazine | Stephen Ornes

 

Sponsored Link

Announcing TransformX Conference

Announcing TransformX Conference: Driving AI from Experimentation to Reality

Join Scale AI for our two-day, virtual conference. We’re bringing together a community of leaders, visionaries, practitioners, and researchers across industries as we explore the shift from research to reality within AI and Machine Learning. 100+ speakers, 60+ sessions, 20,000+ attendees.

 

Reach Data Elixir readers by sponsoring an issue. Click here for details.

 
 

Tutorials, Projects & Opinions

Exploring R² and regression variance with Euler/Venn diagrams

Great post that uses Euler and Venn diagrams to explain R² and shared variation in regression models. This is a very intuitive approach!
Andrew Heiss

 

ML Explained in 5 Levels of Difficulty

If you're new to the field or need a good way to explain the work you do, this is a great ML explainer video by Hilary Mason. Hilary is a well-known data scientist who has been on the founding team of several ML startups. In this video, she explains machine learning to 5 different people: a child, a teen, a college student, a grad student and an expert.
Wired | Hilary Mason

 

Nearest Neighbor Indexes for Similarity Search

Nice introduction to similarity search, including and an overview and comparison of key options.
Pinecone | James Briggs

 

Pitfalls in Machine Learning Research: Reexamining the Development Cycle

Machine learning tends to be hindered by an ad hoc design process, poor data hygiene, and a lack of statistical rigor in model evaluation. This paper explores common pitfalls, case studies, and offers practical recommendations for improvements.
arXiv | Stella Biderman, Walter J. Scheirer

 

Patterns in confusing explanations

Clear writing is a superpower but it can be a LOT of work. In this post, Julia Evans helps simplify the process by identifying common patterns of unclear writing and suggesting examples of what to do instead.
Julia Evans

 

DataCamp for Business: All you need for learning and doing data science

Measure your team’s skill gaps with timed assessments, receive personalized learning recommendations, and get them certified as professional data scientists. Assign your team custom learning tracks & let them apply their skills in our cloud-based IDE. Join over 18,000 data teams using DataCamp. Request a free demo today.
// sponsored

 

Resources

RStudio Cheatsheets

These cheatsheets for a variety of R packages have recently been updated to include the latest features. Covers dplyr, ggplot2, lubridate, forcats, reticulate, the RStudio IDE, Shiny, and stringr.
RStudio Blog | Averi Perny

 

Data Visualization

Visualizing ordinal variables

Ordinal variables aren't numeric and they’re not categorical, which makes them hard to make sense of. This post explores why they're so tricky and ways to approach them.
Octavio Medina

 
 

Sign up to get Data Elixir's  data science newsletter in your Inbox >>

 
 
 
Data Elixir logo

Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308

Data Elixir is curated and maintained by Lon Riesberg. If you have questions or suggestions for the newsletter, just reply back to this email.

Unsubscribe