Data Elixir logo

ISSUE 419  ·   January 10, 2023

 

Resources

arXiv Xplorer

arxivxplorer is a new semantic search engine for arXiv papers. At this point, it's a self-funded side project, and it looks promising. Enter a query in plain English and, behind the scenes, it uses OpenAI's latest embedding model to find the most relevant papers. Or paste in an arXiv URL and it will search for similar papers. For more information and examples, see this discussion on Twitter.
arxivxplorer by Tom Tumiel
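
For readers new to embedding-based search, here is a minimal sketch of the general idea: represent the query and each paper as a vector, then rank papers by cosine similarity to the query. This is not arxivxplorer's actual code, which isn't described here. The three-dimensional vectors and the paper names are made up for illustration; a real system would get its vectors from an embedding model.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank_by_similarity(query_vec, doc_vecs):
    """Return document names sorted from most to least similar to the query."""
    scores = {name: cosine_similarity(query_vec, vec)
              for name, vec in doc_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical toy "embeddings"; real ones have hundreds or thousands of dimensions.
papers = {
    "paper_a": [0.9, 0.1, 0.0],
    "paper_b": [0.1, 0.9, 0.1],
    "paper_c": [0.7, 0.3, 0.1],
}
query = [1.0, 0.0, 0.0]

ranking = rank_by_similarity(query, papers)
print(ranking)
```

Searching for papers similar to a given paper works the same way, with that paper's vector used as the query.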

 

The Illustrated Machine Learning website

Nice collection of illustrations that show how a variety of machine learning concepts work. There's a lot of information here and it's clear and very well organized. For the list of topics, click the top-left menu. 
Illustrated Machine Learning

 

Sponsored Link

Anomalo - Data quality vitals at your fingertips

Anomalo Pulse gives you the power to improve your data quality in minutes, not quarters.

 

Reach Data Elixir readers by sponsoring an issue.

 

Tutorials, Projects & Opinions

Matrices and graphs

The single most undervalued fact of linear algebra: matrices are graphs, and graphs are matrices. Encoding matrices as graphs is a cheat code, making complex behavior simple to study. This is a great tutorial — with lots of illustrations — that shows how.
The Palindrome | Tivadar Danka
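
As a small illustration of the matrix-graph correspondence (a sketch, not necessarily an example from the tutorial itself), a square matrix can be read as a directed graph: a nonzero entry in row i, column j becomes an edge from node i to node j. Under that convention, which is an assumption of this sketch, entry (i, j) of the squared matrix counts the two-step walks from i to j. The function names and the example matrix are hypothetical.

```python
def matrix_to_edges(matrix):
    """List (source, target, weight) for every nonzero entry of the matrix."""
    return [(i, j, w)
            for i, row in enumerate(matrix)
            for j, w in enumerate(row)
            if w != 0]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    inner = len(b)
    cols = len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]

# Hypothetical 3-node example: edges 0->1, 0->2, 1->2, 2->0.
A = [
    [0, 1, 1],
    [0, 0, 1],
    [1, 0, 0],
]

edges = matrix_to_edges(A)
A2 = matmul(A, A)

print(edges)
print(A2)  # A2[i][j] is the number of two-step walks from node i to node j
```

For example, A2[0][2] is 1 because there is exactly one two-step walk from node 0 to node 2, namely 0 -> 1 -> 2.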

 

Introduction to Graph Machine Learning

Graphs are everywhere and have a wide variety of uses. This post starts from the basics and introduces ways that graphs can be used in machine learning. Includes a nice selection of linked references to go further.
Hugging Face | Clémentine Fourrier

 

SQL Tells a Human Story

You can learn a lot about an organization by reading through its SQL files. These files contain the map of how an organization works and by reading through them, you can learn how systems come together to make a business run. In this post, Laura Ellis walks through some examples of common SQL patterns and what you can learn from them.
Laura Ellis

 

Combining R and Python with {reticulate} and Quarto

Sometimes you might need to use R. Sometimes you might need to use Python. Sometimes you need to use both at the same time. This post shows you how to combine R and Python code using {reticulate} and output the results using Quarto.
Nicola Rennie

 

Don’t Get Lost in the Semantics: Jan 26th

It’s 2023 and semantic layers are back and better than ever. Join us Jan 26th to learn how data teams can think about tradeoffs between governance and flexibility to deliver better outcomes. Featuring dbt Labs’ Anna Filippova and Mode Analytics’ Benn Stancil.
Jan 26, 10 am PST. RSVP now.
// sponsored

 

Data Visualization

Science visualization trends of 2022

Helena Jambor went through a collection of scientific journals and threads from science Twitter in 2022 and distilled 10 key visualization trends. The post is well organized and includes screenshots, clear descriptions, and links to key resources.
Helena Jambor

 

Flow fields

Great step-by-step tutorial for creating flow fields, aka vector fields, using R. This is intended for creating generative art and the screenshots along the way are gorgeous.
Mathematical Art and Creative Coding | George Savva

 
 


 

Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308

Data Elixir is curated and maintained by Lon Riesberg. If you have questions or suggestions, send a note!