Data Elixir logo

ISSUE 449 · August 22, 2023

 

Posts & Tutorials

Analysis of the data job market using "Ask HN: Who is hiring?" posts

This analysis of the monthly "Who is Hiring" discussion on Hacker News explores 10 years of posts to understand the trends in the data job market, with a particular focus on data science. It's based on Hacker News posts, which aren't necessarily an accurate reflection of the job market but it's a great analysis, with useful insights along the way.
Emir U

 

Is probability frequentist or Bayesian?

The single biggest argument about statistics: is probability frequentist or Bayesian? It's neither, and this post explains why. Buckle up — this is a deep-dive explanation.
The Palindrome | Tivadar Danka

 

Open challenges in LLM research

Great post summarizing the top challenges facing large language models. If you're looking for problems to solve or are interested in understanding LLM weaknesses, this is a good place to start.
Chip Huyen

 

Sponsored Link

Get an edge in AI careers of tomorrow. Learn the concepts that drive it today.

Brilliant makes it easy. In just 15 minutes a day, you can level up in the core building blocks of AI, as well as data science, programming, logic, and beyond. Thousands of bite-size, interactive lessons help you master the key concepts and make it insanely easy to build a daily learning habit. Join over 10 million people and start your 30-day free trial today.

 
 
 

Reach Data Elixir readers by sponsoring an issue. for details.

 

Tools & Code

Automatic Generation of Visualizations using LLMs

LIDA is a new library for generating data visualizations and infographics using large language models. It's language agnostic and can work with any visualization library too. There's a lot here, including a notebook tutorial, a paper, and a code repo.
Microsoft Open Source

 

QGIS in R

QGIS is a popular open-source geospatial data system that lets you create, edit, visualize, analyze and publish geospatial information from any platform. Now, with the new qgisprocess package, you can work with QGIS directly from R. 
RSpatial | Dewey Dunnington, Floris Vanderhaeghe, et al.

 

Career

Open Doors in Your Career by Unlocking the Dark Side of Your Superpower

There's a dark side to every superpower but you need to understand those dark sides to move past them. If you've ever felt stuck in your career, the insights here could help you get unstuck. Great post.
The Skip | Nikhyl Singhal

 

Resources

Anti-hype LLM reading list

Great collection of posts and papers that dive into LLMs and how to work with them. This is an evolving resource by Vicki Boykis who's particularly interested in practical first-hand accounts.
Vicki Boykis

 

Data Visualization - Fall 2023

This free online course by Andrew Heiss just started and it looks great. The course covers a wide variety of topics and includes video lectures, examples, assignments, and pointers to books, articles, and key resources that are all free.
Georgia State University | Andrew Heiss

 

Outlier

The Nature of Code

The Nature of Code is a beginner-friendly creative coding tutorial that explores a range of programming strategies for developing computer simulations of natural systems—from elementary concepts in math and physics to sophisticated machine learning algorithms. This is the newly updated, 2nd edition of this popular book and it's free to read online.
Daniel Shiffman

 
 

Sign up to get Data Elixir's  data science newsletter in your Inbox >>

 
« Previous Issue   Next Issue  »  
 
 
 
Data Elixir logo

Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308

Data Elixir® is curated and maintained by Lon Riesberg. If you have questions or suggestions, send a note!