ISSUE 420 · January 17, 2023ResourcesModern PolarsIf you're not already familiar with it, Polars is a multithreaded, memory efficient, and very fast DataFrames library. This online book is a side-by-side comparison of Polars and Pandas, based on the Modern Pandas series by Tom Augsburger. This looks like a great resource. Sponsored LinkTurn documents into structured data with SensibleSensible is the developer-first platform that makes accessing the data in documents as easy as calling an API. Avoid the complexities and headaches of PDF parsing. Learn more about Sensible → Tutorials, Projects & OpinionsLet's build GPT: from scratch, in code, spelled out.In this awesome explainer, Andrej Karpathy shows how to build and train a Transformer following the "Attention Is All You Need" paper. This is very approachable and there are a lot useful links in the notes. How to get computational superpowers via ChatGPTHow does the impressively human-like ChatGPT get computational knowledge superpowers? Give it a Wolfram|Alpha neural implant! Making predictions from a mixed model using RNice introduction to using mixed models to make predictions using R. Starts with a simple linear regression and then walks through using the mixed model package {lme4} to extract confidence intervals and predictions intervals. Tools & Codebalancebalance is a new Python package that makes it easy to adjust biased data samples. In this announcement post, Roee Eilat describes the problem of biased data, how it occurs, and how this new package works. A Jupyter kernel for GNU OctaveGNU Octave is a high-level programming language that's primarily intended for scientific computing and numerical computation. It has a mathematics oriented syntax that's mostly compatible with MATLAB and it helps solve linear and nonlinear problems numerically. In this post, Giulio Girardi introduces a GNU Octave kernal for Jupyter. CareerMachine Learning in Weather & Climate MOOCThis free course introduces ML and its applications for weather and climate work. It covers a variety of research and operations areas in forecasting, ocean & climate modeling, meteorology and more. Includes expert speakers throughout and it looks like a great way to learn about a variety of careers that use ML for weather and climate. Data VisualizationAnalyzing labor markets in Python with LODES dataIn his latest post, Kyle Walker shows how to analyze and map commute patterns with the Python pygris package and LODES data. This is the second of a series where Kyle translates his favorite sections from his book, Analyzing US Census Data, to Python. Graphic WalkerGraphic Walker is an open source alternative to Tableau that's built as a React component for easy installation on a website. The interface uses simple drag-and-drop operations and is based on the Grammar of Graphics using vega-lite. |