ISSUE 419 · January 10, 2023ResourcesarXiv Xplorerarxivxplorer is a new semantic search engine for arXiv papers. At this point, it's a self-funded side project that looks promising. Enter a query in plain English and behind the scenes, it uses OpenAI's latest embedding model to find the most relevant papers. Or paste in an arXiv URL and it will search for papers that are similar. For info and examples, see this discussion on Twitter >> The Illustrated Machine Learning websiteNice collection of illustrations that show how a variety of machine learning concepts work. There's a lot of information here and it's clear and very well organized. For the list of topics, click the top-left menu. Sponsored LinkData quality vitals at your fingertipsAnomalo Pulse gives you the power to improve your data quality in minutes, not quarters. Tutorials, Projects & OpinionsMatrices and graphsThe single most undervalued fact of linear algebra: matrices are graphs, and graphs are matrices. Encoding matrices as graphs is a cheat code, making complex behavior simple to study. This is a great tutorial — with lots of illustrations — that shows how. Introduction to Graph Machine LearningGraphs are everywhere and have a wide variety of uses. This post starts from the basics and introduces ways that graphs can be used in machine learning. Includes a nice selection of linked references to go further. SQL Tells a Human StoryYou can learn a lot about an organization by reading through its SQL files. These files contain the map of how an organization works and by reading through them, you can learn how systems come together to make a business run. In this post, Laura Ellis walks through some examples of common SQL patterns and what you can learn from them. Combining R and Python with {reticulate} and QuartoSometimes you might need to use R. Sometimes you might need to use Python. Sometimes you need to use both at the same time. This post shows you how to combine R and Python code using {reticulate} and output the results using Quarto. Don’t Get Lost in the Semantics: Jan 26thIt’s 2023 and Semantic layers are back and better than ever. Join us Jan 26th to learn how data teams can think about tradeoffs between governance and flexibility to deliver better outcomes. Featuring dbt Labs’ Anna Filippova & Mode Analytics’ Benn Stancil. Data VisualizationScience visualization trends of 2022Helena Jambor went through a collection of scientific journals and threads from science twitter in 2022 and distilled 10 key visualization trends. This is well organized and includes screenshots, clear descriptions, and links to key resources. Flow fieldsGreat step-by-step tutorial for creating flow fields, aka vector fields, using R. This is intended for creating generative art and the screenshots along the way are gorgeous. |