ISSUE 448 · August 15, 2023Talks & ConferencesCatching up on the weird world of LLMsGreat talk summarizing the last few years of the world of LLM development. Covers what LLMs are, what you can use them for, what you can build on them, how they’re trained and some of the many challenges involved in using them safely, effectively and ethically. The post includes an edited transcript and slides if you prefer to read. Sponsored LinkApplied Data Science with PythonIn this intermediate level course from Coursera, learn how to conduct statistical analysis, critique data visualization, apply machine learning, analyze social network connectivity, and much more. Enroll for free. Posts & TutorialsHow to Match LLM Patterns to ProblemsIn this follow-up to his LLM Patterns post from a couple weeks ago, Eugene Yan explores potential problems when using LLMs and the patterns that help mitigate them. Covers issues related to both internal LLMs that you control and external LLMs that you don't control. LearnDBThis new Python project is a good opportunity to learn how databases work under the hood. Essentially, it's a simplified sqlite clone that's been implemented from scratch with the intention to make it easy to learn and tinker with. Wrapping C Code in an R PackageIf you work in R and a collaborator who works in C gives you some code, what do you do? You could rewrite their code in R. You could say, "no thanks." Or, you could follow this step-by-step tutorial and package up their C code to make it available in R. How to fill maps with density gradients with RPlotting hundreds of data points on a small map of a city and have it make any sense is hard to do. This tutorial explores the issues and shows how to fix overplotted points on maps by creating bins or filled density gradients using R, {ggplot2}, and {sf}. ResourcesProbabilistic Machine Learning: Advanced TopicsKevin Murphy's Advanced Topics book is for researchers and graduate students working in machine learning and statistics who want to learn about deep learning, Bayesian inference, generative models, and decision making under uncertainty. The final version was just released and is available in print or follow the links to download a free PDF. Tools & Codefunctimefunctime is a Python library for time-series machine learning and embeddings at scale. causactThe {causact} R package is bringing numpyro speed to R. Learning computational Bayesian inference has never been easier. OutlierWhat a yarn!Great story about tactile data visualization - with crochet! If you like the ideas here, be sure to follow the links. |