ISSUE 428 · March 14, 2023
Tutorials, Projects & Opinions
Online gradient descent written in SQL
A machine learning algorithm which could be trained using SQL opens up a world of possibilities. If you really wanted to simplify things, how much could you do in SQL?
The State of Competitive Machine Learning
In 2022, there were more than 200 machine learning competitions that were hosted by organizations such as Kaggle, DrivenData, AIcrowd, CodaLab, Zindi and others. This review summarizes the state of competitive machine learning and analyzes 67 winning solutions to figure out the best strategies to win.
Big Data Is Dead… Long Live Big Data
A couple weeks ago, the "Big Data is Dead" post stirred up a lot of interest around the web. In this counter-post, Aditya Parameswaran draws on insights from the database research community to argue that big data is certainly not dead.
Why Retailers Fail to Adopt Advanced Data Analytics
Advanced analytics are getting better all the time but many companies still use very basic tools. This article specifically looks at retailers but, for the most part, the conclusions it draws are broadly applicable.
Invest in your success with The Information.
Their team of expert journalists provide exclusive insights into the people, trends, and forces that are defining the future of technology and business, so you'll always be in the know. Subscribe now and save 25%
Have a product, service, job, or event you'd like to share with Data Elixir readers?
Sponsor an Issue | Talent Collective
Tools & Code
webR 0.1.0 - first general use version
WebR enables interactive use of R in a browser where visitors don't need to have R on their own computer and you don't need an R-enabled server. This post introduces webR, walks through use-cases, and shows how to include webR in your own web applications.
PyBroker - Algorithmic Trading in Python with ML
PyBroker is a new python framework that's designed for developing algorithmic trading strategies using machine learning. With PyBroker, you can easily create and fine-tune trading rules, build powerful models, and get insights into your strategy’s performance. Includes pre-built access to free data sources or connect your own.
Iggy: Activate your address data
Iggy enables real estate teams, analysts, and data scientists to incorporate location data and insights into products, models and analyses with a single click. Check out our new neighborhood investment reports and get insights into any address here.
The Data Elixir Job Board currently lists 40 openings for a variety of roles, including data scientists, data analysts, machine learning engineers, head of data, and more. The roles cover a variety of levels, from entry-level to Director and 14 of the jobs are remote.
If you have 3+ years of data science experience, join the Data Elixir Talent Collective where top companies apply to you. For details, check out the Collective 👉
Geographic Data Science with R
Great book for learning how to use R with time series and geospatial data to address topics related to environmental change. Starts with an intro to R and data wrangling and then continues with chapters on vector geospatial data, raster geospatial data, coordinate reference systems, distribution modeling and much more. Free to read online.
Applied Machine Learning
This newly updated online course is one of the best available for learning about topics in applied machine learning. The course is free and includes 23 lectures with detailed course notes, 30 hours of lecture videos, and 20 implementations of ML algorithms in Python notebooks.