
Data Elixir

ISSUE 294 · July 14, 2020

 

In the News

Bottleneck for Coronavirus Response: Fax Machines

Before public health officials in the U.S. can manage the pandemic, they must deal with a broken data system that sends incomplete results in formats they can’t easily use. It's a messy problem that's not easily fixed.
New York Times

 
 

Sponsored Link

The most advanced video annotation platform

Alegion handles big video (4K resolution and long duration) with speed, efficiency, and quality, helping companies tackle the most demanding video-based computer vision applications. With ML at its core, its smart tooling offers fast loading, object tracking, interpolation, smart poly, and more. Learn more about the leading video annotation platform.

 


 
 

Tools and Techniques

Large scale experimentation

When it's easy to run lots of experiments but expensive to collect observations, it's best to halt unpromising experiments as soon as possible. Here's how to think through such "experiment-rich" regimes.
MultiThreaded

 
 
 

The Data Science Lifecycle Process

The Data Science Lifecycle Process is a git-based framework that helps keep track of business questions, data requirements, experiments and model deployments involved in data science projects.  
GitHub | dslp

 
 
 
 

Making Netflix’s Data Infrastructure Cost-Effective

Here's the thinking behind Netflix's "data efficiency dashboard," which has helped reduce Netflix's overall storage footprint by more than 10%. 
Netflix Tech Blog

 
 
 

Testing Firefox more efficiently with machine learning

Great article about how the team at Mozilla is using machine learning to make continuous integration for Firefox development more efficient at scale.
Mozilla HACKS

 
 
 

Darts: Time Series Made Easy in Python

Doing machine learning with time series data can get complicated fast. Darts is an open-source Python library that aims to simplify the process. It's inspired by scikit-learn and pairs a consistent API with a powerful set of tools. This announcement explores its capabilities and motivations.
Unit 8

 
 
 

On-demand data for Excel and Google Sheets

Load refreshable data from the web. Search and filter COVID-19 data, or look up data from Quandl and other cloud data services without leaving your spreadsheet.
// sponsored

 

Resources

Full Stack Deep Learning

This free, self-paced course is a gold mine for anyone who wants to get started with production ML in the real world. It covers infrastructure and tooling, training and debugging, testing and deployment, and data management. Lots of guest lectures by industry leaders too.
Full Stack Deep Learning

 
 
 

Awesome Machine Learning and AI Courses

Awesome collection of freely available machine learning and AI courses with videos, lecture notes, readings and assignments by some of the best AI researchers and teachers in the world.
GitHub | Lukas Spranger

 

Data Viz

a ggplot2 grammar guide

Great resource for ggplot2 users and anyone interested in learning ggplot2. It's well organized, and each section includes a short discussion and examples with code walk-throughs.
Gina Reynolds

 

Job Board

New on the Job Board:

  • Senior Data Scientist at komoot (remote)
  • Senior Backend Developer / Data Scientist at komoot (remote)
  • Database Analyst at Transformation Church (border of NC/SC, US)
  • Senior Data Scientist, Enterprise at Slack (remote)
  • Director of Data Science at UnitedHealth Group (remote)
  • Senior Data Product Manager at GrubHub (New York, Chicago)
  • Staff Data Scientist at Twitter (remote)
  • Director of Data Analytics at Elevate Labs (remote)


 

Data Elixir is curated and maintained by Lon Riesberg. If you need help on a data project or have a suggestion for the newsletter, reply to this email.

 
Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308