ISSUE 294 · July 14, 2020

In the News

Bottleneck for Coronavirus Response: Fax Machines
Before public health officials in the U.S. can manage the pandemic, they must deal with a broken data system that sends incomplete results in formats they can’t easily use. It's a messy problem that's not easily fixed.

Sponsored Link

The most advanced video annotation platform
Alegion solves for big video (4k and length) with speed, efficiency, and quality, helping companies tackle the most demanding video-based computer vision applications. With ML at its core, smart tooling offers fast loading, object tracking, interpolation, smart poly and more. Learn more about the leading video annotation platform.

Tools and Techniques

Large scale experimentation
When it's easy to run lots of experiments but expensive to observe, it's best to halt unpromising experiments as soon as possible. Here's how to think through such "experiment-rich" regimes.

The Data Science Lifecycle Process
The Data Science Lifecycle Process is a git-based framework that helps keep track of business questions, data requirements, experiments and model deployments involved in data science projects.

Making Netflix’s Data Infrastructure Cost-Effective
Here's the thinking behind Netflix's "data efficiency dashboard," which has helped reduce Netflix's overall storage footprint by more than 10%.

Testing Firefox more efficiently with machine learning
Great article about how the team at Mozilla is using machine learning to facilitate continuous integration - at scale - for Firefox development.

Darts: Time Series Made Easy in Python
Doing machine learning with time series data can get complicated fast and Darts is an open-source library that aims to simplify the process. It's inspired by scikit-learn and uses a consistent API with a powerful set of tools. This announcement explores its capabilities and motivations.

On-demand data for Excel and Google Sheets
Load refreshable data from the web.
Search and filter COVID-19 data, or look up data from Quandl and other cloud data services without leaving your spreadsheet.

Resources

Full Stack Deep Learning
This free, self-paced course is a gold mine for anyone who wants to get started with production ML in the real world. Covers things like infrastructure & tooling, training & debugging, testing & deployment, and data management. Lots of guest lectures by industry leaders too.

Awesome Machine Learning and AI Courses
Awesome collection of freely available machine learning and AI courses with videos, lecture notes, readings and assignments by some of the best AI researchers and teachers in the world.

Data Viz

a ggplot2 grammar guide
Great resource for ggplot2 users and/or people interested in learning about ggplot2. This is well-organized and each section includes a short discussion and examples with code walk-throughs.

Job Board

New on the Job Board:
Data Elixir is curated and maintained by Lon Riesberg. If you need help on a data project or have a suggestion for the newsletter, reply back to this email or grab a spot on my calendar.