— In the News —
This new report from McKinsey & Company explores how advanced analytics may help struggling fisheries thrive while simultaneously protecting endangered ocean resources.
— Sponsored Link —
With deep fakes entering the mainstream, data scientists and researchers are assessing whether to leverage GANs in their own workflows. To help industry with this assessment, the upcoming Domino Data Lab webinar covers an implementation of a basic GAN model and demonstrates how adversarial networks can be used to generate training samples. Register here.
— Tools and Techniques —
This isn't another R versus Python post. Gordon Shotwell offers a smart, thoughtful perspective about his preference for R and how ultimately, programming languages are "just bundles of trade-offs."
In her latest post, Julia Silge uses data from the 2019 Stack Overflow Developer Survey to explore how gender affects salary for people who code. This is insightful, easy to follow and is a great walk-through of her approach for modeling the data.
Suppose you have a population of wild animals and you want to estimate the population size. It's impractical to catch them all, so what do you do? This post walks through the problem with a nice demonstration of Bayesian inference.
Sheetfu is a small library that provides an easy way to interact with Google Sheets from Python. With Sheetfu, you can get or set cell values, background colors, font colors or any other cell attributes that are supported by the Google App Script API.
tidypredict lets you move your predictions to your database! Fit a model in R and then use tidypredict with dplyr to create a runnable SQL statement. Supports a variety of models including linear regression, GLMs, random forest, XGBoost, tree models and more.
Join tens of thousands of data scientists worldwide who use Comet.ml
Automatically track, compare, explain and reproduce your ML models and experiments. Sign-up for free.
— Resources —
Great summary of 2019 research highlights by Sebastian Ruder. Each highlight includes a short summary, links and an outlook for the future.
FiveThirtyEight is an online news outlet that uses statistical analysis to tell stories about elections, politics, sports, science, economics and lifestyle. Many people don't realize that FiveThirtyEight also shares the data behind each of its articles so you can verify the analysis or dig in and find other stories. This guide to the articles and data is a great learning resource.
— Data Viz —
What does "normal" even mean?
This project by Victor Dibia is a gentle introduction to anomaly detection with autoencoders. His introduction to the project on Medium is also worthwhile.
— Job Board —
- Data Scientist - Use Data to Improve Mobile Apps that Treat Disease at Pear Therapeutics - Boston, MA
- Data Scientist with a Data Engineering Background at Pear Therapeutics - Boston, MA
- Executive Director at The Carpentries - REMOTE
- Data Analyst at Social Finance - London, UK
- Part-time Data Scientist for Applied Healthcare Research at University of Lucerne, Switzerland - REMOTE
- Research Associate, Center for Data Insights at MDRC - New York, NY or Oakland, CA
- Data Analytics Course Mentor at Springboard - REMOTE
- Senior Data Scientist at Intercom - San Francisco, CA, USA
- Data Engineer at OECD (Organisation for Economic Co-operation and Development) - Paris, France