ISSUE 266 · January 7, 2020

In the News

Precision fisheries: Navigating a sea of troubles with advanced analytics
This new report from McKinsey & Company explores how advanced analytics may help struggling fisheries thrive while simultaneously protecting endangered ocean resources.

Sponsored Link

Evaluating Generative Adversarial Networks (GANs)
With deep fakes entering the mainstream, data scientists and researchers are assessing whether to leverage GANs in their own workflows. To help industry with this assessment, the upcoming Domino Data Lab webinar covers an implementation of a basic GAN model and demonstrates how adversarial networks can be used to generate training samples. Register here.

Tools and Techniques

Why I use R. They said the war was over...
This isn't another R versus Python post. Gordon Shotwell offers a smart, thoughtful perspective on his preference for R and on how, ultimately, programming languages are "just bundles of trade-offs."

Modeling salary and gender in the tech industry
In her latest post, Julia Silge uses data from the 2019 Stack Overflow Developer Survey to explore how gender affects salary for people who code. The post is insightful, easy to follow, and a great walk-through of her approach to modeling the data.

Iterated mark and recapture
Suppose you have a population of wild animals and you want to estimate the population size. It's impractical to catch them all, so what do you do? This post walks through the problem with a nice demonstration of Bayesian inference.

Sheetfu
Sheetfu is a small library that provides an easy way to interact with Google Sheets from Python. With Sheetfu, you can get or set cell values, background colors, font colors or any other cell attributes that are supported by the Google App Script API.

tidypredict
tidypredict lets you move your predictions to your database! Fit a model in R and then use tidypredict with dplyr to create a runnable SQL statement.
Supports a variety of models including linear regression, GLMs, random forest, XGBoost, tree models and more.

Comet: Machine Learning Experiment Management
Join tens of thousands of data scientists worldwide who use Comet.ml

Resources

10 ML & NLP Research Highlights of 2019
Great summary of 2019 research highlights by Sebastian Ruder. Each highlight includes a short summary, links and an outlook for the future.

FiveThirtyEight Datasets
FiveThirtyEight is an online news outlet that uses statistical analysis to tell stories about elections, politics, sports, science, economics and lifestyle. Many people don't realize that FiveThirtyEight also shares the data behind each of its articles so you can verify the analysis or dig in and find other stories. This guide to the articles and data is a great learning resource.

Data Viz

2019 was hotter than normal - what does that mean?
What does "normal" even mean?

Anomagram: Interactive Visualization for Autoencoders
This project by Victor Dibia is a gentle introduction to anomaly detection with autoencoders. His introduction to the project on Medium is also worthwhile.

Job Board
Data Elixir is curated and maintained by @lonriesberg. For additional finds from around the web, follow Data Elixir on LinkedIn, Twitter or Facebook.