ISSUE 323 · February 16, 2021InsightHow to build data literacy in your companyA recent Gartner survey found that poor data literacy in an organization is one of the top three barriers to building a strong data and analytics team. Meanwhile, a survey by Accenture showed that most employees aren't confident in their data skills. This post explores how companies should think about data literacy and develop it as an asset. Sponsored LinkPublish your data science research in PatternsPatterns is a new gold open access journal that promotes all types of cross- disciplinary data science research and outputs. We bring together researchers to solve key scientific problems that cross domain boundaries. Publish your work with us today! Tutorials, Projects & OpinionsBuilding the Modern Data StackFor organizations that are leveling-up their data stack, figuring out which combination of tools to use is complicated. This post explores the issues and offers a framework with specific suggestions to simplify the process. Includes insights from interviews with hundreds of data teams. Supercharging Apache SupersetApache Superset is an open-source data exploration platform that's designed to be visual and easy to use. It started as a hackathon project at Airbnb, joined Apache as an Incubator project, and has since grown to be a mature BI tool for the enterprise. This is an awesome introduction to the project and how Airbnb has customized it for BI at scale. Data Observability, Part II: How to Build Your Own Data Quality Monitors Using SQLIn this SQL tutorial, Barr Moses and Ryan Kearns show how data teams can use metadata gleaned from schema and lineage to understand the root cause of data anomalies. In other words, this tutorial will help you contextualize the "why" and "how" behind broken data pipelines. Code & ToolsWTF Python! 😱Strain your brain with this collection of code snippets that demonstrate some counter-intuitive and lesser-known features of Python. PloomberPloomber makes it easy to turn your notebooks, scripts or Python functions into a reproducible data pipeline for data science or ML. Math InspectorMath Inspector is a visual programming environment for scientific computing based on numpy and scipy. Imagine web inspector, except for math and with block coding. CareerHow Can We Fix the Data Science Talent Shortage?New grads with data science skills are getting a cool reception from a job market that's supposedly hot for data scientists. This post explores the reasons why, what employers are really looking for, and how people new to the field should approach it. ResourcesPatterns, Predictions, and ActionsHere's a new machine learning gem that's worth downloading. This is a graduate level text that covers the foundations of prediction and pattern recognition, moving from decision theory to supervised learning to causality and reinforcement learning. Intended for readers who have some experience with probability, calculus, and linear algebra. Modern Data Science with RThe second edition of Modern Data Science with R is coming out in March but the Bookdown version is already available for free online. Data Elixir is curated and maintained by Lon Riesberg. If you have questions or suggestions for the newsletter, just reply back to this email. To find specific content from prior issues or to research topics, check out the catalogued archives on Data Elixir's new Search Page >> |