— In the News —
This is a big deal that's been widely discussed around the web over the past couple weeks. This short article describes the issues, starting with how the closed-access publishing model was important when it began more than 200 years ago.
The latest Technology Quarterly from The Economist is a special issue that focuses on data and how it's being used to transform justice systems. There's a lot here including things like monitoring of individuals, crime prediction, how police bypass encryption, and how/why to maintain accountability in a data-centric era. This is a long read that dives into issues that are deep and sometimes troubling.
— Sponsored Link —
Mode Studio combines a SQL editor, Python & R notebooks, and a visualization builder in one platform. And it's free forever. Connect data from anywhere and analyze with the best language for the job, without having to jump between tools. Build custom visualizations or use our out-of-the-box charts. Share your analysis with a click—every report lives at a URL.
— Tools and Techniques —
datasheets is a library that makes it easy to read data from, write data to, and modify the formatting of Google Sheets using Python.
You often hear, “correlation doesn't imply causation” but what does that mean exactly? Correlation has a specific mathematical meaning but causation is a slippery concept that's been debated by philosophers for millennia. In this article, Quanta Magazine's Puzzle Columnist, Pradeep Mutalik, explores the questions and offers 3 problems to puzzle through.
Here's a good introduction to pass along to the colleagues you may be supporting that aren't so data-savvy.
The DevOps world is increasingly rediscovering the value of service management techniques, just as ITSM teams are adopting DevOps principles and practices. At last, ITSM and DevOps are coming in from the cold. Read the new guide, 7 Crazy Goals to Start You on Your ITSM Journey to DevOps, to learn how you can encourage crossover and boost productivity in your organization.
— Resources —
Jacob Eisenstein leads the Computational Linguistics Laboratory at Georgia Tech where he works on machine learning approaches for understanding human language. This new text goes along with an NLP course he teaches and it offers a solid introduction to current NLP techniques.
This popular series of articles about creating a data science discipline at a startup has several new sections and has been reformatted for easy reference. The link above goes to a free online version or, if you prefer print, it's also available at Amazon.
— Data Viz —
Kepler.gl is an open source geospatial analysis tool for large-scale data sets. This project came about through a collaboration between Mapbox and Uber and, like their other tools, this is powerful and makes it easy to create beautiful interactives.