— In the News —
Doctors and scientists are increasingly dipping into the aggregated medical records of millions of patients to uncover patterns that might help steer care. Godsend? Or privacy concern?
Open data is the future — of how we govern, of how public services are delivered, of how governments engage with those that they serve. And right now, it is unevenly distributed. This article explains why we should care and offers suggestions for how to fix it.
— Tools and Techniques —
csvkit started out as a small newsroom tool that made it easy to select a subset of columns from a csv file. Five years later, this tool has become a data swiss army knife. This article describes some of its most useful features which include input, processing, and output operations. If you ever work with data in an Excel or csv file, this tool is a must!
Twitter recently open-sourced an algorithm designed to ease the process of running recommendation engines at large scale. This article describes how it works and when it might be useful.
This isn't just an article about how to find stories in census data. This article lays out a strategy for finding stories in any data.
— Resources —
BBC just released an open data website that includes data on the places, people and organisations that appear in BBC programs and online content. This data already powers large parts of the BBC website, including BBC News and Sport, and is now available for anyone to access in standard open data formats.
Well-known for his data visualization work, Simon Rogers has started this list of key resources for finding the data that's used by journalists.
Great list of some of the best free big data sources available today. If you don't have a need for it now, this is definitely worth bookmarking.
— Data Viz —
It’s a puzzle wrapped in an enigma wrapped in a symphony. It’s the Higgs boson, the so-called God particle, the greatest physics find of the 21st century, turned into music and ultimately, visualizations. Does this strategy of using multiple sensory streams to analyze large amounts of data really work?
We spend hours a day online, and we see ads on every webpage we visit. But we don’t have any way of tracking the ads we’re being served. Until now.
Floodwatch is a Chrome extension that tracks the ads you see as you browse the internet. This is a Jer Thorp project, complete with personalized data visualizations. You don't want to miss this.