— In the News —
Gordon Rios has been instrumental in building up a team of data scientists at Pandora. In this interview, Rios offers his insights into what worked and what didn't as he built the team from just himself to over a dozen. An interesting and important read for anyone interested in building a data science team.
The title says it all. Here’s an in-depth overview from Harvard Magazine about the importance of Big Data, its uses, and where things are going. This is a great read.
A startup called Algorithmia has a new twist on online matchmaking. Its website connects businesses that have lots of data to researchers that have developed powerful but largely unknown algorithms. This will be an interesting company to watch.
Is data the future of sports? These guys sure think so. The Astros front office is full of engineers, data scientists, and a physicist. They call their office "The Nerd Cave," and it's lined with whiteboards "covered with algebraic formulas." Clearly, this is a work in progress but the story is a worthwhile look into the data players of the sports world.
— Tools and Techniques —
Is it art? Or is it science? Andrew Hanson explores these questions in IEEE's "Computer Graphics and Applications" magazine and shows why data visualizations should probably be a bit of both.
For anyone interested in working with Twitter, Buffer’s list of tools is a treasure. There are 59 free tools here that aid analytics, discovery, and tweet management. Great resource.
— Resources —
The Data Journalism Handbook provides a good overview of working in this burgeoning field. It includes in-depth case studies, sources for data, help for understanding data, and ideas for telling data stories. The book is free to read via the link here or you can purchase a printed copy from O’Reilly.
This spreadsheet shows military equipment that’s been given to state and local law enforcement agencies by the Defense Department. It was released in the Spring and became very pertinent recently with the Ferguson riots. This data has already been cleaned and is easy to work with.
If you’re looking for data, Quandl is a good place to start. Quandl offers unlimited and unrestricted access to over 10 million time-series datasets - for free. And in this bold blog post, Quandl founder, Tammer Kamel, says, “Our ultimate objective is nothing less than everything: all numerical data in the world available on Quandl.” This is a company worth paying attention to. They offer a large variety of curated datasets, an API and libraries to work with many popular programming languages.
— Data Viz —
If you believe a lot of what's being written about the markets lately, you’d think a bubble was about to burst. But this interactive by Bloomberg’s Visual Data Group suggests that’s far from true. There is a lot to explore here. Be sure to click on the drop-down list of questions at the top and from there, click about anywhere to drill deeper.