— In the News —
Machines that truly understand language would be incredibly useful. But we don’t know how to build them.
There are sports stories we wish we could tell — but the data just isn’t there even at the highest level.
— Sponsored Link —
See what your users see. Capture everything, and we mean everything (including the full DOM and console logs), about your customer experience with one easy-to-install script. Try it free!
— Tools and Techniques —
This post by David Robinson went viral this week. There are also great discussions in the Comments.
Great Reddit AMA ("Ask Me Anything") with core members of the Google Brain Team.
Quantopian is building a crowdsourced hedge fund. This post by Quantopian's Lead Data Scientist describes a recent experiment with machine learning.
Nice discussion of near-real-time use cases and data processing strategies at Uber.
— Resources —
The Data Is Plural newsletter offers a weekly collection of interesting datasets. Here's a collection of all the datasets that have been featured so far.
Here are 17 R packages that appeared on CRAN between May 1st and August 8th that, in one way or another, provide access to publicly available data.
— Data Viz —
Useful trick for visualizing data for different groups in a facetted plot with all of the data plotted in the background.
Here's an idea that's sure to spark some controversy. The tool described here helps you design your data visualizations with "placeholder" data rather than the real thing.