— In the News —
In a major ruling by the U.S. Ninth Circuit Court of Appeals, data that's publicly available (e.g. LinkedIn profiles) has been deemed to be legally okay to scrape. Depending on the data, there may be copyright violations but that's different. The ruling has huge implications for e-commerce businesses of all sorts. This post by Orin Kerr, professor of law at UC Berkeley, explains the case, the decision, and what it means. For more, there's a worthwhile discussion on Hacker News.
— Insight —
Obsessed with numbers? This new article in the Harvard Business Review explores how that can sink your strategy.
In the era of Big Data, we’ve come to believe that, with enough information, human behavior is predictable. But number crunching can lead us perilously wrong.
— Sponsored Link —
Mode Studio combines a SQL editor, Python & R notebooks, and viz builder in one platform. Free forever. Connect data from anywhere and analyze with your preferred language. Build custom viz/data applications (D3.js, HTML/CSS) or use out-of-the-box charts.
— How-to —
Machine learning applications are subject to change in three axis: the code itself, the model, and the data. That makes them significantly more complicated to build, test, and deploy than traditional software applications. In this initial installment of a series, Martin Fowler explores these challenges while applying Continuous Delivery concepts to machine learning applications.
Welcome to the fantastic world of nerdy regex fun! This collection of interactive regex puzzles covers a wide range of difficulties from beginner to expert. There's also a Puzzle Builder, a collection of user-submitted puzzles, and a Leaderboard!
— Data Viz —
This new R package provides a ggplot2 geom that's a cross between a 2D density plot and a scatter plot. It's useful because it lets you see the overall distribution and also lets you investigate individual outlier points.
Here's what to do when your boss points to some expertly crafted visualization on the web that's well beyond anything you've ever done and says, "Make it look like this."
This is a great presentation by Nathan Yau from the recent Eyeo Festival. Nathan is the creator of the popular site called "Flowing Data" and in this presentation he walks-through a selection of his data visualization work. Nathan's work is highly regarded and this presentation offers a glimpse into his world and how he develops ideas.
— Career —
If you're interviewing, don't ever forget that you're interviewing them as much as they're interviewing you. This first-hand account by David Neuzerling offers some good yellow flags to watch for.