— In the News —
Gaze estimation is a classic problem of machine vision, which can now be solved by one computer training another. Here's a good overview of how that works.
A group of researchers at Baidu have been barred from participating in a major international AI competition when it was discovered that they gamed the system. Here's what happened and why it's a big deal.
It's no surprise that a lot of people think personalized services and targeted advertising is NOT a fair exchange for their personal data. New studies reveal exactly what people are comfortable with and how that will ultimately impact research in the near future.
In this article in Nature, four leading researchers share their concerns and solutions for reducing societal risks from intelligent machines. This is a thought-provoking read.
— Tools and Techniques —
Airbnb helps its hosts set reasonable prices by showing the probability that a listing will be accepted. Many features go into predicting the demand for a listing and these features interact in complex ways. To help users understand the results, Airbnb has created Aerosolve: "a machine learning package built for humans." This article on Airbnb's engineering page is a worthwhile overview of why it's valuable and Aerosolve's GitHub page provides details for using it.
Great reference for exploring data with Python. There are lots of tricks here that are worth being familiar with. Give it a skim and then bookmark it.
— Resources —
Because of the breadth and availability of its content, Wikipedia has been widely used as a reference dataset for research in machine learning. That seems reasonable but unfortunately, a large percentage of Wikipedia pages are essentially fluff that have been created by robots. Here's a short article explaining the problem with a link to a cleaned Wikipedia dataset.
— Data Viz —
Andy Kirk has recently updated his collection of resources on his Visualizing Data blog. This is a comprehensive collection that contains nearly 300 tools. It's well-organized into a variety of tagged categories such as Data Handling, Charting, Programming, Multivariate, and Color among others. This is definitely worth bookmarking.
William Zinsser authored a classic writing book called, On Writing Well, which identifies seven key principles for effective writing. As Ben Jones shows in this article, those same principles apply directly to visual communication.
— Archive Pick —
What exactly is a "data scientist" anyway?
Tom, Dick & Harry showed up for an interview at the new grocery delivery startup. There was a whiteboard, a laptop & a notebook, so they could use whatever makes them comfortable...
— About —
Data Elixir is curated and maintained by @lonriesberg. If you find this newsletter worthwhile, please help spread the word! Forward to your colleagues or use the links below to share to your favorite network: