— In the News —
Facebook just released its own version of a smart, digital assistant you can interact with. It's called "M" and it's a lot different than Apple's Siri or Google Now. With its live, human trainers in the loop that help its neural nets learn, M has the potential to be a game changer.
The hype around big data has caused many organizations to hire data scientists without giving much thought to what these data scientists are going to do and whether they’re actually needed. This post discusses some questions you should ask yourself before deciding to hire your first data scientist.
— Sponsored Link —
Only Localytics brings app marketing and analytics together. If you either have or are thinking about an app of your own, this set of templates will help you define your app feature roadmap, now and over time.
— Tools and Techniques —
Great introduction to machine learning by Randy Olson. Includes data tidying, exploratory analysis, classification and reproducibility.
A cohort is a group of users who share something in common, such as a sign-up date, first purchase month, birth date, acquisition channel, etc. This tutorial provides a good foundation for tracking these groups over time, which help you spot trends and understand repeat behaviors.
Using cameras installed in the catwalks of every arena, the NBA is tracking the movements of every player on the court and the basketball 25 times per second. This tutorial shows how to access and work with that data.
Ever have data trapped in a Word doc? Here's a good description of how Word docs are organized under the hood and how to easily get data out.
— Resources —
Nice collection of eBooks covering a variety of topics in business analytics, data mining, big data, machine learning, algorithms, tools, and programming languages. Along with a link to the online version, many of the titles include a link to purchase a hardcopy at Amazon. Sometimes lists like these link to torrents and pirated copies but these look legitimate.
— Data Viz —
Through crowdsourcing and citizen science projects, the general public is making profound contributions to research. Can data visualization help make sense of this wealth of new information? Geoff McGhee explores this question in National Geographic's new visualization series, Data Points.
Tamara Munzner is a professor at the University of British Columbia Department of Computer Science and holds a PhD from Stanford. She has been active in visualization research since 1991 and has published over sixty-five papers and chapters. Her book Visualization Analysis and Design appeared in 2014. This Reddit AMA is a fantastic read.