Tools and Techniques
Airbnb’s marketplace contains millions of listings that users explore through search results that are generated from a sophisticated machine learning model. Searches are personalized in real-time and drive 99% of Airbnb's bookings. This post describes their "Listing Embedding" technique that helps make the search results useful and are applicable to "any type of online marketplace on the Web."
Stream processing enables organizations to access huge amounts of data in real-time but so far, it's required expert software skills to utilize. Now, there's a new approach gaining ground. KSQL is the new streaming SQL engine for Apache Kafka and it's dramatically lowering the bar.
Here's a gold mine of "documents" that demonstrate a variety of statistical concepts and programming. Many of these are interactive. Topics include things like Bayesian Basics, Mixed Models, Latent Variables, and tools for R. This is a great resource.
Have you ever developed a great solution that never got used? It may not be intuitive but even internal tools need to be marketed. This is a great post that walks through 7 key steps for making sure your data products have the impact you're hoping for.
This new post on the Distill site might just blow your mind. It's an interactive tutorial that shows what each layer of a deep learning network "sees." It's long and amazing so be prepared to spend some time with it. For a higher-level perspective, check out the New York Times article about the post, "Google Researchers Are Learning How Machines Learn."