— In the News —
Cars are the vehicle here, but algorithms are increasingly making life-or-death decisions. While the outcomes may be black and white, the decisions that lead to them are far less clear.
Machine learning algorithms are looking through old, discarded datasets and making some surprising discoveries. Considering the high cost of new experiments and the pace of machine learning advances, it's likely that these are the early days of retroactive data mining.
Although the methods aren't new, there sure is a lot of enthusiasm lately for neural networks, AI, and machine learning. Here's a good overview of the reasons, the opportunities, and where things are going.
— Tools and Techniques —
Nice exploration of learning how to work with GraphViz. It's not technical, but it may inspire you to play!
Great introductory tutorial for exploring a dataset using the PyData stack (NumPy, Pandas, Matplotlib, and Seaborn). This is a recent post by Jake VanderPlas, who consistently creates very clear and engaging tutorials.
Trifacta Wrangler is a new desktop tool that enables users to easily format and clean data for analysis. It's free and has been getting great reviews around the web this week.
Nice collection of interview questions curated from actual interviews and sources on the web. This is very well organized and definitely worth bookmarking.
— Data Viz —
Edward Tufte is widely regarded as a pioneer in the field of data visualization. His books are classics and his one-day workshops have had over 275,000 participants. In this interview on PolicyViz, Tufte discusses the current landscape and where things are going. It's all worthwhile, but if you don't have time for the entire interview, the best insights start at 27:20.
This is a great exploration of why storytelling is important in data science and, even better, how to get started. The job of a data scientist is to be the mouthpiece of the story that every data set tells.
— Conferences —
Extract gathers 600 of the biggest and baddest minds in the dataverse to share cool stories about how they have used data to evolve products, grow teams and build global companies. Extract has incredible keynotes and data workshops from the likes of Reddit, Lyft, Tableau, Baidu, CartoDB, Import.io, Kaggle and more.
Data Elixir readers get a special 25% discount on tickets when they enter this promo code: Extract-Elixir
See you October 30 in San Francisco!