Tools and Techniques
Jupyter/IPython notebooks (.ipynb) now render directly on GitHub. These notebooks make it easy to capture data-driven workflows that combine code, equations, text, and visualizations, and to share them with others. Having them render directly on GitHub is huge: anyone can read a shared notebook in the browser without installing or running anything.
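For a feel of what is actually being rendered: an .ipynb file is plain JSON holding a list of cells. The sketch below builds a tiny notebook by hand using only the standard library. The helper names (make_notebook, markdown_cell, code_cell) and the demo content are made up for illustration, and the structure assumes notebook format version 4.4; in practice you would normally let Jupyter or the nbformat package write these files.

```python
import json


def markdown_cell(text):
    """A markdown cell: prose and LaTeX-style equations live here."""
    return {"cell_type": "markdown", "metadata": {}, "source": text}


def code_cell(code):
    """A code cell that has not been executed yet (no outputs)."""
    return {
        "cell_type": "code",
        "execution_count": None,
        "metadata": {},
        "outputs": [],
        "source": code,
    }


def make_notebook(cells):
    """Wrap cells in a minimal notebook structure (assumes nbformat 4.4)."""
    return {
        "nbformat": 4,
        "nbformat_minor": 4,
        "metadata": {},
        "cells": cells,
    }


# Hypothetical demo content: one text cell with an equation, one code cell.
notebook = make_notebook([
    markdown_cell("# Demo\nEuler's identity: $e^{i\\pi} + 1 = 0$"),
    code_cell("print(sum(range(10)))"),
])

# Serialize to the JSON text that would be saved as a .ipynb file.
notebook_json = json.dumps(notebook, indent=1)
```

Saving notebook_json to a file with an .ipynb extension should give something Jupyter can open. Because the code cell has not been run, it carries no stored output for a viewer to show.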
Emojineering - Discovering The Hidden Semantics Of Emoji
👍😍 The Instagram Engineering team posted two great articles this week about how they handle emoji. The posts describe how Instagram uses machine learning and natural language processing to discover the hidden meanings in this relatively new, visual language. Even if you don't think you're interested in machine learning, these are MUST READS! They are well written and easy to follow:
A great text data mining tutorial, contained in a set of IPython notebooks. It is self-contained, ships with its data, and is very well documented. It demonstrates the basic functionality of several libraries, including NumPy, SciPy, Matplotlib, scikit-learn, pandas, NetworkX, and NLTK. A fun little evening project.
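To give a taste of the kind of thing text mining involves, here is a minimal sketch that is not taken from the tutorial: it counts words in two toy sentences and compares them with cosine similarity, using only the Python standard library instead of the libraries listed above. The function names and the sample sentences are invented for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def cosine_similarity(counts_a, counts_b):
    """Cosine similarity between two bag-of-words Counters (0.0 to 1.0)."""
    shared = set(counts_a) & set(counts_b)
    dot = sum(counts_a[t] * counts_b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


# Toy documents, made up for this example.
doc_a = Counter(tokenize("Notebooks combine code, text and plots."))
doc_b = Counter(tokenize("Notebooks combine text and equations."))
doc_c = Counter(tokenize("Shazam identifies songs."))

similar = cosine_similarity(doc_a, doc_b)    # overlapping vocabulary
unrelated = cosine_similarity(doc_a, doc_c)  # no words in common
```

Libraries such as scikit-learn and NLTK offer far more robust tokenizing and vectorizing; the point here is only that the core idea fits in a few lines.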
A fascinating 10-minute presentation about the insights that Shazam derives from its users' data. Highly recommended.