— In the News —
If you're collecting personal data, "How should I protect this?" is actually your third question.
"Should I collect this?" is only the second question.
The first question is, "What would the worst people do if they got hold of this?"
That's where it starts. This is a great Twitter thread that's spreading fast.
— Sponsored Link —
Join a new generation of scientists. Develop the skills you need to analyze "big data" with a Master's in Data Science Online from UC Berkeley.
— Tools and Techniques —
Draw.io is a popular diagramming tool that's used for making flowcharts, process diagrams, org charts, UML, ER diagrams, network diagrams, etc. It's been around for awhile, has a mature code base and is now available as an extension for JupyterLab too.
This is a solid Quora answer that offers an objective look at the strengths and weaknesses of Python and R. The historical background helps provide context and the concluding thoughts offer useful insights for deciding which to use, when.
This looks amazing. Pandas on Ray is a new library for distributing your Pandas workflows. It's super fast, scalable and can be implemented simply by changing a single import statement. This is being actively developed and will ultimately enable Pandas for use in cloud environments.
This is a nice collection of exercises to help develop your NumPy skills. It's organized into 4 levels of difficulty and the solutions are easy to show/hide.
— Resources —
Google recently opened up their internal resources for learning about Machine Learning. The web portal, "Learn with Google AI," includes a wide variety of lessons, tutorials, and hands-on exercises, including the new "Machine Learning Crash Course," which has already had more than 18,000 signups within Google.
— Data Viz —
matplotlib is a versatile library but can be daunting to master. This guide is a solid introduction that includes a great overview of how matplotlib works under the hood, lots of examples with code snippets, and useful links to other resources.
If you ever find yourself looking for a color palette, this repository aims to provide a comprehensive list of color palettes for R. Palettes are sorted by package with options for sequential, diverging, and qualitative palettes.