No Images? Click here ISSUE 260 · November 19, 2019InsightA global survey of journalism and AIThis report from the Polis think-tank explores how AI is already being used in journalism and how newsroooms think about the implications. Given how fast the tech is moving, this is important work that spans 71 news organizations in 32 different countries. 13 Big Data Trends Industry Pros Are WatchingEvery organization has their own use-cases, pain points and perspectives regarding data. In these interview snippets, data pros at 11 tech companies share insights from their corner of the data world. Sponsored LinkAutomated Data Governance 101Data is more vulnerable than ever – and the way we control our data isn’t working. Governing data and putting it to use are dueling objectives and businesses are stuck in the middle. Download this white paper for an introduction to automated data governance, which introduces speed, agility, and precision into the process of applying rules on data. Tools and Techniques74 Summaries of ML and NLP Research PapersThese short summaries of Machine Learning and NLP research papers cover a wide variety of authors, topics and venues from the past couple of years. Includes key points, diagrams and links for each paper. TileDB: A Database for Data ScientistsTileDB is a new database that's designed from the bottom-up for data science. This post explores key problems with current solutions and how TileDB's approach is a more effective way to store, update, analyze, and share large sets of diverse data. An Introduction to Confident LearningConfident Learning is an emerging field for characterizing label noise, identifying errors and learning using datasets with noisy labels. In this post, Curtis G. Northcutt describes what it is exactly, practical applications and how it works. This post also introduces a new open-source Python package for cleaning labels called the cleanlab. Advanced Training For CV Models: 3 Key Aspects Of Video AnnotationTo maximize the potential in video data, there are 3 key aspects to consider when determining an approach: Entity Persistence, Detecting State Change, and Temporal Tagging. An enterprise-grade labeling platform
that employs a holistic annotation approach can scale video annotation without diminishing complexity and compromising accuracy. ResourcesI haven't paid much attention to Microsoft since Windows 8 but I was at the Ignite Conference a couple weeks ago and there is a LOT that's worthwhile at Microsoft these days. Seriously. Along with the new HoloLens 2 and Project Silica, here are a few key resources that are worth checking out:
Data VizWhy scientists need to be better at data visualizationThe scientific literature is riddled with bad charts and graphs, leading to misunderstanding and worse. This post offers practical guidelines for choosing charts and colors to help others (and you!) understand your research. Includes lots of visual examples and common mis-perceptions. A network of science: 150 years of Nature papersBeautiful presentation of the network of paper citations at Nature. This short video is nice introduction to the project. The interactive is here >> Conferences & EventsMetis Webinar | AI ROI: The Questions You Need to Be Asking - As business leaders increase investment in advanced analytics, data science, and AI, many struggle to recognize a return on those efforts. During this free Metis Corporate Training webinar Kerstin Frailey, Senior Data Scientist and Head of Executive Corporate Training at Metis, will walk through what you need to ask before, during, and after the lifetime of a data science project. Thursday, December 5, 12pm ET ![]() Data Elixir is curated and maintained by @lonriesberg. For additional finds from around the web, follow Data Elixir on LinkedIn, Twitter or Facebook. |