ISSUE 324 ยท February 23, 2021InsightCan Algorithms Learn to Fight Wars Ethically?The U.S. Pentagon is spending a lot of money to develop advanced AI capabilities for the military. AI may be better at making fast decisions but when it comes to moral decisions, it could be a nightmare. There's a lot at stake and the issues are anything but black and white. External data should be part of your data strategyA 2018 report found that the most analytically mature organizations use a variety of data sources from outside of their organization. Third-party data is out there to find and use. Here's how to get started. Sponsored LinkVirtual event: Using third-party data to make smarter decisionsJoin this virtual event with Stephen Orban, author of Ahead in the Cloud and General Manager of AWS Data Exchange. He will lead a panel discussion with AWS customers to discuss the innovative techniques being used in data pipelines, with data analysis, and for data visualization. You will discover real-world initiatives and breakthroughs being enabled by third-party data. Tutorials, Projects & OpinionsData Mesh 101: Everything You Need To KnowGreat starting point for understanding all the excitement about data meshes. Follow the links for some of the best resources that explain what data meshes are, how they're useful and how to get started. Create privacy-preserving synthetic data for machine learning with SmartNoiseDifferential privacy promises significantly higher privacy protection than data anonymization techniques and it could be transformative in fields like medicine and economics. This post introduces the technique and how it's becoming the gold standard for data protection. Includes links to a detailed white paper and an open-source toolkit. Autodata: Automating common data operationsA lot of tasks in a data science workflow can be automated. In this post, Adam Marcus explores the landscape of open-source autodata tools and how it may evolve. Patterns: Publishing cross-disciplinary data science outputsThere is more to data science than just the research article. Patterns publishes all the outputs of data science research including software, data sets, algorithms and infrastructures. Shine a spotlight on your work; submit your paper today. Code & ToolsMindsDBMindsDB is a predictive AI layer for existing databases that allows you to develop, train and deploy state-of-the-art ML models using SQL queries. This project is growing fast and there's a lot to explore here. kglab: Graph-Based Data ScienceThe kglab package provides an abstraction layer in Python for building knowledge graphs. It leverages idiomatic Python for common use cases in data science and data engineering work that require graph data. If you're just getting started or need help, see the Resources section. ResourcesDSPodsNice collection of podcasts about data science, machine learning, and ML engineering. As of this morning, there are 24 active podcasts on this page, covering a wide range of topics. Updated daily. Data VisualizationD3 Turns 10Ten years ago, the popular data viz library known as D3 was introduced to the world and empowered a new generation of experts to work with data in ways that weren't possible before. This anniversary post looks back at the stories and the people that brought D3 to life. For a gallery and Getting Started guide, see the D3
tag in the Data Elixir Archives. Data visualization using ggplot2 (intermediate)Take your ggplot2 skills to the next level with these new workshop materials. This is well organized, includes a linked index, and there are lots of code snippets with screenshots to make it useful as a reference. For introductory materials, follow the links at the top. Sign up to get Data Elixir's data science newsletter in your Inbox >> Data Elixir is curated and maintained by Lon Riesberg. If you have questions or suggestions for the newsletter, just reply back to this email. To find specific content from prior issues or to research topics, check out the catalogued Archives on Data Elixir's new Search Page >> |