Data Elixir logo

ISSUE 336   ·   May 18, 2021

 

Insight

Be Decision-Driven Not Data-Driven

Since launching its annual executive survey in 2012, NewVantage Partners has watched leading companies steadily invest in efforts to become more data-driven. Nearly 99% of reporting companies report active investments in data initiatives and yet, only 24% claim to be in a data-driven organization. Maybe being data-driven is the wrong goal...
Techno Sapien | Mark Palmer

 

Sponsored Link

Ray Summit: Scalable ML & AI for everyone

Ray Summit: Scalable ML & AI for everyone

Want to learn the best way to scale? Ray Summit brings together data scientists and engineers to build scalable ML & AI using Ray, the dominant platform for distributed computing. Learn about top trends in machine learning & AI, ML in production, reinforcement learning, cloud computing & more. Register to join live or on-demand.

 

Reach Data Elixir readers by sponsoring an issue. Click here for details.

 
 

Tutorials, Projects & Opinions

Why Dagster is the next-generation data orchestrator

Dagster is a data orchestrator for machine learning, analytics, and ETL. It's similar to Airflow but it handles each stage of the data life cycle differently. In this post, Nick Schrock compares the two systems and explores the advantages of using Dagster.
Dagster Blog | Nick Schrock

 
 
 

Using PostgreSQL as a Data Warehouse

With some tweaking, Postgres can be a great data warehouse. Here's why that's worth considering and how to configure it.
Narrator | Cedric Dussud

 
 
 

Good Data Scientist, Bad Data Scientist

Regardless of what part of the stack you work on, there are common traits that separate "good" data scientists from "bad" data scientists. This short post applies to most people in tech and is  good food for thought.
Ian Whitestone

 
 
 

What is a Vector Database?

Vector embeddings are fundamental parts of many recommendation and search algorithms and they've become increasingly important to machine learning applications. This post introduces vector embeddings, their unique needs and how, ultimately, a new type of database is needed.
Pinecone Blog | Rajat Tripathi

 
 
 

Scale Transform: The Present and Future of AI 

Scale Transform broke records and brought together more than 10,000+ leading researchers, practitioners, and executives. The conference featured an all-star line-up of 27 of the leading AI researchers and practitioners and 19 sessions from the latest research breakthroughs to the real-world impact across industries.
// sponsored

 

Code & Tools

Greykite for flexible, intuitive, and fast forecasting

Greykite is an open-source Python library that was developed to support LinkedIn’s forecasting needs. Its main forecasting algorithm, called Silverkite, is fast, accurate, and intuitive, making it suitable for interactive and automated forecasting at scale.
Linkedin Engineering Blog | Reza Hosseini

 
 
 

Data Profiler | What's in your data?

The DataProfiler is a Python library that makes it easy to extract schema, statistics and entities from your datasets. Data Profiles can then be used in downstream applications or reports.
GitHub | Capital One

 
 

Events

Data Week

General Assembly's Data Week is happening THIS WEEK! Sessions are scheduled throughout the week and cover things like bias detection, communicating with data, career development, playlist recommenders, and lots more. All sessions are online and free. 
General Assembly

 

Data Visualization

Introducing Dataflow, a self-hosted Observable Notebook Editor

Dataflow is a standalone notebook editor for Observable. Create, run, and compile Observable notebooks on your own machine! MIT license.
Oberservable | Alex Garcia

 
 
 

Falx: Visualization by Example

Falx is a visualization-by-example tool that uses small examples of a dataset to show how the full dataset should be visualized. This post is a nice introduction to the project. Follow the links for an online demo.
UCBerkeley RISELab

 

Sign up to get Data Elixir's  data science newsletter in your Inbox >>

 
Data Elixir logo

Data Elixir is curated and maintained by Lon Riesberg. If you have questions or suggestions for the newsletter, just reply back to this email.

 

To find specific content from prior issues or to research topics, check out the catalogued Archives on Data Elixir's Search Page >> 

 
FacebookTwitterLinkedInWebsite
 
 
Data Elixir, LLC
P.O. Box 21255
Boulder, CO 80308
Unsubscribe