ISSUE 412 · November 8, 2022TrendsEverything I know about MastodonWho knows how things will ultimately play out but the data community on Twitter seems to be unraveling. Mastodon is an obvious Twitter alternative but it's different than Twitter. This is the best post I've seen on Mastodon, including details on where to go, how to set things up, key differences from Twitter, how to find the data community and more. Sponsored LinkInstant Access to Web Data | Bright Data’s NoCode SolutionWhen it comes to data analysis you’re #1 but your skills are only as good as your data. Bright Data is the world's leading web data collection platform covering everything from ready-made datasets to web scraping and proxies. Just for you, an exclusive offer for Data Elixir subscribers - 1 dataset refresh free of charge to make sure your data is as fresh as you are ;) Tutorials, Projects & OpinionsNo, you don’t need MLOpsOver the past year or so, MLOps has taken on a life of its own and many things that are sold as MLOps aren't needed for most teams. In this post, Lak Lakshmanan takes a look at the original problems, how they can be addressed in today’s ML frameworks, and why additional complexity beyond keep-it-simple solutions is often unnecessary. Caveats and Limitations of A/B Testing at Growth Tech CompaniesA/B tests are the gold standard of user testing, but there are some fundamental limitations that aren't always obvious. This is an insightful longread on things to watch for. Command-line data analytics made easySpyQL is a query language that combines the simplicity and structure of SQL with the power and readability of Python. It's lightweight, easy to use and will feel familiar if you already work with Python or SQL. Here's a practical tutorial for getting started. Code & ToolsRaster4MLRaster4ML is a python package that extracts ML-ready datasets from geospatial raster data and shapefiles. The package aims to aid geospatial researchers and scientists to extract meaningful features easily so they can focus more on model training and reproducibility. ResourcesForecasting: Principles and Practice (3rd ed)Great study guide and reference for forecasting methods. Examples use R with many datasets taken from the authors' own consulting experience. In this third edition, all chapters have been updated to cover the latest research and methods. Free to read online. CareerNew OpportunitiesThe Data Elixir Job Board currently has 47+ listings for remote positions, including roles for data scientists, data analysts, researchers, data architects, machine learning engineers, and more. The roles cover a variety of job levels, from mid-level to Director. If you're HIRING, join the Data Elixir Talent Collective and get regular drops of outstanding data practitioners and leaders who are open to new opportunities 👉Data VisualizationShould you stop using bullet graphs?Nick Desbarats has taught dashboard design to thousands of workshop participants and along the way, he noticed a variety of downsides with bullet graphs. In this post, he shows how bullet graphs work, typical issues, and he introduces an alternative chart type he calls "action dots" that are simpler and easier to understand. How to access, analyze & visualize your Twitter dataGreat notebook to help you download your Twitter data and analyze it locally in a searchable interface. There are a variety of visualizations here and everything is easy to modify and build on. Definitely, do this if you're thinking about leaving Twitter. Sign up to get Data Elixir's data science newsletter in your Inbox >> |