Human-Friendly, Production-Ready Data Science with Metaflow

Abstract: 

There is a pressing need for tools and workflows that meet data scientists where they are. This is also a serious business need: How to enable an organization of data scientists, who are not software engineers by training, to build and deploy end-to-end machine learning workflows and applications independently. In this talk, we discuss the problem space and the approach we took to solving it with Metaflow, the open-source framework we developed at Netflix, which now powers hundreds of business-critical ML projects at Netflix and other companies from bioinformatics and drones to real estate. We wanted to provide the best possible user experience for data scientists, allowing them to focus on parts they like (such as LLMs and modeling using their favorite off-the-shelf libraries) while providing robust built-in solutions for the foundational infrastructure: data, compute, orchestration, versioning, and reactive ML systems. In this talk, we will share lessons learned from 100s of companies we've worked with at Outerbounds, and you will learn about:

* What to expect from a modern ML infrastructure stack.
* Using tools such as Metaflow to boost the productivity of your data science organization, based on lessons learned from Netflix and many other companies.
* Deployment strategies for a full stack of ML infrastructure that plays nicely with your existing systems and policies.
* How to build reactive machine learning systems;
* How to incorporate new technologies, such as Generative AI and LLMs, into pre-existing enterprise software stacks.

Bio: 

Ville has been developing infrastructure for machine learning for over two decades. He has worked as an ML researcher in academia and as a leader at a number of companies, including Netflix where he led the ML infrastructure team that created Metaflow, a popular open-source framework for data science infrastructure. He is a co-founder and CEO of Outerbounds, a company developing modern human-centric ML. He is also the author of the book Effective Data Science Infrastructure, published by Manning.

Open Data Science

 

 

 

Open Data Science
One Broadway
Cambridge, MA 02142
info@odsc.com

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Youtube
Consent to display content from - Youtube
Vimeo
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google