Abstract: This session is a hands on workshop (with coding) to demonstrate how to gain observability (monitoring & alerting) for production machine learning pipelines. We will provide background on why observability is important to run successful MLOps, then walk through in detail how to set up a robust observability system.
Without a proper observability system, it is impossible to scale a successful machine learning effort. The session will provide ML engineering teams with the tools they need (all available in the open source ecosystem) to solve major visibility gaps in the machine learning lifecycle, including monitoring data quality, job statuses, ML model performance, and retraining.
The session will cover the end-to-end process, from data prep jobs running in Airflow, to model development and experimentation in Jupyter notebooks, to model serving in production.
The content covered will be of interest to data engineers and data scientists, including anyone who is working on machine learning projects.
We recommend that participants have strong backgrounds in python and at least high level knowledge of job orchestrators like Airflow, which are used to run automated data pipelines.
● Python libraries:
● Apache DBND
● Apache Airflow will be demod, but will not be required for attendees.
Bio: Coming Soon!Josh is Cofounder of Databand, an APM and observability solution for data engineering teams. Prior to founding Databand, Josh was a Product Manager at Sisense, a business analytics software startup. Josh led product on Sisense's ETL and database integration technologies as the startup scaled to over 700 team members and over 1,000 clients. Before Sisense, Josh worked in venture capital at Bessemer Venture Partners, where he focused on cloud infrastructure and machine learning investments.