November 1st - 3rd, 2022
MLOps & Data Engineering
Focusing on the practice, management, workflows and dataops of data science
Track Sponsored by: 
Understand the MLOps & Practice of Data Engineering in the Real World
As data science extends its reach across an enterprise, the need for better management, workflow, production and deployment practices increases. The challenges of deploying and monitoring models in production, managing data science workflows and teams, and understanding ROI are a few of the issues organizations wrestle with.
Learn best practices for effective data science management
Sessions in this broad focus area will look at uses cases, best practices, and stories from the field to show how to effectively incorporate data science practice into the wider business process. This focus area will look beyond data sourcing and modeling towards the many challenges teams need to overcome to effectively apply data science in their organization.
Featured Past Speakers

Yaron Haviv
Yaron Haviv is a serial entrepreneur who has been applying his deep technological experience in data, cloud, AI and networking to leading startups and enterprise companies since the late 1990s. As the co-founder and CTO of Iguazio, Yaron drives the strategy for the company’s MLOps platform and led the shift towards the production-first approach to data science and catering to real-time AI use cases. He also initiated and built Nuclio, a leading open source serverless platform with over 4,000 Github stars and MLRun, Iguazio’s open source MLOps orchestration framework. Prior to co-founding Iguazio in 2014, Yaron was the Vice President of Datacenter Solutions at Mellanox (now NVIDIA), where he led technology innovation, software development and solution integrations. He was also the CTO and Vice President of R&D at Voltaire, a high-performance computing, IO and networking company which floated on the NYSE in 2007. Yaron is an active contributor to the CNCF Working Group and was one of the foundation’s first members. He presents at major industry events and writes tech content for leading publications including TheNewStack, Hackernoon, DZone, Towards Data Science and more.
MLOps Beyond Training: The Production-First Approach to AI(Track Keynote)

Nick Brown
Nick Brown is a Senior Data Scientist at IHS Markit working within the Engineering and Product Design business line. Currently working in AI based information extraction, Nick has worked on projects which generated millions of dollars of incremental value for organizations by applying data science and machine learning to pricing and purchasing optimization, competitive behavior analysis, and geographic demand seasonality. He is dedicated to educating people unfamiliar with data science to facilitate long-term success both within his organization and in the greater business community.
MLOps Spotlight: Scaling NLP Pipelines at IHS Markit(Track Keynote)

Chip Huyen
Chip Huyen is an engineer and founder working to develop tools that leverage real-time machine learning. Through her work with Snorkel AI, NVIDIA, and Netflix, she has helped some of the world’s largest organizations deploy machine learning systems. She teaches Machine Learning Systems Design at Stanford. She’s also published four bestselling Vietnamese books.
Unifying Development and Production Environments for Machine Learning Projects(Workshop)

Ketan Umare
Ketan Umare is the TSC Chair for Flyte (incubating under LF AI & Data). He is also currently the Chief Software Architect at Union.ai. Previously he had multiple Senior Lead roles at Lyft, Oracle and Amazon ranging from Cloud, Distributed storage, Mapping (map making), and machine learning systems. He is passionate about building software that makes developers and other engineers’ lives easier and provides simplified access to large scale systems. With Flyte, he is trying to bridge gap from ideation to productionization for data and ML pipelines and bring a battle tested approach and structure to the data and ML world.
Deep Dive into Flyte(Half-Day Training)

Thomas Fan
Thomas J. Fan is a Senior Software Engineer at Quansight Labs, working to sustain and evolve the PyData open-source ecosystem. He is a maintainer for scikit-learn, an open-source machine learning library written for Python. Previously, he worked at Columbia University, improving the interoperability between scikit-learn and AutoML systems. Thomas holds a Masters in Physics from Stony Brook University and a Masters in Mathematics from New York University.
Introduction to Scikit-learn: Machine Learning in Python(Training)
Intermediate Machine Learning with Scikit-learn: Evaluation, Calibration, and Inspection(Training)
Advanced Machine Learning with Scikit-learn: Text Data, Imbalanced Data, and Poisson Regression(Training)

Jimmy Whitaker
Jimmy Whitaker is the Data Science Evangelist at Pachyderm. He focuses on creating a great data science experience and sharing best practices for how to use Pachyderm. When he isn’t at work, he’s either playing music or trying to learn something new, because “You suddenly understand something you’ve understood all your life, but in a new way.
MLOps: From 0-60 with Pachyderm(Demo Talk)

Milecia McGregor
Milecia is a senior software engineer, international tech speaker, and mad scientist that works with hardware and software. She will try to make anything with JavaScript first. In her free time, she enjoys learning random things, like how to ride a unicycle, and playing with her dog.
Preventing Stale Models in Production(Talk)

Sourav Mazumder
Sourav Mazumder is an IBM Data Scientist Thought Leader and The Open Group Distinguished Data Scientist. Sourav has consistently driven business innovation and values through methodologies and Technologies related to Artificial Intelligence, Data Science and Big Data transpired through his knowledge, insights, experience and influencing skills across multiple industries including Manufacturing, Insurance, Telecom, Banking, Media, Health Care and Retail industries in USA, Europe, Australia, Japan and India. Over the last 10 years, he has influenced key decision makers of several fortune 500 companies to adopt Artificial Intelligence, Data Science, and Big Data related technologies to address complex business needs. Sourav has also consistently provided directions to and successfully led numerous challenging Artificial Intelligence, Data Science and Big Data projects, applying various related methodologies ranging from Descriptive statistics, Probabilistic Modelling, Algorithmic Modelling, Natural Language Processing, etc., to solve critical business problems. Sourav has also successfully partnered with academia within North America, India, South Africa to mentor students and enable them in this field. Sourav has experience and exposure in working with a variety of Artificial Intelligence, Data Science and Big Data related technologies such as Watson Open Scale, Watson Natural Language Processing, Watson Machine Learning, IBM Cloud Pak for Data, Spark, Hadoop, BigSQL, HBase, MongoDb, Solr, System ML, Cognos, R, Python, Scala/Java and using them in projects involving phases from creation of Minimum Viable Product to Productionization at an enterprise level. Sourav is an Open Source enthusiast and contributes to Open Source regularly. Sourav holds patents in the Data and AI space (patent profile https://patents.justia.com/search?q=Sourav+Mazumder). Sourav consistently publishes papers/blogs/articles in various industry forums. Sourav is co-author, guest editor and chief editor of multiple books in AI, Data Science and Big Data space (https://www.researchgate.net/profile/Sourav-Mazumder). Sourav is regularly invited to speak in various Industry conferences, like Open Data Science Conference, Spark Summit, IBM Think, Global AI Conference, etc in this subject area. He can be found on Linkedin (https://www.linkedin.com/in/souravmazumder/)
Operationalization of Models Developed and Deployed in Heterogeneous Platforms(Tutorial)

Filipa Peleja, PhD
Filipa Peleja is the Levi Strauss & Co Europe Lead Data Scientist at the Data Analytics & AI team. She has always been enthusiastic about technology where she first stepped into the tech world as an undergrad in Computer Science and later Ph.D. in the Machine Learning domain. Her academic accomplishments were recognized with the 1st prize of an industry challenge from a telco and publications in international conferences among which, top tier conferences like SIGIR and ACL. Before joining Levi, Filipa interned at Yahoo! Research and, later, worked as a Sr Data Scientist at Vodafone. Filipa loves to work in an area that she feels very passionate about and also enjoys passing along knowledge, hence, she lectures, supervises projects/thesis for CodeOp, Neueda and Barcelona Technical School.
MLOps… From Model to Production(Workshop)

Eduardo Blancas
Eduardo is interested in developing tools to deliver reliable Machine Learning products. Towards that end, he created Ploomber, an open-source Python library to compose production-ready data workflows. Eduardo holds an M.S in Data Science from Columbia University, where he took part in Computational Neuroscience research. Eduardo started his Data Science career in 2015 at the Center for Data Science and Public Policy at The University of Chicago.
Develop and Deploy a Machine Learning Pipeline in 45 Minutes with Ploomber(Talk)
What You’ll Learn
Data science has many focus areas. The goal of this track is to accelerate your knowledge of data science through a series of introductory level training sessions, talks, tutorials and workshops on the most important data science tools and topics.
Experimentation to Production
Agile Data Science
Data Science Architecture
Runtime Pipelines
Model Monitoring & Auditing
Automated Machine Learning
Debugging Machine Mearning
Kubeflow and Kubernetes
Distributed Computing
Data Science Workflows
Data Provenance & Governance
and many more…
Why Attend?
Accelerate and broaden your knowledge of key areas in data science, including deep learning, machine learning, and predictive analytics
With numerous introductory level workshops, you get hands-on experience to quickly build your skills
Post-conference, get access to recorded talks online and learn from over 100+ high quality recording sessions that let you review content at your own pace
Take time out of your busy schedule to accelerate your knowledge of the latest advances in data science practice and management
Learn directly from world-class instructors who are the authors and contributors to many of the tools and languages used in data science today
Meet hiring companies, ranging from hot startups to Fortune 500, looking to hire professionals with data science skills at all levels
Network at our numerous lunches and events to meet with data scientists, enthusiasts, and business professionals
Get access to other focus area content, including machine learning & deep learning, data visualization, and much more
Who Should Attend
Data Science is cross industry and cross enterprise, impacting many different departments across job roles and functions. This track is not only for data scientists of all levels but for anyone interested in the practice and management of data science, including:
Data scientists moving beyond model experimentation looking to understand production workflow
Data scientists seeking to improve the overall practice of management and development
Anyone interested in understanding better collaborative and agile management techniques as applied to data science
Business professionals and industry experts looking to understand data science in practice
Software engineers and technologists who need to work with data science workflows and understand the unique requirements of these systems
CTO, CDS, and other managerial roles that require a bigger picture view of data science
Technologists in the field of MLOps, databases, project management and others looking to break into data science
Students and academics looking for more practical applied training in data science tools and techniques
ODSC WEST 2022 - November 1st-3rd
Register and save 75%ODSC Newsletter
Stay current with the latest news and updates in open source data science. In addition, we’ll inform you about our many upcoming Virtual and in person events in Boston, NYC, Sao Paulo, San Francisco, and London. And keep a lookout for special discount codes, only available to our newsletter subscribers!