Getting Data Ready for Data Science with MLflow and Delta Lake


Planning a data science initiative calls for a holistic view of the entire data analytics landscape. Data engineering is a key enabler of data science: it supplies reliable, high-quality data in a timely fashion. Delta Lake, an open-source storage layer that brings reliability to data lakes, can help improve the reliability of your data. MLflow is an open-source platform for managing the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.

Session Outline:
In this session you will learn about:
* The data science lifecycle
* The importance of data engineering to successful data science
* Key tenets of modern data engineering
* How Delta Lake can help make reliable data ready for analytics
* The ease of adopting Delta Lake for powering your data lake
* How to incorporate MLflow and Delta Lake into your data infrastructure to enable data science


Open Data Science




