Dr. Einat Orr

Dr. Einat Orr

Co-Founder & CEO at Treeverse

    Einat Orr has 20+ years of experience building R&D organizations and leading the technology vision at multiple companies, the latest being Similarweb, that IPO in NYSE last May. Currently she serves as Co-founder and CEO of Treeverse, the company behind lakeFS, an open source platform that delivers a git-like experience to object-storage based data lakes. She received her PhD. in Mathematics from Tel Aviv University, in the field of optimization in graph theory.

    All Sessions by Dr. Einat Orr

    West Talks 07/23/2024

    Don't Go Over the Deep End: Building an Effective OSS Management Layer for Your Data Lake

    <span class="etn-schedule-location"> <span class="firstfocus">Generative AI</span> <span class="secfocus">Intermediate</span> </span>

    Managing a data lake with both structured and unstructured data sometimes feels like diving int a deep abyss, especially for beginners. But it doesn't have to be that way! This talk offers a high-level overview of tools and strategies to enhance data lake manageability—without going off the deep end. We'll start by exploring fundamental challenges, focusing on the different needs of structured versus unstructured data where each requires its own distinct approach. We'll dispel some of the chaos by covering the key components of a robust data lake management architecture, including open table formats, catalogs, and data version control systems. By understanding these components, you'll see how they contribute to an organized data lake environment, helping you avoid feeling like you're constantly treading water. We'll present real life data lake architectures using Databricks, Apache Iceberg, and AWS technologies to show how these components integrate seamlessly in a modern data engineering stack. If time allows, we will conclude with an open discussion, encouraging attendees to share their experiences (read: rants) and challenges, so you can feel less alone in the murky waters of the multi-structure data lake, and come away with practical methods for data lake manageability.

    Open Data Science




    Open Data Science
    One Broadway
    Cambridge, MA 02142

    Privacy Settings
    We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
    Consent to display content from - Youtube
    Consent to display content from - Vimeo
    Google Maps
    Consent to display content from - Google