Abstract: Continuous data regulations like GDPR, CCPA, DMA and many others are giving control to users over how their data is used and imposing restrictions on what companies can do with user data. This talk will focus on LinkedIn's approach to converting these regulations into policies and integrating policy enforcement in data engineering practices using our Policy Based Access Control (PBAC) system. It will cover how to annotate data, features, pipelines and models; how to integrate model training and inferences with the PBAC system; and how to enforce policies. It will describe the architecture and components of LinkedIn's governance system and various tools used to automate the annotation and enforcement process.
LinkedIn plans to open source its PBAC system, but probably will not happen by April. The talk will also reference DataHub data catalog and FalDisco automated data classification projects
Bio: Alex Gorelik is currently a Distinguished Engineer at LinkedIn and former software engineer at Meta. Alex spent his career helping enterprises use data. He is a HItachi Fellow and IBM Distinguished Engineer and a founder and CTO of three companies in data space: Acta - an ETL company acquired by Business Objects and now marketed as SAP Data Services, Exeros - a data discovery company acquired by IBM and marketed as Infosphere Discovery, and Waterline Data - a pioneering Big Data catalog company acquired by Hitachi Vantara and marketed as Lumada Data Catalog. He also worked as an independent consultant helping large enterprises around the world (British Aerospace, Royal Carribean, IBM, Jysk, Unilever, Teradata and many others) to help develop their large scale data projects and adapt big data technologies. When not developing or deploying industry leading data technology, Alex speaks at industry conferences and authors articles in industry publications. Alex holds a B.S. in Computer Science from Columbia University School of Engineering and a M.S. in Computer Science from Stanford University.