Data Engineering in the Age of Data Regulations


A steady stream of data regulations such as GDPR, CCPA, DMA and many others is giving users control over how their data is used and restricting what companies can do with that data. This talk will focus on LinkedIn's approach to converting these regulations into policies and integrating policy enforcement into data engineering practices using our Policy Based Access Control (PBAC) system. It will cover how to annotate data, features, pipelines and models; how to integrate model training and inference with the PBAC system; and how to enforce policies. It will also describe the architecture and components of LinkedIn's governance system and the tools used to automate the annotation and enforcement process.

Session Outline:

LinkedIn plans to open source its PBAC system, but this probably will not happen by April. The talk will also reference the DataHub data catalog and FalDisco automated data classification projects.


Alex Gorelik is currently a Distinguished Engineer at LinkedIn and a former software engineer at Meta. Alex has spent his career helping enterprises use data. He is a Hitachi Fellow and IBM Distinguished Engineer, and a founder and CTO of three companies in the data space: Acta, an ETL company acquired by Business Objects and now marketed as SAP Data Services; Exeros, a data discovery company acquired by IBM and marketed as Infosphere Discovery; and Waterline Data, a pioneering Big Data catalog company acquired by Hitachi Vantara and marketed as Lumada Data Catalog. He also worked as an independent consultant, helping large enterprises around the world (British Aerospace, Royal Caribbean, IBM, Jysk, Unilever, Teradata and many others) develop their large-scale data projects and adopt big data technologies. When not developing or deploying industry-leading data technology, Alex speaks at industry conferences and authors articles in industry publications. Alex holds a B.S. in Computer Science from Columbia University School of Engineering and an M.S. in Computer Science from Stanford University.

Open Data Science
One Broadway
Cambridge, MA 02142
