How to Supercharge Spark with Apache Iceberg


This talk is about using Iceberg tables with Spark to make data products more reliable, easier to maintain, and more performant. This will include an overview of Iceberg tables and how they help you avoid data correctness problems, as well as Iceberg's SQL extensions that make you more efficient.


Ryan is the co-creator of Apache Iceberg and spent the last decade working on big data infrastructure at Netflix, Cloudera, and now Tabular. He is an ASF member and a committer in the Apache Parquet, Avro, and Spark communities.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google