Changelog Stream Processing with Apache Flink


Practically all business data is produced as an infinite stream of events: sensor measurements, website engagements, geo-location data from industrial IoT devices, database modifications, stock trades, and financial transactions, to name a few. Successful data-driven organizations go beyond just discovering valuable insights once but do so continuously and in real-time. But how do you leap analytics-based historical data to real-time insights on streams?

This talk will introduce Apache Flink as a general data processor for all these use cases on both finite and infinite streams. We demonstrate Flink's SQL engine as a changelog processor, with an ecosystem tailored to process CDC data and maintain materialized views. We will use Kafka as an upsert log, Debezium, to connect to databases and enrich streams of various sources using different kinds of joins. Finally, we illustrate how to combine Flink's Table API with DataStream API for event-driven applications beyond SQL.


Bio Coming Soon!

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google