Рет қаралды 619
This webinar covers aspects of creating change data capture (CDC) pipelines into a data lake, using Debezium to create an event stream from the source databases and Apache Iceberg tables as the destination.
Basics of Change Data Capture
Batch vs Stream
Binary Logs
Debezium
Message Structure
Exploring OP
Snapshot Read Events
Timestamps
To The Lake
Ordering + Consistency
Snapshot / The Merge
Soft vs Hard Deletion
Cost Control The Merge
Opinionated Architecture Takeaways