Netflix Data Engineering Tech Talks - Psyberg, An Incremental ETL Framework Using Iceberg

4,972 views

Netflix Data

8 months ago

Abhinaya Shetty and Bharath Mummadisetty, Data Engineers from Netflix’s Membership Data Engineering team, introduce Psyberg, an incremental ETL framework. Learn how Psyberg leverages Iceberg metadata to handle late-arriving data and improve data pipelines while simplifying on-call life!
#netflix
#datascience
#dataengineering
#etl
#bigdata

Comments: 3
@rajvellaturi 8 months ago
This is such an exciting talk. I faced this problem in my experience working as a DE. Identifying and reprocessing those late-arriving records is resource-intensive and time-consuming for sure. Thanks to Iceberg for making it easy and possible to put together a solution with the help of metadata.
@iirdna 6 months ago
How do you avoid too high a tide of changes? Meaning: does any late-arriving data trigger Psyberg, even just a few thousand rows? Or do you accumulate changes at some sort of gates/elevators and process them once enough late data has accumulated to justify downstream reprocessing?
@ViralDave_26 8 months ago
👍👍