No video

33. Medallion Architecture and Change Data Feed

  Рет қаралды 13,187

CloudFitness

CloudFitness

Жыл бұрын

Follow me on Linkedin
/ bhawna-bedi-540398102
Instagram
www.instagram....
A medallion architecture is a data design pattern used to logically organize data in a lakehouse, with the goal of incrementally and progressively improving the structure and quality of data as it flows through each layer of the architecture (from Bronze ⇒ Silver ⇒ Gold layer tables). Medallion architectures are sometimes also referred to as "multi-hop" architectures.
Change Data Feed (CDF) feature allows Delta tables to track row-level changes between versions of a Delta table. When enabled on a Delta table, the runtime records “change events” for all the data written into the table. This includes the row data along with metadata indicating whether the specified row was inserted, deleted, or updated.
Data-bricks hands on tutorials
• Databricks hands on tu...
Azure Event Hubs
• Azure Event Hubs
Azure Data Factory Interview Question
• Azure Data Factory Int...
SQL leet code Questions
• SQL Interview Question...
Azure Synapse tutorials
• Azure Synapse Analytic...
Azure Event Grid
• Event Grid
Azure Data factory CI-CD
• CI-CD in Azure Data Fa...
Azure Basics
• Azure Basics
Data Bricks interview questions
• DataBricks Interview Q...

Пікірлер: 23
@aneeshmarathe7269
@aneeshmarathe7269 Жыл бұрын
Loved your explanation. Simple and to the point 👌
@bonasiuday
@bonasiuday Жыл бұрын
Crystal clear presentation 👏
@corybeyer1
@corybeyer1 2 ай бұрын
Another amazing video, thanks!
@suresh.suthar.24
@suresh.suthar.24 Жыл бұрын
Thank You mam for databricks series.
@atbplibrarychannel695
@atbplibrarychannel695 Жыл бұрын
Great Demo!
@sravankumar1767
@sravankumar1767 Жыл бұрын
Superb explanation 👌 👏
@moughosh3640
@moughosh3640 Жыл бұрын
Crystal Clear Explanation. You have hard coded the version of the table while merging , when you schedule in a pipeline , how to make it dynamic ? is it like get max ( _change_type) and then use it in merge?
@raviv5109
@raviv5109 Жыл бұрын
Hey thank you so much for creating this! One question - how can i always pick up the latest version of changes?
@shubhamchaturkar8324
@shubhamchaturkar8324 Жыл бұрын
Hi Bhavana, great explanation for CDF. Now when update happens on a table , table_changes contains two records update_preimage and update_postimage. I am more curious about how you handle these cases while taking table into next merge. because i can see you take table name and version only.
@vipinkumarjha5587
@vipinkumarjha5587 Жыл бұрын
Hi Thanks for the video on new feature of Data bricks. I have one question , you have hardcoded the version number in code. how can we pass the latest version dynamically in merge part. Thanks in advance
@felipecastro3710
@felipecastro3710 Жыл бұрын
Did you find the answer for that? The only way I can think of, would be to store a checkpoint somewhere for the latest processed _commit_version...
@priyankam3977
@priyankam3977 Жыл бұрын
@@felipecastro3710 how do you do that and I am looking for an answer for the same question.
@priyankam3977
@priyankam3977 Жыл бұрын
Great video and explanation. I have a question - In Cmd22- For Merge stmt we need to keep updating the version number from 2 to 3 and so on for each incremental load right based on the latest version. How do I do that automatically?
@vipuljain-ok9qy
@vipuljain-ok9qy Жыл бұрын
Thanks Bhawna for sharing the knowledgeable video.. Is it possible to share this code through sharing Apps like GIT so it would be easy to do hands-on
@obedvaldes3680
@obedvaldes3680 9 ай бұрын
When you selected from the silver table on the table_changes, you need to pick the postimage record only , correct ?
@philfang1822
@philfang1822 9 ай бұрын
At 18:08, 2nd insert of 7 rows, Six rows are on date "4/1/2021", One row on date "3/1/2021" and they are inserted, how come at 18:28, showing 7 rows inserted, but Six of them are on date "3/1/2021" and One row is on "4/1/2021" ?? I'd expect them to be the same dates as of 18:08 ? Am I missing something or I got something wrong ? Can someone explain to me please ?
@amulyakonala5036
@amulyakonala5036 11 ай бұрын
Great demo RIGHT , RIGHT ?
@ahmedtariqsilat
@ahmedtariqsilat Жыл бұрын
Hi Can you show how to implement this delta lake in AWS?
@jamesang8735
@jamesang8735 Жыл бұрын
Where can we get the tutorial codes?
@pranaydurvesula5891
@pranaydurvesula5891 Жыл бұрын
Is medillian architecture and delta live tables are same?
@priyankam3977
@priyankam3977 Жыл бұрын
That is my understanding as well.
@sunitabedi1230
@sunitabedi1230 Жыл бұрын
👌👍👍👍
@ahmedtariqsilat
@ahmedtariqsilat Жыл бұрын
Hi Can you show how to implement this delta lake in AWS?
34.  Change Data Feed Demo 02
12:55
CloudFitness
Рет қаралды 6 М.
What is Medallion Architecture? Scalable Data Lakes | 2023
20:57
Make With Data
Рет қаралды 14 М.
Harley Quinn's plan for revenge!!!#Harley Quinn #joker
00:49
Harley Quinn with the Joker
Рет қаралды 31 МЛН
Бутылка Air Up обмани мозг вкусом
01:00
Костя Павлов
Рет қаралды 2,2 МЛН
Kids' Guide to Fire Safety: Essential Lessons #shorts
00:34
Fabiosa Animated
Рет қаралды 13 МЛН
Simplify ETL pipelines on the Databricks Lakehouse
30:19
Databricks
Рет қаралды 25 М.
8.  Delta Optimization Techniques in databricks
20:41
CloudFitness
Рет қаралды 16 М.
Advancing Spark - Databricks Delta Change Feed
17:01
Advancing Analytics
Рет қаралды 14 М.
Lake House and Delta Lake the difference #dataengineering #spark
9:35
Evolution of Data Architectures and How to Build a Lakehouse
22:34
Harley Quinn's plan for revenge!!!#Harley Quinn #joker
00:49
Harley Quinn with the Joker
Рет қаралды 31 МЛН