Master Dimensional Modeling Lesson 01 - Why Use a Dimensional Model?

  Рет қаралды 5,953

Bryan Cafferky

Bryan Cafferky

Күн бұрын

Dimensional Modeling is a popular and effective way to organize your data to maximize business value. In this video, you will learn what a Dimensional Model, aka
a Star Schema is and why you should use them to organize your data warehouse.
Support me on Patreon
www.patreon.com/bePatron?u=63...
Slides
github.com/bcafferky/shared/b...
Understanding Dimensional Modeling
• Dimensional Modeling
Should You Use a Data Vault for a Data Lake? by Advancing Analytics
• Behind the Hype: Shoul...
Databricks Blog: Dimensional Modeling on Databricks
www.databricks.com/glossary/s...

Пікірлер: 26
@TRZMac
@TRZMac 24 күн бұрын
As a 21 Year Old struggeling with these stuff, because we usually dont really get told what to exactly learn, I really appreciate your videos !
@BryanCafferky
@BryanCafferky 24 күн бұрын
Thank You!
@MDFeb11975
@MDFeb11975 5 ай бұрын
I watched and took notes from your older video on dimensional modeling. Best explanation I’ve watched! Looking forward to watching this series too
@BryanCafferky
@BryanCafferky 5 ай бұрын
Thanks!
@kushaleregowda
@kushaleregowda 5 ай бұрын
Thanks a lot Bryan. This is a great info and hard to find it so please continue the series..
@stu8924
@stu8924 5 ай бұрын
Thanks Bryan, I'm looking forward to your series. Take care.
@BryanCafferky
@BryanCafferky 5 ай бұрын
Thanks. You too.
@recs8564
@recs8564 25 күн бұрын
Very excited about this series
@raghuramsharma2603
@raghuramsharma2603 5 ай бұрын
loved it and enjoy ur voice modulation :)
@muhammadzakiahmad8069
@muhammadzakiahmad8069 4 ай бұрын
This is exactly we need.
@omarcruz6326
@omarcruz6326 6 күн бұрын
Great Material ! Thanks a lot.
@BryanCafferky
@BryanCafferky 4 күн бұрын
You're Welcome!
@coffeemusiccoding
@coffeemusiccoding Күн бұрын
Love this! With the fact tables that have the numeric values. Would these values be calculated in the transformation process in a data pipeline? For example data bricks?
@banihas22
@banihas22 5 ай бұрын
Yes!
@ericaleverson9430
@ericaleverson9430 Ай бұрын
I made a mistake in an interview today and confused the star schema with the 3 Normal Forms. I also stated star schema was normalization when it was denormalized...oh well.
@Milhouse77BS
@Milhouse77BS Ай бұрын
The important part of the star schema, the fact table, IS in third normal form. Only the dimensions are 2NF for simplicity and speed. Pure 3NF is not as important in a reporting database as it is an a transactional database. I like to say that a star schema is a reorganization, not a denormalization. The more important “denormalization“ is header detail tables into one table.
@ericaleverson9430
@ericaleverson9430 Ай бұрын
@@Milhouse77BS Thank you for clearing this up for me!
@mindyourown6320
@mindyourown6320 24 күн бұрын
What position did you apply for?
@ericaleverson9430
@ericaleverson9430 24 күн бұрын
@@mindyourown6320 Sr Healthcare Data Analyst. Just did a 2nd interview with them today.
@Mithali00
@Mithali00 3 ай бұрын
Thanks for the video! Would you always choose dimensional modeling when the goal is to support reporting and analytics? I ran into a situation where the analysts would rather use Salesforce's source data model with some minimal denormalization than its star schema version. They believe it would actually introduce MORE joins for them as one table - e.g. Opportunity - is now split into a fact and a dimension table so they don't see the benefit. How would you go about demonstrating the benefits for the end users in such a case? Thank you!
@BryanCafferky
@BryanCafferky 3 ай бұрын
It is not always necessary to use a Star Schema but usually a better option than an OLTP model. If the purpose is to use the data by Power BI then a Star Schema is highly recommended by Microsoft. I'm not clear on Tableau. However, without a star schema, you can't easily support dimension history, i.e., SCD 2. So if a store changes from one region to another, all data related to the store is updated at that point in time which means reports a year ago don't match new reports year over year and regional sales comparisons will not longer be valid. If the business does not care about that and are Ok with a changing snapshot of the data, then the next thing is to look at the complexity of the model. If it's just a few tables, it may not matter but if you need to join many tables and nest joins like Opportunity to Order to Product to Product Category, then it will likely not perform well. Best bet is to test both models against what you expect the primary queries to be like and see how they hold up. It comes down to what delivers the most value to the business.
@ciobanuradumihnea5793
@ciobanuradumihnea5793 5 ай бұрын
Hello, thank you for the video, very informative. Im just wondering isnt there a risk of table deadlock since the table FactResellerSaler is always the entrypoint for most of the queries ? Thank you for your response :).
@BryanCafferky
@BryanCafferky 5 ай бұрын
Good question. Only possible during table loads but that should be done one table at a time to avoid the issue. Main thing is the Fact table must be updated after the dimension tables. Reading the tables will not cause deadlocks.
@dertrickwinn7982
@dertrickwinn7982 3 ай бұрын
When you are describing what's on the screen, can you please use the mouse to point to what you are referring to please.
@BryanCafferky
@BryanCafferky 3 ай бұрын
It can be tricky using mouse pointers in the recordings. I've experimented with some. I usually animate slide bullet points so you focus on the current bullet point. Which specific parts of the presentation were you confused by?
@marcin2x4
@marcin2x4 4 ай бұрын
Snowflake doesn't enforce Keys, they are only informative. That forces devs and architects to handle data quality on their end thus dim model might be daunting to implement, still doable :)
Master Dimensional Modeling Lesson 02 - The 4 Step Process
14:25
Bryan Cafferky
Рет қаралды 2,5 М.
Data Modeling Tutorial: Star Schema (aka Kimball Approach)
16:34
Kahan Data Solutions
Рет қаралды 104 М.
Double Stacked Pizza @Lionfield @ChefRush
00:33
albert_cancook
Рет қаралды 125 МЛН
Советы на всё лето 4 @postworkllc
00:23
История одного вокалиста
Рет қаралды 4,9 МЛН
World’s Largest Jello Pool
01:00
Mark Rober
Рет қаралды 115 МЛН
Understand Slowly Changing Dimensions
23:21
Bryan Cafferky
Рет қаралды 20 М.
Dimensional Modeling
53:54
Bryan Cafferky
Рет қаралды 166 М.
Power BI - Multiple Fact Tables
33:57
PowerÆgg
Рет қаралды 401 М.
NEW 2024.2|Multi-fact Relationship Data Models -Tableau
7:02
DanceSingwith Data - BI & Analytics
Рет қаралды 810
Data Modeling in the Modern Data Stack
10:14
Kahan Data Solutions
Рет қаралды 98 М.
Database vs Data Warehouse vs Data Lake | What is the Difference?
5:22
Alex The Analyst
Рет қаралды 759 М.
ноутбуки от 7.900 в тг laptopshoptop
0:14
Ноутбуковая лавка
Рет қаралды 3,7 МЛН
Как противодействовать FPV дронам
44:34
Стратег Диванного Легиона
Рет қаралды 137 М.
КРАХ WINDOWS 19 ИЮЛЯ 2024 | ОБЪЯСНЯЕМ
10:04
Это - iPhone 16!
16:29
Rozetked
Рет қаралды 463 М.