5 Probability Distributions you should know as a Data Scientist

  Рет қаралды 26,427

CodeEmporium

CodeEmporium

Күн бұрын

Here are a few probability distributions you should know as a data scientist
Follow me on M E D I U M: towardsdatascience.com/likeli...
JOIN US ON DISCORD: / discord
SPONSOR
Kite is a free AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I've been using Kite. Love it!
Learn more: www.kite.com/get-kite/?...
REFERENCES
[1] Where do we see normal distributions? stat.ethz.ch/~stahel/lognorma...
[2] Useful probability Distributions with applications: towardsdatascience.com/6-usef...
[3] Assumption of Normality explained very well here: www2.psychology.uiowa.edu/facu...
[4] Monte Carlo Simulations in practice: • Running Simulations as...

Пікірлер: 46
@rishisharma8311
@rishisharma8311 2 жыл бұрын
The real life examples for each of the distribution were amazing !!
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Glad you liked them. Many thanks :)
@hkumar7340
@hkumar7340 2 жыл бұрын
1:00 Normal Distribution 5:37 Log-normal Distribution 7:30 Uniform Distribution 8:48 Beta Distribution 10:33 Chi-squared Distribution.
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thanks for the timestamps :)
@mohammadrezaghiasy6618
@mohammadrezaghiasy6618 2 жыл бұрын
Hey buddy. Awesome as always. THANK YOU 💓
@harry8175ritchie
@harry8175ritchie Жыл бұрын
Counts are often distributed under a Poisson. The domain is very important to mention when selecting a distribution, and the discrepancy between probability mass functions and density functions. Keep it up man, love your stuff.
@CodeEmporium
@CodeEmporium Жыл бұрын
Thank you! More math videos to come!
@McMurchie
@McMurchie 2 жыл бұрын
Ahhh, there are like a million Data/ML channels but this is still the only one I subscribe to (after being burned a bit by Siraj). Love this guys ability to articulate complex phenomena in a way that makes sense.
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thanks for being a part of the community 🙂
@McMurchie
@McMurchie 2 жыл бұрын
@@CodeEmporium pleasure!
@bipinkapri9986
@bipinkapri9986 2 жыл бұрын
That was really helpful! Amazing content!
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Many thanks and very glad you enjoyed it :)
@ronin2963
@ronin2963 2 жыл бұрын
Nice summary of five different topics that could be their own lessons
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thank you. Will def dive into these topics in thier videos in some consumable form. I just need to think of the best way to deliver this content
@hamzadata
@hamzadata 5 ай бұрын
Man you are awesome!
@gokulkurup1584
@gokulkurup1584 2 жыл бұрын
Really good content
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Thank you :)
@scott7948
@scott7948 2 жыл бұрын
You missed tweedie distribution which is used in insurance modelling
@erickballesteros4531
@erickballesteros4531 2 жыл бұрын
good vid :)
@timz2917
@timz2917 5 ай бұрын
The sample means can still be normal even if the samples arent
@yensteel
@yensteel 2 жыл бұрын
Is there a way to create a custom probability distribution from a sample dataset? It can then generate new data with similar characteristics while remaining completely continuous?
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Yes that is possible. In python, scipy has distributions where you call a "fit" function and pass in sample data. For example, check out scipy.beta.fit.
@yensteel
@yensteel 2 жыл бұрын
@@CodeEmporium Thank you so much for the reply!
@harry8175ritchie
@harry8175ritchie Жыл бұрын
@@CodeEmporium Man, I feel like I'm really annoying here. I'm sorry! Be careful with this. Understand your data first: if there's any domain expertise you can throw into this, the data may be enforced to be a certain distribution, despite it not looking like it yet. For example: counting the number of times you see cars drive past your house within one hour blocks. Maybe you collect a handful of data. You notice a small tail at 2-5 cars, a peak at 7 cars, and a tail at 10 cars. You might think this is normal, but from the definition of the experiment, this is indeed a Poisson distribution: counting within set intervals.
@doristhebrowndog
@doristhebrowndog Жыл бұрын
how are y’all so smart… i left everything i learned about statistics back at where it started, at Uni :(
@CodeEmporium
@CodeEmporium Жыл бұрын
Honestly I did the same. But the more you work with this stuff on applications, the better you’ll remember it. :)
@monkyebrain
@monkyebrain 2 жыл бұрын
Weibull gang stand up!
@CodeEmporium
@CodeEmporium 2 жыл бұрын
You have been heard
@k.alipardhan6957
@k.alipardhan6957 2 жыл бұрын
start at 1:00
@k.alipardhan6957
@k.alipardhan6957 2 жыл бұрын
i think 4 & 5 needed much more details, as much as we got for 1. but good video, thank you
@SiyaMedia
@SiyaMedia 2 жыл бұрын
poison ooops we need to talk about the poisson distribution as well
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Another video for sure
@lucio8794
@lucio8794 2 жыл бұрын
My man, I love your videos, but the audio is often out of sync, just a heads up
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Yep. Thanks for the heads up. I'm trying to get better with this for future videos :)
@dragonman101
@dragonman101 2 жыл бұрын
does anyone else see a lag between audio and video?
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Sorry about that. It happens a couple of times through the video. Will try to correct for future videos
@dragonman101
@dragonman101 2 жыл бұрын
@@CodeEmporium no worries! :) I just couldn't tell if the issue was my computer or the video itself hahaha
@larrybird3729
@larrybird3729 2 жыл бұрын
no gamma :(
@shaelanderchauhan1963
@shaelanderchauhan1963 2 жыл бұрын
Data is is just a game of giving 100 different fancy names for the same concept to make it Extremely confusing for learners
@smokinghighnotes
@smokinghighnotes Жыл бұрын
sahi mai bhai
@tusharbedse9523
@tusharbedse9523 2 жыл бұрын
R u lipsing bro
@CodeEmporium
@CodeEmporium 2 жыл бұрын
Nah. It's your imagination
@tusharbedse9523
@tusharbedse9523 2 жыл бұрын
@@CodeEmporium thanks for replying.... Was watching sm of ur videos ...awseome stuff...thanks!!
@ssshukla26
@ssshukla26 2 жыл бұрын
One those videos where it's implicitly assumes that you know stats before hand and explicitly follow that assumption throughout the video...
@CodeEmporium
@CodeEmporium 2 жыл бұрын
I think only the normal distribution is technical here. The other 4 are a lot easier to pick up. Looking back, maybe could have easier explained the normal distribution. But I'll keep this mind for other videos
@kushagrachaturvedi2144
@kushagrachaturvedi2144 Жыл бұрын
when i hear u first time its very weird u r voice does not match u. means don't know why its feels like that u r lisping and someone else is talking
Data Scientist answers 30 Data Science Interview questions
21:45
CodeEmporium
Рет қаралды 26 М.
Teach me STATISTICS in half an hour! Seriously.
42:09
zedstatistics
Рет қаралды 2,7 МЛН
One moment can change your life ✨🔄
00:32
A4
Рет қаралды 18 МЛН
When You Get Ran Over By A Car...
00:15
Jojo Sim
Рет қаралды 29 МЛН
- А что в креме? - Это кАкАооо! #КондитерДети
00:24
Телеканал ПЯТНИЦА
Рет қаралды 7 МЛН
Incredible magic 🤯✨
00:53
America's Got Talent
Рет қаралды 77 МЛН
But what is the Central Limit Theorem?
31:15
3Blue1Brown
Рет қаралды 3,4 МЛН
Likelihood Estimation - THE MATH YOU SHOULD KNOW!
27:49
CodeEmporium
Рет қаралды 46 М.
Optimizers - EXPLAINED!
7:23
CodeEmporium
Рет қаралды 113 М.
5 concepts of A/B testing you should know as a Data Scientist
11:14
5 Concepts in Statistics You Should Know | Data Science Interview
20:48
The Beta distribution in 12 minutes!
13:31
Serrano.Academy
Рет қаралды 80 М.
Math you need as a Data Scientist
15:26
CodeEmporium
Рет қаралды 4,5 М.
Probability vs. Likelihood ... MADE EASY!!!
7:31
Brian Greco - Learn Statistics!
Рет қаралды 25 М.
The Kernel Trick - THE MATH YOU SHOULD KNOW!
7:30
CodeEmporium
Рет қаралды 170 М.
One moment can change your life ✨🔄
00:32
A4
Рет қаралды 18 МЛН