No video

Random Forests : Data Science Concepts

  Рет қаралды 46,387

ritvikmath

ritvikmath

Күн бұрын

Пікірлер: 88
@saitaro
@saitaro 3 жыл бұрын
2-hour lecture in 15 minute. ritvik rocks.
@SPeeDKiLL45
@SPeeDKiLL45 2 жыл бұрын
i swear
@yuqingliu8412
@yuqingliu8412 Жыл бұрын
I swear
@mikeshinoda2806
@mikeshinoda2806 5 ай бұрын
1 semester in 15 minutes :)
@azarel7
@azarel7 2 жыл бұрын
Great video. 1) Spoke well and explained the concepts clearly 2) Threw and caught the marker every time, with no interruption in speech while doing so. Bravissimo!
@jpark7636
@jpark7636 3 жыл бұрын
This is the best video to understand random forest in KZfaq so far for me :))
@mosama22
@mosama22 2 жыл бұрын
I'm studying Data Science at MIT, you really can't imagine man how much "ritvikmath" is helping me, and a couple more channels, before I start any topic I like to tackle it first or just take a general idea, and you can't imagine how much your videos helped! Short, concise, and to the point! Thank you man 🙂 Just one notice, It might be a good idea to choose an easy to remember / clear channel name, sometimes when I'm talking to someone, it is almost impossible to remember the name of your channel, just a clear name with spaces! Thank you again 🙂
@MayankGoel447
@MayankGoel447 3 жыл бұрын
Thanks for the video! This is definitely the best explaination of Random Forest I have seen yet. I'm really enjoying learning Data Science from you
@ritvikmath
@ritvikmath 3 жыл бұрын
Awesome, thank you!
@ashmitas
@ashmitas 3 ай бұрын
thanks, well explained to a beginner like me. appreciate how a complex method was easily explained using a basic whiteboard and a relatable example.
@alsjeu
@alsjeu 4 ай бұрын
i reeeeally liked the pen flip at 3:27!! keep up the great work!
@ritvikmath
@ritvikmath 4 ай бұрын
Thank you so much!
@jarrelldunson
@jarrelldunson 3 жыл бұрын
Ritvik, hey, thank you... this was really, really helpful - a great explanation, Jarrell
@ritvikmath
@ritvikmath 3 жыл бұрын
Glad it was helpful!
@smishra115
@smishra115 2 жыл бұрын
all hail the marker juggler and his short, crisp, easy to understand videos! Keep it up dude.
@juliocjacobo
@juliocjacobo 3 жыл бұрын
Concise and right to the point, as always. Thank you!
@jimlanzi6802
@jimlanzi6802 3 жыл бұрын
Very well organized and well put together. Simplified enough for the medium, but included just the right amount of detail to guide one in their further pursuits of the topic. Thank you.
@edmundoguerramendoza7465
@edmundoguerramendoza7465 3 жыл бұрын
Ritvik, once again you do an amazing job simplifying concepts in short periods of time, while still making them very understandable. Thanks!!
@BO2Letsplay
@BO2Letsplay 9 ай бұрын
I'm trying to learn some ML content as it relates to classification to quite a large degree, and just want to say that this video on Random Forest is one of the only ones that actually made sense to me as a layman! Thank you
@qiguosun129
@qiguosun129 2 жыл бұрын
Great lecture, help me recall random forest when I am learning the causal forest
@preetikharb8283
@preetikharb8283 3 жыл бұрын
This is THE BEST explanation of Random forest!! Thank you Ritvik :)
@t_geek9211
@t_geek9211 3 жыл бұрын
Wow! You are really good at explaining stuff! That was amazing!
@ritvikmath
@ritvikmath 3 жыл бұрын
Glad you think so!
@yuqingliu8412
@yuqingliu8412 Жыл бұрын
My favorate and best teacher in KZfaq !
@Chillos100
@Chillos100 2 жыл бұрын
He’s simply the best!! Thanks for all your effort
@internetuser2399
@internetuser2399 2 жыл бұрын
this is some high quality content. you deserve more views! great teacher.
@sourishguntipally8257
@sourishguntipally8257 9 ай бұрын
This was an amazing video and super well made. It's astonishing how this material is free to learn from!
@danspeed93
@danspeed93 2 жыл бұрын
First time I see this way of computing feature importance, thanks!
@haninalkabbani7766
@haninalkabbani7766 3 жыл бұрын
I can't describe how good your explanation is !!!
@ritvikmath
@ritvikmath 3 жыл бұрын
thanks!
@NikBearBrown
@NikBearBrown 6 ай бұрын
The algorithm described is random sampling, not bagging. Not bootstrap samples are being created as described.
@jasdeepsinghgrover2470
@jasdeepsinghgrover2470 3 жыл бұрын
Amazing video... You can also cover parts like random projections... That's something which can make them much more interesting.
@ritvikmath
@ritvikmath 3 жыл бұрын
That's a great idea!
@storyteller1900
@storyteller1900 2 жыл бұрын
This is an amazing class. It contains all the important parts of random forests.
@jamesbrown6591
@jamesbrown6591 Жыл бұрын
This is the best explanation I’ve found, thank you 🙏
@ritvikmath
@ritvikmath Жыл бұрын
Glad it was helpful!
@dinhnguyenvo3040
@dinhnguyenvo3040 3 жыл бұрын
You are godly easy to follow, big thank you from my heart
@ritvikmath
@ritvikmath 3 жыл бұрын
Glad I could help!
@SethuIyer95
@SethuIyer95 9 ай бұрын
Using associative rule mining and extracting all the rules from all decision trees we can interpret random forests.
@nelsonk1341
@nelsonk1341 Жыл бұрын
Best DS KZfaqr
@millenaalves4169
@millenaalves4169 4 ай бұрын
what an awesome video! congrats, really helpful
@pluu153
@pluu153 Жыл бұрын
Big Thanks for a clear explanation!!!
@extcresources531
@extcresources531 2 жыл бұрын
This is gold.. pure gold!!
@hameddadgour
@hameddadgour 2 жыл бұрын
Fantastic presentation!
@loveena419
@loveena419 3 жыл бұрын
Wow great explanation - I am hooked on these videos. Get the main points in a short timeframe - would be nice to have a video on Tuning RF and other ML algorithms. And the pre-req videos are very useful to have the right background to understand this one. Thank you!
@janpieterwagenaar1608
@janpieterwagenaar1608 3 жыл бұрын
Ritvikmath, i would like to complement you with the clear direct explanation video's. you make it easily accessable and clear with practical examples. please keep it up. Kind regards, Jan Pieter Wagenaar
@ritvikmath
@ritvikmath 3 жыл бұрын
You are most welcome!
@leoliao3389
@leoliao3389 11 ай бұрын
Thank you ritvik!! This video is so helpful!!
@ForcesOfOdin
@ForcesOfOdin 2 жыл бұрын
Loved the interpretability of the random forests idea! Very clever / useful. I'm guessing that you would want to reshuffle the dth feature for each i to avoid the effect that the shuffled data accidentally correlates with an important feature.
@user-lh8wy5yb2x
@user-lh8wy5yb2x Жыл бұрын
Fabulously concise and accurate!!!
@keshavsharma267
@keshavsharma267 3 жыл бұрын
Thanks for the video. can you also explain interpretability via LIME and Shapely values?
@ritvikmath
@ritvikmath 3 жыл бұрын
Great suggestion!
@ziaurrahmanutube
@ziaurrahmanutube 3 жыл бұрын
Love your videos, very helpful and well explained
@ritvikmath
@ritvikmath 3 жыл бұрын
Happy to hear that!
@Fat_Cat_Fly
@Fat_Cat_Fly 3 жыл бұрын
amazing video!! really helpful, thanks!!!
@ritvikmath
@ritvikmath 3 жыл бұрын
Glad it was helpful!
@kevinshao9148
@kevinshao9148 4 ай бұрын
Thanks for the great video! Do you have a video or any recommend for RandomForest on Regression math derivation? Thank you!
@vladimirkirichenko1972
@vladimirkirichenko1972 Жыл бұрын
excellent vid! thank you.
@karunasrees7402
@karunasrees7402 2 жыл бұрын
Thanks Professor , your explanation is very good. I am really enjoying your videos and they are helping me to focus on DS. I have seen many videos prior they only mention Idea 1 - Bagging and say it is Random Forests. But you have mentioned Idea 2 - Random Subspaces as well. Just to confirm on it , do the Random Forests use both the ideas ? Do you mean that Bagging + Random Subspaces = Random Forest ? If possible can you explain how to code it ? Thanks for your time on videos ! Many of your videos are good , even your Bias-Variance video is also super.
@geoffreyanderson4719
@geoffreyanderson4719 2 жыл бұрын
Yo I heard the RF is bad alone and needs help when: 1) a strongly predictive linear feature exists in X. You gotta help the RF out by either feeding it the residuals from running the linear model on that feature first, so each model in the ensemble can do what it does best, the linear doing linear things and the nonlinear RF doing nonlinear things. Or else just preprocess to create an additional feature which is just the output of the linear model, and give the whole augmented feature set to the RF now. 2) 2nd order associations are expected to be important, because despite its subsampling of feature space, the RF is actually NOT good at automatically finding 2nd order predictive associations in X. THus we should help the RF out by doing some feature engineering of the 2nd order terms in advance into the X and then give it to the RF NOW. Further it might help still more by telling RF to stop using the typical 0.5 ratio default of subspace sampling and instead just focus on exactly 2 columns at a time, no more, no less, forcing it to look much closer at all the 2nd order associations that you expect should be found by the RF. These are hear-say and hypotheses. It would be cool to see how to do it in sklearn's pipeline on a dataset like "jewellery" which is used for demo code by the pycaret library. Jewellery has a strongly predictive feature "carets" or "weight" in its X. But they just look at trees alone in their model search, so I think it can be improved by helping out the fancy nonlinear tree models as described above.
@squib3083
@squib3083 Жыл бұрын
Awesome explanation thank you
@TheBalhamboy
@TheBalhamboy 3 жыл бұрын
Just found your channel. Really well explained. Thanks :)
@joycwang
@joycwang 2 жыл бұрын
great explanation much easier to understand
@samuelharris4509
@samuelharris4509 Жыл бұрын
Why do we need the accuracy value on the 20% for each tree? Does that help with some weighted average?
@TheMarComplex
@TheMarComplex 2 жыл бұрын
Thank you!
@roopanjalijasrotia3946
@roopanjalijasrotia3946 2 жыл бұрын
This is great! How about a point or two about the pitfalls of using random forest for time series
@annikamoller7673
@annikamoller7673 3 жыл бұрын
what a great explanation, thanks man :)
@ritvikmath
@ritvikmath 3 жыл бұрын
You're welcome!
@ahmetcihan8025
@ahmetcihan8025 3 жыл бұрын
This is it. Thank you so much .
@trin1721
@trin1721 2 жыл бұрын
Can't we get the feature importance for free, without permuting, by looking at the accuracies of models trained with and without certain features (in the random subspace step)?
@TawhidShahrior
@TawhidShahrior 2 жыл бұрын
you are a legend
@beshosamir8978
@beshosamir8978 2 жыл бұрын
u r incredibly amazing ,but i have 2 questions : 1- What is the meaning of when i use all features the tree will be correlated to each other, i know what is the meaning of 2 features are correlated ,but what is mean when i say 2 trees are correlated ????? 2- when i need to determine how much a specific feature is important now , i trained the model using 80% of the dataset and now do i get the accuracy of this (80% dataset) of the dataset and after that shuffle my specific column and get the accuracy again of 80% of the data after shuffling then subtract them ? or i'm using 20% for both ? but u said in the video u r get the accuracy of the data that made that tree so u almost talking about the 80% , it make no sense for me using 20% of the dataset
@YingjieWu-dt8vm
@YingjieWu-dt8vm 10 ай бұрын
great video
@neuodev
@neuodev 2 жыл бұрын
You are aweomse!
@niknoor4044
@niknoor4044 3 жыл бұрын
Great!
@manueltiburtini6528
@manueltiburtini6528 2 жыл бұрын
I love it!
@SuperHddf
@SuperHddf 2 жыл бұрын
THX!
@warpathcucucu
@warpathcucucu 3 жыл бұрын
your're the goat
@KennieinKorea
@KennieinKorea Жыл бұрын
impressive, thanks
@ritvikmath
@ritvikmath Жыл бұрын
Glad you liked it!
@Ostiosti
@Ostiosti 3 жыл бұрын
Great video. But why permute on the training data and not on the test data? This should also show the importance of the feature, right?
@beniborukhov9436
@beniborukhov9436 3 жыл бұрын
I think that it's since we're trying to focus specifically on the importance of each feature to the model. We're avoiding adding the additional variable of how well the model generalizes and therefore works on the test data so we can see the features' contribution to the model's accuracy under ideal conditions.
@leroychiyangwa8320
@leroychiyangwa8320 Жыл бұрын
kkkk i like the entrance style
@Makako_Loko
@Makako_Loko 3 ай бұрын
Oh my deus thank you
@TheProblembaer2
@TheProblembaer2 11 ай бұрын
I love you.
@Fordalo
@Fordalo Жыл бұрын
you are goated
@beatss8286
@beatss8286 3 жыл бұрын
Thank you!
@ritvikmath
@ritvikmath 3 жыл бұрын
You're welcome!
Cross Validation : Data Science Concepts
10:12
ritvikmath
Рет қаралды 37 М.
Gradient Boosting : Data Science's Silver Bullet
15:48
ritvikmath
Рет қаралды 60 М.
Parenting hacks and gadgets against mosquitoes 🦟👶
00:21
Let's GLOW!
Рет қаралды 13 МЛН
Incredible Dog Rescues Kittens from Bus - Inspiring Story #shorts
00:18
Fabiosa Best Lifehacks
Рет қаралды 28 МЛН
OMG what happened??😳 filaretiki family✨ #social
01:00
Filaretiki
Рет қаралды 13 МЛН
Random Forest Algorithm Clearly Explained!
8:01
Normalized Nerd
Рет қаралды 592 М.
Hidden Markov Model : Data Science Concepts
13:52
ritvikmath
Рет қаралды 117 М.
StatQuest: Random Forests Part 1 - Building, Using and Evaluating
9:54
StatQuest with Josh Starmer
Рет қаралды 1,1 МЛН
AdaBoost, Clearly Explained
20:54
StatQuest with Josh Starmer
Рет қаралды 750 М.
What the Heck is Bayesian Stats ?? : Data Science Basics
20:30
ritvikmath
Рет қаралды 64 М.
Bias-Variance Tradeoff : Data Science Basics
12:25
ritvikmath
Рет қаралды 49 М.
Soft Margin SVM : Data Science Concepts
12:29
ritvikmath
Рет қаралды 49 М.
Decision Tree Classification Clearly Explained!
10:33
Normalized Nerd
Рет қаралды 653 М.
Bayesian Linear Regression : Data Science Concepts
16:28
ritvikmath
Рет қаралды 78 М.
Parenting hacks and gadgets against mosquitoes 🦟👶
00:21
Let's GLOW!
Рет қаралды 13 МЛН