How to Select the BEST Threshold for Your Model Using the ROC Curve

6,057 views

DataMListic

A day ago

Comments: 11
@datamlistic a year ago
*AWKWARD VIDEO MISTAKE* : The two models used for disease detection and hiring at 05:06 should be switched in the ROC plot. Sorry for any confusion this mistake may have caused you when watching this and thanks @benedict6695 for pointing it out!
@mhaya1 8 months ago
Highly appreciated🙏
@datamlistic 8 months ago
Glad you enjoyed it! :)
@googlable 10 months ago
Thanks for the video. Did you end up making a video for ROC in multiclass too?
@datamlistic 10 months ago
Thanks for the feedback! I haven't started making that video yet, but it's on my list. Stay tuned. :)
@postnutclarity00 4 months ago
Can I use a similar approach for multiclass, but looking at the balanced accuracy metric?
@datamlistic 4 months ago
You can use a one-vs-all or one-vs-one approach when computing the ROC for multiclass. :)
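The one-vs-all idea mentioned above can be sketched as follows: each class in turn is treated as the positive class, all others as negative, and an ordinary binary ROC curve is computed from that class's predicted probability. This is only a minimal illustration in plain Python; the function names, the toy labels, and the probability table are assumptions made up for this example, not something taken from the video.

```python
def roc_curve_points(binary_labels, scores):
    """Return (FPR, TPR) points, sweeping the threshold from high to low."""
    pos = sum(binary_labels)
    neg = len(binary_labels) - pos
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(binary_labels, scores) if s >= t and y == 1)
        fp = sum(1 for y, s in zip(binary_labels, scores) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def one_vs_rest_roc(labels, probs, classes):
    """Compute one binary ROC curve per class (that class vs. all the others)."""
    curves = {}
    for k, cls in enumerate(classes):
        binary = [1 if y == cls else 0 for y in labels]  # current class is "positive"
        class_scores = [row[k] for row in probs]         # predicted probability of it
        curves[cls] = roc_curve_points(binary, class_scores)
    return curves


# Hypothetical toy data: true labels and per-class predicted probabilities.
labels = ["a", "b", "c", "a", "b", "c"]
probs = [
    [0.7, 0.2, 0.1],
    [0.2, 0.6, 0.2],
    [0.1, 0.3, 0.6],
    [0.5, 0.4, 0.1],
    [0.4, 0.5, 0.1],
    [0.3, 0.3, 0.4],
]
curves = one_vs_rest_roc(labels, probs, ["a", "b", "c"])
for cls, points in curves.items():
    print(cls, points)
```

Each per-class curve can then be used to pick a separate threshold for that class, in the same way as in the binary case.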
@benedict6695 a year ago
Hi, hope you are doing well! I'm still a bit confused. Don't you think that the models for disease detection and hiring are switched (05:06)?

We always take the alternative hypothesis, H1, to be the condition that we want to predict. In other words: innocent (H0) until proven guilty (H1). Hence:

*Disease detection model*
Assumed: H0 = No disease, H1 = Disease
It's okay to have more False Positives (rejecting a true H0) than False Negatives (accepting a false H0). In other words, it's okay for the model to flag more people as having the disease (H1) than as healthy (H0), since missing a real case is the more costly error. Thus, a higher False Positive Rate (FPR) is the acceptable trade-off in this case. The model should be placed "Upper Right" instead of "Lower Left".

*Hiring model*
Assumed: H0 = Do not hire, H1 = Hire
It's okay to have more False Negatives (accepting a false H0) than False Positives (rejecting a true H0). In other words, it's okay for the model to classify more candidates as "Do not hire" (H0) than as "Hire" (H1), since hiring underperforming people is the more costly error. Thus, a higher False Negative Rate is the acceptable trade-off in this case. The model should be placed "Lower Left" instead of "Upper Right" (Lower Left = low FPR = higher False Negative Rate).

Thanks in advance! Please correct me if I'm wrong :)
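The trade-off described in this comment can be checked on a small example: a low threshold accepts more false positives in exchange for fewer false negatives, and a high threshold does the opposite. The sketch below is only illustrative; the labels, scores, and the two threshold values are invented for this example, and label 1 stands for H1 (disease / hire).

```python
def confusion_rates(labels, scores, threshold):
    """Return (FPR, FNR) when predicting positive (H1) for score >= threshold."""
    tp = fp = tn = fn = 0
    for y, s in zip(labels, scores):
        predicted_positive = s >= threshold
        if y == 1 and predicted_positive:
            tp += 1
        elif y == 1:
            fn += 1  # true H1 case that the model missed
        elif predicted_positive:
            fp += 1  # true H0 case that the model flagged
        else:
            tn += 1
    fpr = fp / (fp + tn)
    fnr = fn / (fn + tp)
    return fpr, fnr


# Hypothetical toy data: 1 = H1 (disease / hire), 0 = H0.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.1, 0.3, 0.45, 0.6, 0.4, 0.55, 0.7, 0.9]

low = confusion_rates(labels, scores, 0.35)   # disease-detection style threshold
high = confusion_rates(labels, scores, 0.65)  # hiring style threshold
print("low threshold  (FPR, FNR):", low)
print("high threshold (FPR, FNR):", high)
```

On this toy data the low threshold gives a higher FPR and no missed positives (the upper-right region of the ROC plot), while the high threshold gives zero FPR at the cost of a higher FNR (the lower-left region).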
@datamlistic a year ago
Thank you so much for this detailed feedback, and so sorry that it took me so long to respond. I've been really busy at my job this week, and only today did I find a little spare time to go through this example again and check whether it's correct.

Well, after doing that, I have to awkwardly admit that you are indeed correct: the two models at 05:06 should be switched. If you look at 1:45, I say that you need to set a low threshold for the disease detection model and a high one for the hiring model. Then, starting at 03:25, I say that we start with the maximum threshold in the ROC curve and then decrease it. Finally, for some reason, at 05:06 I just switch the two models (don't ask me why; most likely I didn't pay enough attention when creating that part of the video).

All in all, I will pin a comment to this video explaining this mistake, so others don't get confused. Thanks again for pointing this out!
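The procedure described in this reply (start at the maximum threshold, decrease it, and stop at a low threshold for disease detection or a high one for hiring) can be sketched roughly as below. The helper names, the toy data, and the target values (`min_tpr=1.0`, `max_fpr=0.0`) are assumptions chosen for illustration, not values from the video.

```python
def roc_points(labels, scores):
    """Sweep thresholds from highest to lowest; return (threshold, FPR, TPR) tuples."""
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(float("inf"), 0.0, 0.0)]  # maximum threshold: nothing is positive
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(labels, scores) if s >= t and y == 1)
        fp = sum(1 for y, s in zip(labels, scores) if s >= t and y == 0)
        points.append((t, fp / neg, tp / pos))
    return points


def highest_threshold_with_tpr(points, min_tpr):
    """Disease-detection style: lower the threshold until TPR reaches min_tpr."""
    for t, fpr, tpr in points:
        if tpr >= min_tpr:
            return t, fpr, tpr
    return None


def lowest_threshold_with_fpr(points, max_fpr):
    """Hiring style: keep lowering the threshold only while FPR <= max_fpr."""
    best = None
    for t, fpr, tpr in points:  # FPR never decreases as the threshold drops
        if fpr <= max_fpr:
            best = (t, fpr, tpr)
    return best


# Hypothetical toy data: 1 = positive class, 0 = negative class.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.1, 0.3, 0.45, 0.6, 0.4, 0.55, 0.7, 0.9]

points = roc_points(labels, scores)
disease = highest_threshold_with_tpr(points, min_tpr=1.0)
hiring = lowest_threshold_with_fpr(points, max_fpr=0.0)
print("disease detection (threshold, FPR, TPR):", disease)
print("hiring            (threshold, FPR, TPR):", hiring)
```

With these toy numbers the disease-detection point ends up at a lower threshold with higher FPR and TPR (toward the upper right of the ROC plot), and the hiring point at a higher threshold with zero FPR (toward the lower left), which matches the corrected placement discussed in this thread.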
@benedict6695 a year ago
@datamlistic Thank you so much for all of your videos, mate. I've learned a lot. I really appreciate that you share this knowledge with all of us. Also, thank you for taking the time to respond to me and for making these videos. Keep up the good work, man! Cheers!
@datamlistic a year ago
Thank you so much for your kind words! I really appreciate it! Also, I am super happy that you find the content I make on this channel helpful! :)