R-CNN: Clearly EXPLAINED!

  Рет қаралды 22,262

Soroush Mehraban

Soroush Mehraban

Күн бұрын

In this video, we understand how R-CNN works and become familiar with the basics of object detection.
Paper link: arxiv.org/abs/1311.2524
Table of content:
00:00 - Our Goal
01:20 - Designing the model
03:36 - Object Localization
03:58 - Earliest Approach
06:33 - R-CNN
09:47 - Non-max Suppression
12:00 - Mean Average Precision
Icon made by Freepik from flaticon.com

Пікірлер: 83
@senpanwu5163
@senpanwu5163 14 күн бұрын
Great Work! You explained 1000 times better than my uni lecturer :D
@bhavanamalla954
@bhavanamalla954 7 ай бұрын
Such a great video!! Keep them coming!
@Hansly_rz
@Hansly_rz Ай бұрын
oh my it explains everything at once! Thank you for making this video!
@asamoahjeffrey6343
@asamoahjeffrey6343 4 ай бұрын
One of the best videos I have watched. Very detailed Explanations. Keep up the good work
@soroushmehraban
@soroushmehraban 4 ай бұрын
Thanks 🙂
@gotagando2449
@gotagando2449 Жыл бұрын
Great work. I like how you made youtube chapters to explain independent techniques like NMS. Really useful. Many people don't have the time to go through papers in details and just run the codes to get things done. Your videos could be helpful to solve that problem. I'm personally hoping to see videos on YOLO series especially the YOLOX model :) You could also talk about the object detection models landscape and how each model has pros/cons w.r.t. inference time (FPS) and performance.
@soroushmehraban
@soroushmehraban Жыл бұрын
Wonderful feedback, Gota. I'll make sure to create them in the future
@layer8man
@layer8man Жыл бұрын
Very nice! I can't wait to see more videos like this!
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Jeffrey! Wait for the better ones then 😄
@umekawaibu4838
@umekawaibu4838 5 ай бұрын
Thanks for your work! It's helps me a lot! Appreciate that~
@Broadsword07
@Broadsword07 Жыл бұрын
This is great. Nice work!! Waiting for more such videos.
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Raghuveer! Appreciate it.
@user-vn5dr6xp9e
@user-vn5dr6xp9e 4 ай бұрын
Simple and easy to understand! Thank you for making this video :)
@soroushmehraban
@soroushmehraban 3 ай бұрын
Glad it was helpful!
@ahmedjawadrashid666
@ahmedjawadrashid666 4 ай бұрын
Such an underrated video. Well done mate!
@soroushmehraban
@soroushmehraban 3 ай бұрын
Glad you enjoyed it!
@Vinay1272
@Vinay1272 7 ай бұрын
Thanks a lot for this! It was really clean and precisely explained. mAP explanation was on point.
@soroushmehraban
@soroushmehraban 7 ай бұрын
Glad you liked it!
@wtcbretburstjk3726
@wtcbretburstjk3726 2 ай бұрын
clean explanation give this man more sub !
@santoshkamble1290
@santoshkamble1290 6 ай бұрын
Great explanation❤
@ericsy78
@ericsy78 Жыл бұрын
Cool! Nice work💥
@amirparsa_s
@amirparsa_s Жыл бұрын
Good job Soroush, Very nice video! It helped me a lot specially to understand the mAP metric. Just Keep going :)
@soroushmehraban
@soroushmehraban Жыл бұрын
Glad you liked it :)
@MuhammadArnaldo
@MuhammadArnaldo Жыл бұрын
Nice, this topic deserves its own playlist. RCNN has so many component, you can make separated short video for each component, so it wont be overwhelming for the viewers.
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Muhammad. I actually want to create videos for other object detection algorithms as well and put them in a playlist. From my past experience and based on the videos I've seen, usually, long videos get more viewers. I already separated this video into different chapters and viewers can watch each one on their own time. It's a kinda subjective opinion I believe.
@zukofire6424
@zukofire6424 Жыл бұрын
@@soroushmehraban how about Yolo?
@charbelbm73
@charbelbm73 Жыл бұрын
Nice video! Keep up the great work
@soroushmehraban
@soroushmehraban Жыл бұрын
Thank you, Bellz!
@nestedhuman8951
@nestedhuman8951 2 ай бұрын
dude!!! that was such a nice explanation
@soroushmehraban
@soroushmehraban 2 ай бұрын
Thanks!
@seokeonchoi4049
@seokeonchoi4049 Жыл бұрын
Cool! Nice work.
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Seokeon. I hope you find it useful.
@sarahsameh9994
@sarahsameh9994 3 ай бұрын
thank you for your great explanation! keep going!
@soroushmehraban
@soroushmehraban 2 ай бұрын
Thanks!
@tandavme
@tandavme Жыл бұрын
Great explanation, keep doing it!
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Alexander!
@ceritatujuhdesember5393
@ceritatujuhdesember5393 10 ай бұрын
This so easy how i can uderstand about RCNN and that is because your explanation! thank you very much, i love your video
@soroushmehraban
@soroushmehraban 10 ай бұрын
Glad you liked it!
@arefmotamedi7931
@arefmotamedi7931 Жыл бұрын
Well done. That was great
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks Aref
@kaan_aksit
@kaan_aksit Жыл бұрын
Informative video!
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Kaan!
@chayanshrangraj4298
@chayanshrangraj4298 Жыл бұрын
Nice job! Keep up the good work!
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks for the positive energy, Chayan!
@imadsaddik
@imadsaddik 7 ай бұрын
Thank you so much
@jacobyoung2045
@jacobyoung2045 Жыл бұрын
Awesome video Now I can read the paper and use the video as a guide.
@soroushmehraban
@soroushmehraban Жыл бұрын
Glad you liked it!
@ishaanyadav6103
@ishaanyadav6103 Жыл бұрын
Nice one! Please make more
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Ishaan. Sure!
@anwarvic
@anwarvic Жыл бұрын
Cool video! Keep them coming
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Mohamed!
@yassersouri6084
@yassersouri6084 Жыл бұрын
Great video. Good job. Request for follow up videos: Faster R-CNN, Mask R-CNN, DETR
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Yaser. I'll post them. But first I'll post Fast R-CNN
@canxkoz
@canxkoz Жыл бұрын
Congrats. Good work.
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Can! Appreciate it.
@NagarajuSeru-rc7lb
@NagarajuSeru-rc7lb 8 ай бұрын
Very Nice.. Thank you so much.... I have a question related to NMS... that As you explained about NMS, IOU of classified object regions will calculated over the ground truth value at the time of training and validation but what about at the time of inference ? since you have grouth truth values at time of train and validate only but not at inference. awaiting for your response.... thank you so much adavance
@alirezaghaffartehrani1279
@alirezaghaffartehrani1279 Жыл бұрын
bright explanation Thanks
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Alireza. I hope you found it useful.
@pouyaaminaie6041
@pouyaaminaie6041 Жыл бұрын
Nice work
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Pouya.
@zukofire6424
@zukofire6424 Жыл бұрын
Thanks very much for this, it's much clearer to me know (after starting from just the paper). (Edit : this Paper is clearly explained in every way)
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks for the honest feedback 😃 looking at the previous videos posted, I’m not using that phrase anymore.
@zukofire6424
@zukofire6424 Жыл бұрын
@@soroushmehraban Oh I spoke too fast, (bc I watched some parts of the video several times, I thought you used the expression several times)... Yeah I take it back apologies, oc everyone can use this expression!
@lakshaydulani
@lakshaydulani Жыл бұрын
good work
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Lakshay.
@raj-nq8ke
@raj-nq8ke 8 ай бұрын
Great.
@huyinit
@huyinit 3 ай бұрын
thank you so much , such an amazing video . Can i ask which tool/app you using for this slide? i love how they working
@soroushmehraban
@soroushmehraban 3 ай бұрын
Thanks for the feedback Huy 🙂It's just a powerpoint.
@gaussic6985
@gaussic6985 Жыл бұрын
Keep up the good work
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks!
@Javad-ek4es
@Javad-ek4es 6 ай бұрын
Very nice! Thanks a lot! May you please upload your slides, too?
@alinaderiparizi7193
@alinaderiparizi7193 Жыл бұрын
Great Job, Can't wait to see more videos of you. Can you fix your microphone for next videos?
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Mohandes. I'll try enhancing the quality by changing my recording method but still it's not gonna be perfect. At least not in the first few videos.
@nestedhuman8951
@nestedhuman8951 2 ай бұрын
what is the background music you are using in the video ?
@soroushmehraban
@soroushmehraban 2 ай бұрын
I don't remember that was a long time ago. I'm not using any background music anymore.
@louisbertson
@louisbertson Жыл бұрын
great
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, Louis.
@sriharsha580
@sriharsha580 Жыл бұрын
How does NMS works in inference? As we won't be having ground truth
@soroushmehraban
@soroushmehraban Жыл бұрын
That's a great question. I think I should have mentioned that. Our model might predict different bounding boxes pointing to the same object. In such a scenario, we do the following: 1) Sort all the predicted bounding boxes based on the class score (In descending order). 2) Pick the first bounding box that has the highest probability score. 3) Compute the IoU of the selected bounding box with other bounding boxes pointing to the same class. 4) If the IoU of any bounding box with this bounding box is larger than a threshold (such as 0.5), then we remove the bounding box having the lower class score. I hope it's clear.
@NagarajuSeru-rc7lb
@NagarajuSeru-rc7lb 8 ай бұрын
​@@soroushmehraban i think following conditions might not be sufficient, because even if we sort and pick highest one... again we left with question of all these are pointing to same object location or reference really in a image ? same object references might be at multiple places please clarify this doubt
@soroushmehraban
@soroushmehraban 8 ай бұрын
That's true we might have same objects at multiple places. let's say we have object A at location (x1, y1) and (x2, y2). for location (x1, y1) our model might predict multiple bounding boxes all refer to the object A. Out of all these bounding boxes we only keep the one that has the highest score and others if they have IOU higher than a threshold with this bounding box, we remove them. For object A at place (x2, y2), since it's in different area of the image, the IoU with the one having highest score is less than a threshold, so we keep the second one having the highest threshold and again others having IoU higher than a threshold, we remove them. @@NagarajuSeru-rc7lb
@failuredocumentary
@failuredocumentary Жыл бұрын
Nais work man, keep this up, I wanna see moo 🤌❤️
@soroushmehraban
@soroushmehraban Жыл бұрын
Thanks, man! I'll try my best.
Fast R-CNN: Everything you need to know from the paper
38:37
Soroush Mehraban
Рет қаралды 10 М.
How YOLO Object Detection Works
17:04
DeepBean
Рет қаралды 19 М.
Маленькая и средняя фанта
00:56
Multi DO Smile Russian
Рет қаралды 5 МЛН
Sprinting with More and More Money
00:29
MrBeast
Рет қаралды 24 МЛН
Why Computer Vision Is a Hard Problem for AI
8:39
Quanta Magazine
Рет қаралды 118 М.
Faster R-CNN: Faster than Fast R-CNN!
12:18
Soroush Mehraban
Рет қаралды 5 М.
R-CNN in depth
13:32
Datum Learning
Рет қаралды 6 М.
But what is a convolution?
23:01
3Blue1Brown
Рет қаралды 2,5 МЛН
Mask R-CNN
12:22
ComputerVisionFoundation Videos
Рет қаралды 103 М.
Deep Learning Bootcamp: Kaiming He
1:15:46
MIT Schwarzman College of Computing
Рет қаралды 49 М.
Convolutional Neural Networks Explained (CNN Visualized)
10:47
Futurology — An Optimistic Future
Рет қаралды 181 М.
Watching Neural Networks Learn
25:28
Emergent Garden
Рет қаралды 1,1 МЛН
YOLOv1 from Scratch
1:43:11
Aladdin Persson
Рет қаралды 166 М.