CUDA Programming on Python

1,169,918 views

Ahmad Bazzi

1 day ago

Comments: 770
@aishahoura2619 1 year ago
Thank you so much for responding to my request for a CUDA programming video. I have donated 0.1 BTC to your account as a way to thank you. My professor has spent so many hours trying to explain CUDA and none of my classmates really understood. I just cannot believe that you do all this for free, and that is why my classmates and I have decided to collect some funds to donate to you. Thanks for all that you do and please keep going.
@AhmadBazzi 1 year ago
Thank you for the donation, it really means a lot!
@aishahoura2619 1 year ago
@AhmadBazzi No, thank you!
@mdrubelahmed6434 1 year ago
Thank you so much for responding to my request for a CUDA programming video.
@btspower3844 1 year ago
Wow amazing
@nilsu1941 1 year ago
12:36 This guy is a God!
@sportscorridor4064 1 year ago
very nice
@mdrubelahmed6434 1 year ago
So beautiful
@sksk-lo8kc 1 year ago
That was very well explained. I have only taken one course, and you made it clearer than my professor or fellow students ever did.
@denizart2255 1 year ago
You just opened my eyes to parallel programming. Thanks for the quick overview.
@mdrubelahmed6434 1 year ago
Too hard to find high-quality content like this these days. Thank you so much
@Drex.Yt1 1 year ago
Too hard to find high-quality content like this these days. Thank you so much
@aoungamingyt3160 1 year ago
Thank you so much. Probably the best introduction to CUDA with Python. The example you use, while very basic, touches on usage of blocks, which is usually omitted in other introduction-level tutorials. Great stuff! Hope you return with some more videos. I have subscribed!
@danielsalazar6637 1 year ago
Excellent
@leonelaguilera9059 1 year ago
this was such an excellent video
@tatldunyas2471 1 year ago
Just did my research and this guy is at one of the most prestigious universities in the world! No wonder his lectures come out so neat!
@maher9422 2 months ago
God give you strength, brother Ahmad. I would like to ask you a small favor: make the same course in Arabic. I know it will not get many views, but your brothers need you more than the foreigners do. I understand you fine, but there are others who love this field and would love to learn it in their own language. If you do not have the time, allow me to translate the video and explain it on my channel; a like from you will tell me that you agree.
@apogeetheboss9999 1 year ago
As a data scientist with 2+ years of experience, I ALWAYS learn something new with your content! Please Nich, never stop doing these things, and also, never lose the smile on your face, even if you are having bugs!!
@sedat3218 1 year ago
I have been looking into GPU programming using numba and Python for a while, and this seems to be the best tutorial I was able to find so far. Thank you
@RAVIShankar-bm4ou 1 year ago
Thank you so much for this series! It's so clear and easy to follow
@todotasks7645 1 year ago
Too hard to find high-quality content like this these days. ⚡
@adeeshaamabidu9616 1 year ago
Love the channel Nicholas, I have recently graduated from an NLP Master's degree, and seeing you explain stuff in a simpler way and your coding challenges is really helping me connect with the material I've learned! Keep it up and I'll keep watching!
@alizz3984 1 year ago
I feel like CUDA has been demystified. Very glad I found your series.
@mustafasamet2783 1 year ago
Ayyyy, so glad you like it @Patrick. For the last two weeks I've just been making videos on stuff I find hard or want to get my head around. I figure it's not just me staring there at some of these concepts like huh?!? Thanks for checking it out!!
@astaadxofficials7813 1 year ago
I have no idea what kind of videos I am watching... but I sure will learn
@erenbasak7694 1 year ago
Hey this is super useful! I elected High Performance Computing and Microprocessors and Embedded Systems modules for my degree, and this channel has become my go-to guide.
@peloizol8947 1 year ago
I like how you did the website for documenting the video notes for reference later
@teamchanel4384 1 year ago
I'm doing an internship in a research lab and I'll have to program some kernels to implement BLAS primitives, this video really helps :)
@calskanadam1001 1 year ago
Wanted to comment that the information in this presentation is very well structured and the flow is excellent.
@-oof1016 1 year ago
Ahmad, thanks for taking time to create these videos. It is unfortunate that people view your videos and then feel inspired to complain about a free gift. Folks could just keep it moving or add helpful insights.
@user-yu4sr1ni3l 1 year ago
OHHHH MANNN, I thought about doing that but I was debating whether I'd hit the 15 minute deadline already. Good suggestion @Julian!
@Fiekriekd 1 year ago
And that's what I call a great tutorial. Thank you sir. I wish you would make more tutorials.
@notachannel2601 1 year ago
It is effectively a very easy approach to harness the power of CUDA in simple Python scripts.
@emirliemirxan2687 1 year ago
Sir, make more detailed sessions on CUDA, your explanation is great
@muradhesenov5245 1 year ago
the essence of deep learning in a few lines of code... awesome
@emircoltu875 1 year ago
Once you initialized lr to 0.0, I knew you were going to forget to change it lol. Love the challenges tho, keep doing them, I think it would be cool to see how you implement a neural network from scratch
@aspheronn 1 year ago
Wow, it is really awesome! It is much better than a tutorial from university! Thanks!
@mipxello7749 1 year ago
Very well explained. The best CUDA explanation I have come across until now 😊😊. Keep up the spirits sir.👍👍
@fatihcalidkan2254 1 year ago
holy shit, I was looking into this to speed up my Mandelbrot zooms and they are what you use as an example! This is a dream come true!
@Kvmizo 1 year ago
This was by far one of the most enlightening videos you have put up on your channel. Thanks and keep up the good work!!
@dragnn1430 1 year ago
Dear Ahmad, you are 30 years old and only doing a postdoc? I'm sorry, but to me this sounds very undervalued. Postdocs are not always well compensated for their work but spend a lot of time working and doing research. If I were you, I'd invest more time in my YouTube channel rather than doing something that does not compensate well.
@bununsemid8739 1 year ago
Perfect video! It was revealing to me to understand how it works. Thank you! I am a new subscriber of your channel. Regards from Buenos Aires, Argentina
@mrfcbs1251 1 year ago
Oh Ahmad, your tutorials are incredible and inspiring....
@theatlantisreport1595 1 year ago
This reminds me a lot of the computer tutorial tapes from the 90s
@LouieVon 1 year ago
This is the best introduction to CUDA I've seen, thanks a lot!
@SiTacuissem 1 year ago
Interesting, but two remarks. Example 1: on my setup (3080 Ti, i7-8700K, running in WSL2 with Ubuntu 22.04) vector multiplication actually runs *faster* on the CPU (if you either use the vectorized formulation in MultiplyMyVectors with target "cpu" or, simply, a*b instead of the unnecessary for loop in the CPU code). IMO that is mostly due to the overhead of copying the data to GPU memory. Example 2: to get a fair comparison, you should also use the JIT for FillArrayWithouGPU, decorating with @jit(target_backend="cpu"). Then GPU array filling is still faster, but only by a factor of 2.
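The copy-overhead argument in this comment can be made concrete with a back-of-the-envelope model. This is only a sketch: the function names, the single `bandwidth_bytes_per_s` parameter and every number in the usage lines are illustrative assumptions, not measurements from the video or from the commenter's machine.

```python
def gpu_total_seconds(bytes_moved, gpu_compute_s, bandwidth_bytes_per_s):
    """Estimated GPU wall time: host<->device copies plus kernel time.

    Hypothetical model: ignores launch latency, caching and copy/compute overlap.
    """
    transfer_s = bytes_moved / bandwidth_bytes_per_s
    return transfer_s + gpu_compute_s


def gpu_is_faster(cpu_s, bytes_moved, gpu_compute_s, bandwidth_bytes_per_s):
    """True when the GPU still wins after paying for the copies."""
    total = gpu_total_seconds(bytes_moved, gpu_compute_s, bandwidth_bytes_per_s)
    return total < cpu_s


# Illustrative numbers only (assumptions, not benchmarks).
# Cheap elementwise multiply: the copies dominate, so the CPU wins.
cheap = gpu_is_faster(cpu_s=0.002, bytes_moved=3_000_000,
                      gpu_compute_s=0.001, bandwidth_bytes_per_s=1e9)
# Heavy per-element work on the same data: the copies are amortised.
heavy = gpu_is_faster(cpu_s=1.0, bytes_moved=3_000_000,
                      gpu_compute_s=0.05, bandwidth_bytes_per_s=1e9)
print(cheap, heavy)
```

The point of the model is that the transfer term is fixed by the data size, so the GPU only pays off when the work done per byte is large enough.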
@prietjepruck 1 year ago
Thank you very much for this tutorial. I would love to have the code available because typing it in myself from the video is a bit hard, especially with the autocomplete on all the time. Keep up the good work.
@cavansirmahmudov217 1 year ago
You saved me, I had to read the PointNet2 implementation for my BCS thesis. This made the job much easier!
@mjmlangenihd8706 1 year ago
Yes, you could do this by hand, which would be a great challenge in distributed computing. Another option is to use a framework/platform like AWS SageMaker to do distributed k-means. Most organizations will do this.
@uniquevlogsbyadil 1 year ago
This was oddly intense. Great job Nicholas! Even though you ran out of time, this video is still a win to me. 😉
@kelechijames5577 1 year ago
This is very helpful. Most people don't realize the overheads and code refactoring necessary to take advantage of GPUs. I am going to refactor a simple MNIST training program I have which currently uses only NumPy, and see if I can get meaningful improvements in training time.
@Animals-vi5wt 1 year ago
Woah congrats @Ally 🎊 🎉 glad you’re enjoying the challenges, plenty more to come!!
@tajaochrisciamae4121 1 year ago
What a passionate tutorial! I wish you were my professor for my parallel programming course. Well done!
@excolabirbuyuyecek9438 1 year ago
LOL. Loved the graphic at 6:23! Brought tears to my eyes.
@turkceraplyrics6701 1 year ago
Hey Ahmad, I love watching your videos because of the way you tell the story. Great graphics mate. Love the reference to rocket man too... lol keep up the good work.
@bodyprodaction9718 1 year ago
Well, I just built a new rig with a 980 Ti and a 4790K so I'm gonna put that to the test. Thank you for your wonderful explanation :D
@yigitboran5565 1 year ago
That's mostly how it works. It's more like sorting the stones by color and pattern and counting each variety. The CPU way, you would need to count each variety separately, one after another. If you have 100 different colors and patterns, that would take a long time (even if you could count extremely accurately and fast, which is how the CPU makes up for its lack of parallelism). The GPU way lets many people count at once. Given 100 people (like the GPU), each person would count one variety at the same time.
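The stone-counting analogy can be sketched in plain Python. The names `count_variety`, `count_sequential` and `count_parallel` and the sample data are made up for illustration. The thread pool only shows how the work is split into one independent task per variety; because of Python's global interpreter lock it should not be expected to run faster here, and a real GPU kernel would be written quite differently.

```python
from concurrent.futures import ThreadPoolExecutor


def count_variety(stones, variety):
    """One 'person' counts the stones of a single variety."""
    return sum(1 for stone in stones if stone == variety)


def count_sequential(stones, varieties):
    """CPU-style: one counter handles the varieties one after another."""
    return {v: count_variety(stones, v) for v in varieties}


def count_parallel(stones, varieties, workers=4):
    """GPU-style decomposition: each variety is an independent task."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        counts = pool.map(lambda v: count_variety(stones, v), varieties)
        return dict(zip(varieties, counts))


# Hypothetical sample data.
stones = ["red", "blue", "red", "striped", "blue", "red"]
varieties = ["red", "blue", "striped"]
print(count_sequential(stones, varieties))
print(count_parallel(stones, varieties))
```

Both functions return the same counts; what changes is only whether the per-variety tasks are issued one at a time or handed out together.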
@merthanozer2964 1 year ago
Ohh, yes, thank you, and the documentation on the NVIDIA site about CUDA is very professionally written. Thank you.
@lixaxel6815 1 year ago
Excellent example of vector addition using a for loop and using CUDA
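For readers without the video at hand, here is a rough sketch of the contrast this comment mentions. It is not the code from the video: `add_loop`, `add_kernel` and `launch` are hypothetical names, and the "launch" is emulated with ordinary Python loops so that it runs without a GPU. It only shows how a block index and a thread index combine into one element index.

```python
def add_loop(a, b):
    """Plain for loop: one worker walks every element in order."""
    out = [0] * len(a)
    for i in range(len(a)):
        out[i] = a[i] + b[i]
    return out


def add_kernel(block_idx, thread_idx, block_dim, a, b, out):
    """Body that each emulated GPU thread runs for exactly one element."""
    i = block_idx * block_dim + thread_idx  # global element index
    if i < len(out):  # guard: the last block may be only partly used
        out[i] = a[i] + b[i]


def launch(a, b, threads_per_block=4):
    """Emulated launch: on a GPU these iterations would run concurrently."""
    out = [0] * len(a)
    blocks = (len(a) + threads_per_block - 1) // threads_per_block
    for block_idx in range(blocks):
        for thread_idx in range(threads_per_block):
            add_kernel(block_idx, thread_idx, threads_per_block, a, b, out)
    return out


a = [1, 2, 3, 4, 5]
b = [10, 20, 30, 40, 50]
print(add_loop(a, b))
print(launch(a, b))
```

With five elements and four threads per block, two blocks are launched and the bounds check keeps the three spare threads of the second block from writing past the end.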
@a.s.m.rashedchowdhury784 1 year ago
opened my eyes to parallel programming
@yasincaferzade8069 1 year ago
Great video, I like this kind of video where you code some AI task against the clock, teach us the concepts and show us the reality of implementing it👏
@beratcansamur1517 1 year ago
An insanely underrated series!!!
@mehmetak4349 1 year ago
What makes the CPU better than the GPU is that each core is clocked at a faster speed and has many built-in instructions like SSE, allowing data to be processed faster. This provides a tremendous benefit to programs that only run on 1 core. In rendering, where multiple cores can be used, you would need the CPU to process pixels more than 5x faster to match the GPU's performance.
@ramazandemir6848 1 year ago
Would love to see a video on a few CUDA programming challenges
@TheTurksxayers 1 year ago
Thank you for this great introduction to numba and more specifically numba+CUDA.
@yusifhsnov1802 1 year ago
So stoked you liked it 🙏
@rithusvlogtime3342 1 year ago
Fantastic tutorials on CUDA. You deserve more followers.
@enescakmak6699 1 year ago
It works on both AMD and NVIDIA. If you have CUDA code, you can convert it to HIP with their automated tool; there is very little CUDA-specific code that can't simply be translated over.
@keremkipri9436 1 year ago
Thanks for the video, subscribed! A suggestion: this small change to your code would demonstrate a real-world gradient descent solution for linear regression with noisy data. E.g.:
@aliarda9719 1 year ago
Thanks a million @Lakshman!! I try to keep it pretty tight so it’s a good challenge, otherwise I know I’ll just talk for 22 minutes anyway😅
@totallycz6819 1 year ago
You are a lifesaver @Spencer, will do it next time I'm on the streaming rig!
@ricardomilos5889 1 year ago
This was a great video to me, I have very limited C++ experience and was looking for an explanation of CUDA. Another video like this could easily have been 70-80% over my head. This one was only about 15% whoosh. And now I actually find C++ interesting again!
@bangzayy520 1 year ago
I was needing this!!! Thanks a lot, Sir!!!!
@denizugurbiltekin622 1 year ago
Thank you so very much. This is the exact kind of material I was looking for on this very specific subject. Kudos.
@maishayasuperstar3699 1 year ago
The knowledge of Ahmad knows no bounds.
@crystalannringor1025 1 year ago
Thank you so much for this video. It has helped me massively to prepare for my computer science exam.
@mendes.02 1 year ago
This is an academic example that shows the process of copying data to the GPU, doing a vectorized operation, then showing the results. What actually makes sense on the GPU vs the CPU is something I didn't cover, and I am hoping others can figure out some cool ideas.
@difenbahya3465 1 year ago
Thanks for the video, I found the first half and the wrap-up really excellent.
@agusexclusife2577 1 year ago
Can't wait to see Juan's better tutorial that he's definitely going to release :') lmao. Great video Ahmad.
@vanshd6884 1 year ago
PS. I was really so moved by your stock price episode. Thank you so sosososo much.
@hgmalani21 1 year ago
It's very informative and a good intro to CUDA programming. Thanks very much!
@gularif1 1 year ago
On the PC side Matrox was the first company to introduce GPUs. This was followed by ATI. NVIDIA came onto the scene after the success of these 2 Canadian companies. Matrox's original 3D board was a 3-board set with custom ASICs. I believe AMD actually acquired ATI. So yes, NVIDIA was not the first but they are the biggest in the space now. Matrox is still around but more involved in the industrial and niche markets.
@TheGameboyTheDream123 1 year ago
@nvidia I personally think the way you did the demonstration was perfectly sufficient. IMO, fancy graphics are unnecessary. Good job.
@imsoumyajitbag 1 year ago
Awesome video!! It's pretty cool to see such theoretical concepts coded and explained like this. Keep going Nich!!
@aysezuhre8606 1 year ago
Ahmad, sorry to bother you, the problem was not installing the CUDA Toolkit. Seriously, I hate people who don't watch the full video closely and ask stupid questions... and now I'm one of them :D. Thanks a lot for this tutorial. In 2 months I will try to write my own GPU operator for my program; it would be interesting to see if it will be faster than the CPU. (Btw, using normal Visual Studio Code in a Python 3.10 env on Win 11, so far so good, although I have some code output delay problem when using OpenCV for some strange reason.)
@aliyensagaltc3121 1 year ago
You are bloody watching a master at work xD
@MTHHC 1 year ago
Hey, thanks for the explanation! Very well done 👍 I am downloading CUDA 💪
@Voicemelod 1 year ago
Amazing! I'm learning so much watching you code. Thank you for sharing.
@kadirylmaz356 1 year ago
This is really helpful for my computing. Thank you.
@pusher7051 1 year ago
glad to see you take it as feedback and not as a hate comment
@mobilerepairs3620 1 year ago
Technically, yes. However, CUDA isn't designed to give you an extra processor to use. It's just to give you the option of using a different type of processor to do your work. GPUs have lots of processing cores (100-1000+) which helps a lot with rendering. Each core can process 1 pixel, allowing 100+ pixels to be processed at once. CPUs have a small number of cores (2-18 in the Xeons) so only 2-18 pixels can be processed at once. Hyper-Threading technology can double that number, but 36 is small compared to 100.
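The arithmetic behind this comparison can be written down as a tiny sketch. It assumes, as the comment does, that every core or hardware thread handles exactly one pixel per pass and that all passes cost the same, which ignores clock speed and per-core capability. The function name `passes_needed` and the 1920x1080 frame size are illustrative assumptions; the lane counts 18, 36, 100 and 1000 are the ones quoted in the comment.

```python
def passes_needed(n_pixels, n_lanes):
    """Rounds needed when n_lanes pixels are handled per round (ceiling division)."""
    return (n_pixels + n_lanes - 1) // n_lanes


# Assumed frame size for illustration: 1920 x 1080 pixels.
frame_pixels = 1920 * 1080

for label, lanes in [("18 CPU cores", 18),
                     ("36 hardware threads", 36),
                     ("100 GPU cores", 100),
                     ("1000 GPU cores", 1000)]:
    print(label, passes_needed(frame_pixels, lanes))
```

Under these simplifying assumptions the number of passes falls in proportion to the number of lanes, which is the comment's point about why core count matters for per-pixel work.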
@weds8296 1 year ago
Thanks for making all these topics very approachable!
@crazygamingpointSv 1 year ago
the zoom in on the unsaved icon was personal 💀
@besttwitcher4569 1 year ago
YESSSS, right?! Glad you liked it Miguel!
@vipyt8550 1 year ago
It can be found in O(1). As far as I remember the formula is derived using LDU decomposition or diagonalising a matrix, for matrix exponentiation.
@kolaybreaworlds3178 1 year ago
Also, the CT5 simulator from 1981 may not count as being from the '70s or '60s, but from what I understand, the CT5 was capable of realtime, rasterized, 3D polygonal rendering and cost $20 million at the time. It used Gouraud shading, if memory serves. There were several other CT (continuous tone) simulators developed by E&S in the '70s that did something similar or of much lower capability than the CT5 of '81. There were also the Digistar planetariums that date back to the early '80s, and the Picture System goes back to at least the early '80s. Might be vector or raster, not entirely sure myself, though.
@blackiselia8054 1 year ago
I need to say this: you are the gamechanger here!!
@mohammadaosaf5447 1 year ago
Was my first time trying a coding challenge like this.
@masteredinhypotheticalbatt9410 1 year ago
Air-Grip Biker, they are incredibly similar and the syntax is basically the same, but C++ is object-oriented and has some more modern features
@caneruc8534 1 year ago
This was really good. Thanks for posting this!
@berkozdeser8460 1 year ago
This is extremely helpful. You did an amazing job explaining the foundations
@gamingtouryoutubechannel1131 1 year ago
It's a Mandelbrot set explorer that uses both CUDA and C extensions to calculate the iterations. The multithreaded C implementation is definitely no slouch, but when you start doing over 10,000 iterations per pixel the CUDA implementation becomes significantly faster. In contrast, a pure Python implementation gets frustratingly slow already at around 1,000 iterations, so it wasn't even worth adding to the comparison.
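To show what "iterations per pixel" means here, this is a minimal pure-Python sketch of the standard escape-time calculation. It is not the commenter's explorer: the function names, the default plane bounds and the tiny grid size are assumptions chosen for illustration. Every pixel's loop is independent of every other pixel's, which is why the work maps well onto one GPU thread per pixel, and why a pure Python version slows down as `max_iter` grows.

```python
def escape_iterations(c, max_iter):
    """Iterations of z -> z*z + c before |z| exceeds 2, capped at max_iter."""
    z = 0j
    for n in range(max_iter):
        if abs(z) > 2.0:
            return n
        z = z * z + c
    return max_iter


def render(width, height, max_iter,
           re_min=-2.0, re_max=1.0, im_min=-1.5, im_max=1.5):
    """Escape counts for a width x height grid over an assumed plane window."""
    rows = []
    for y in range(height):
        im = im_min + (im_max - im_min) * y / max(height - 1, 1)
        row = []
        for x in range(width):
            re = re_min + (re_max - re_min) * x / max(width - 1, 1)
            row.append(escape_iterations(complex(re, im), max_iter))
        rows.append(row)
    return rows


grid = render(width=12, height=6, max_iter=50)
for row in grid:
    print("".join("#" if n == 50 else "." for n in row))
```

The cost is roughly width x height x max_iter loop steps for points that never escape, so raising the iteration cap by 10x makes the interior of the set about 10x more expensive to draw.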
@arifefe4264 1 year ago
KZfaq questions are typically the best way to handle an issue or an issue request on GitHub on a project demo. I will do my best to answer when I have time.
@burakcetin317 1 year ago
Excellent explanation, keep going with this content man ;)
@msmusicestudio6070 1 year ago
Thank you for these tutorials, very helpful. Minor bug: you free the host memory at *h_c but you never malloc it in the first place.
@appnana4449 1 year ago
CUDA also comes in the form of an API (i.e. using NVIDIA's CUDA library in C) to hand off parallel computation tasks to the GPU. But yes, it's both: the API is the software side, but the GPU must be CUDA-compatible (have CUDA cores) to take advantage of it.
@prof.gilang 1 year ago
This is amazing! Thank you for taking the effort to make it!
@likky2229 1 year ago
The video was very helpful for me. Many thanks to the author for developing his audience with interesting and useful content