Scikit-learn Crash Course - Machine Learning Library for Python

364,489 views

freeCodeCamp.org

Scikit-learn is a free software machine learning library for the Python programming language. Learn how to use it in this crash course.
✏️ Course created by Vincent D. Warmerdam.
⭐️ Course Contents ⭐️
⌨️ (0:00:00) introduction
⌨️ (0:03:08) introducing scikit-learn
⌨️ (0:34:36) preprocessing
⌨️ (0:53:36) metrics
⌨️ (1:24:49) meta-estimators
⌨️ (1:45:34) human-learn
⌨️ (2:06:17) wrap-up
⭐️ Code ⭐️
💻 Full code: github.com/koaning/calm-noteb...
💻 Notebook per section:
🖥 introducing scikit-learn: github.com/koaning/calm-noteb...
🖥 preprocessing: github.com/koaning/calm-noteb...
🖥 metrics: github.com/koaning/calm-noteb...
🖥 meta estimators: github.com/koaning/calm-noteb...
🖥 human-learn: github.com/koaning/calm-noteb...
⭐️ Other Resources ⭐️
🔗 calmcode.io
🔗 scikit-learn docs: sklearn.org/index.html
🔗 spaCy course: • Intro to NLP with spaC...
🔗 PyData YouTube channel: / pydatatv
🔗 algorithm whiteboard: • Rasa Algorithm Whitebo...
--
Learn to code for free and get a developer job: www.freecodecamp.org
Read hundreds of articles on programming: freecodecamp.org/news

Comments: 216
@freecodecamp 3 years ago
Message from the creator: I hope you've all enjoyed this series of videos. It was fun to collaborate with freeCodeCamp! If you're interested in more content from me feel free to check out calmcode. Also, I'd like to give a shoutout to my employer, Rasa! We're using scikit-learn (and a whole bunch of other tools) to build open-source chatbot technology for python. If that sounds interesting, definitely check out rasa.com/docs/rasa/.
@buraksenel263 2 years ago
This is by far the most beginner friendly introduction to sk-learn I've seen
@lVaNeSsA90 3 years ago
Wow - I need to share this with the rest of the class! Thanks for making this video so understandable.
@flashbao1922 3 years ago
This video saved me from a 5K course! Thanks! Loads of Love!
@riccello 2 years ago
This is the way everything should be taught! I love that you present concepts in a structured and systematic way, speaking slowly and clearly, using as few words as possible:
- starting with the concept and talking through drawing a logical diagram (which is so important for developing abstract thinking in terms of high level concepts, which is how we think when we are experienced in something)
- then writing clean, concise code to implement each part of the concept
- showing plots that directly demonstrate the effects of the entire iteration
Too many tutorials make the mistake of talking too much. A lot of videos also either assume too much or too little about the viewer's knowledge. This seems to confidently strike the nail on the head! Thanks!
@manuelcervantes1939 2 years ago
Amazing review!
@abdulwahab182 2 years ago
Exactly 👍
@abdullahshahzad333 1 year ago
Are you serious??? The instructor didn't even show the dataset. How would anyone understand what's going on like this?
@ThomasKuncewicz 11 months ago
The way each dataset complements the associated pitfall you want to bring up at a given moment... wow. What an amazing intro -- it must have taken a lot of forethought and behind the scenes organization to make the flow of this video series seem so effortless. THANK YOU!!
@nonSensCave 9 months ago
Please bro, can you tell me where to find the appendix for the plot answer?
@gabriel1991 3 years ago
OMG! I love all the content that Vincent makes! I must watch this video!
@universal4334 3 years ago
Send me a link to his channel
@imdadood5705 3 years ago
Just completed the first part of the lecture. I have been using scikit for a couple of months! Dudeee! This is an eye opener!
@develxper7931 2 years ago
I was rewatching the course to make my basics better, there were actually a lot of details man!!!
@JoshJetson 1 year ago
This is an excellent tutorial. I'm doing the Coursera IBM machine learning cert and supplementing it with this video. This overall is a much more palatable and easier to understand tutorial of scikit-learn and really a machine learning model in general. Awesome work!
@codesiddhi 3 years ago
Just Amazing once again, u guys rock as always...
@dariuszspiewak5624 2 years ago
I must agree with others: this is a great lecture. I mean... REALLY good. Vincent, do you have any more of these? This stuff is not only informative, but also pleasant to watch and listen to. Good, correct, and clear English is rather rare these days. Sadly. This lecture is good because it does not shy away from details. It also goes beyond just showing the API. It tries to build something new from the available "Lego" pieces. Which is great as it shows creativity and also how to dig deeper to understand the data. Very, very good exposition. Many thanks.
@tyronefrielinghaus3467 8 months ago
I feel you about clear and well enunciated English. I HATE having to 'interpret' what I'm hearing....too much extraneous Cognitive Load for an already high Intrinsic Load topic.
@rodiekozlovsky2415 3 years ago
what a great course! thank you for opening the gates..
@Duh_Daily 11 months ago
the explanations are well detailed, this really helps with understanding the library and knowing exactly what to use and where to use it. You have helped a great community of beginners. 🙏🏾🙏🏾🙏🏾🙏🏾🙏🏾
@kateryna_today 3 years ago
Just started learning scikit! thank you for the material
@johnmo1111 1 year ago
Great video. Helped me with multiple sections that I had been fumbling my way through. No harm going over some things I already knew as well. Thanks for this..👍
@ShiftKoncepts 9 months ago
thank you so much! I am slowly digesting this stuff and most likely will have to review it 2 or more times.
@navneetTanks 3 years ago
Thank you very much, much needed for beginners like me❤️, I hope one day when I become an expert, I will make free courses for others too❤️
@dilshanchrishantha6548 3 years ago
great series of demo videos. well explained for a beginner to learn from zero.
@dilshanchrishantha6548 3 years ago
excellent explanation for a beginner in ML. Thanks for the course.
@gustavojuantorena 3 years ago
Awesome! Thank you for sharing!
@rouzbehamirazodi3001 6 months ago
Well explained and high quality video and audio. Unlike some other videos out there.
@Treegrower 3 months ago
This video is awesome! Your narration style is fantastic.
@rodrigo100kk 3 years ago
Great video! At 1:49:40 you could use ".values" at the end instead of np.array in the beginning.
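To illustrate the suggestion above, here is a minimal sketch of the two spellings side by side. The frame and its column names are hypothetical placeholders, not the data used at 1:49:40; the sketch only assumes the video wraps a DataFrame selection in `np.array`.

```python
import numpy as np
import pandas as pd

# Hypothetical frame; the column names are placeholders, not the video's data.
df = pd.DataFrame({"feature_a": [1.0, 2.0, 3.0], "feature_b": [10.0, 20.0, 30.0]})

wrapped = np.array(df[["feature_a", "feature_b"]])         # wrap the selection up front
via_values = df[["feature_a", "feature_b"]].values         # ask pandas for the array at the end
via_to_numpy = df[["feature_a", "feature_b"]].to_numpy()   # spelling the pandas docs recommend

# All three produce the same NumPy array.
same = np.array_equal(wrapped, via_values) and np.array_equal(wrapped, via_to_numpy)
print(type(via_values).__name__, via_values.shape, same)
```

Which one to use is mostly a matter of taste; the result handed to scikit-learn is identical.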
@pw7225 2 years ago
Kudos! Excellent training.
@bogoodski 1 year ago
So amazing. Either this video is especially approachable or I've been exposed to these concepts enough now that they're finally starting to click. Probably both, but the former is definitely a significant factor. Well done
@bogoodski 1 year ago
By the way, I'm working through the eCornell Python for Machine Learning and certificate in Machine Learning courses and this video is a perfect supplement. This is so helpful. Thank you!
@jakobaljaz705 1 year ago
I feel I learned so much, great job sir. Thank you :)
@albertog2196 3 years ago
Very good teacher. Thanks for the content I learned a lot.
@yugosaito9704 10 months ago
Thank you for uploading this video!
@gisleberge4363 2 years ago
Great introduction to ML, educational and well explained to the core... 🙂
@dosiedoe 2 years ago
it's insane how good this video is
@ccuny1 3 years ago
Fantastic. Thank you very much.
@cristhiancasierra8265 3 years ago
PERFECT TIMING!!!
@berdeter 1 year ago
I loved the end chapter that joined machine learning with expert systems I used 30 years ago...
@JoseRicardoXavier 3 years ago
Amazing presentation !!
@padmanabhan_s 3 years ago
Excited!!!
@memelol1859 2 years ago
Wow thank u this really clarified my doubts :)
@reyou7 3 years ago
amazing content, thanks a ton!
@MrCrunsh 3 years ago
I'm busy for the next 2h.
@shivamjalotra7919 3 years ago
Me too
@thomasbates9189 9 months ago
Way to go!
@thomasbates9189 9 months ago
Very helpful! Thank you!
@abcdasa1830 1 month ago
thank you. your video made scikit-learn and machine learning clear to me. you're my saint
@Gh0stiefr 23 days ago
Is this tutorial still worth watching this year? It's 3 years old!!
@rodionraskolnikov6989 1 year ago
truly a great tutorial!
@salivona 3 years ago
Beautiful lecture!
@cerioscha 8 months ago
great video series, thanks! In this video @56:56 I think you meant to say that "there are way more cases without Fraud than with Fraud"
@victoran0 7 months ago
exactly why I came to the comments
@locky916 5 months ago
Thanks for this great material about scikit-learn, it is really helpful, and understanding is easier with the educator's beautiful explanations. Huge thanks and keep going...
@SK-qj3oj 3 months ago
Wow such an awesome course, can't believe this is free
@Natalie-rl5wz 4 months ago
Hello, I just wanted to say, for those who plan to follow the videos: the 'Boston house prices' dataset has been removed from scikit-learn, so this tutorial no longer works as shown unless you change the dataset.
@abdelkaderkaouane1944 1 year ago
Very interesting, Thank you very much
@wb7779 4 months ago
Very nice, thank you.
@sunshadow9704 2 years ago
You are the ONE Thank you Sir
@kennethstephani692 1 year ago
Great video!
@riccello 2 years ago
Can I ask you how you are able to draw on the screen? I understand you are probably using a Stylus pen over some touch screen surface, which mirrors your display, but what software are you using for that?
@feep1642 3 years ago
very nice tutorial watched the whole thing
@arnavmehta3669 3 years ago
How did you watch a 2 hr video in 27 minutes?
@nguyenphutho9503 3 years ago
Sorry, I have a question: which versions of Python and OpenCV are compatible? I have followed a lot of tutorials but was unable to find a compatible pair of Python and OpenCV versions. Please help me find a solution for my own project. Thank you so much.
@howardsmith4128 3 years ago
Great crash course.
@mehdismaeili3743 2 years ago
thanks for his great video.
@azertytnt421 3 years ago
Really it is amazing course
@develxper7931 2 years ago
50:00 CountVectorizer is a really good preprocessor for that too, in my opinion
@juanete69 1 year ago
Very good tutorial.
@rajveersinghanand 3 years ago
16:00 pipe
23:45 grid search
37:00 standard scaler
42:00 quantiles better
46:55 …
55:00 fraud ex
@tarsierontherun 1 year ago
comeback dude. don't give up.
@pushpendratripathi46 3 years ago
Thanks ♥️♥️
@vadimrudakov8907 1 year ago
Data leakage? In the introduction section (around 28:41) we have a grid search that contains a pipeline with the numeric features transformer. I guess this is a route to data leakage, because in our pipeline we first transform all the numeric features in the entire dataset and straight after that we start training our model through the cross-validation process on the entirely transformed dataset. Our training sets, created during CV, contain previously standardized data, so the model "knows" something about the examples that are not in the training set and can predict better when it processes them in the prediction step. Thus we should exclude any numeric feature transformation from our grid search, am I right? If I'm not, please explain the mechanism.
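On the leakage question above: when the scaler sits inside a `Pipeline` and the pipeline is what gets cross-validated, scikit-learn clones and refits the whole pipeline on each training fold, so the scaler's statistics come from that fold only. The sketch below checks this directly. It is an illustration under assumptions, not the video's notebook: it uses the bundled `load_diabetes` data and a `StandardScaler` plus `KNeighborsRegressor` pipeline as stand-ins.

```python
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.model_selection import KFold, cross_validate
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Stand-in data and pipeline (assumptions, not the video's exact setup).
X, y = load_diabetes(return_X_y=True)
pipe = Pipeline([("scale", StandardScaler()), ("model", KNeighborsRegressor())])

# A fixed random_state makes cv.split() return the same folds on every call.
cv = KFold(n_splits=3, shuffle=True, random_state=0)
result = cross_validate(pipe, X, y, cv=cv, return_estimator=True)

fold_means_match = []
differs_from_full = []
for (train_idx, _), fitted in zip(cv.split(X), result["estimator"]):
    scaler = fitted.named_steps["scale"]
    # The scaler's learned mean equals the mean of the training fold only...
    fold_means_match.append(np.allclose(scaler.mean_, X[train_idx].mean(axis=0)))
    # ...and is not the mean of the full dataset.
    differs_from_full.append(not np.allclose(scaler.mean_, X.mean(axis=0)))

print(all(fold_means_match), all(differs_from_full))
```

Leakage of the kind described would occur if the data were scaled once, outside the pipeline, before being handed to the grid search. `GridSearchCV` applies the same per-fold refitting to whatever estimator it is given.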
@tanb13 3 years ago
Does Vincent have his own channel? I just love his teaching style!!
@randomguy75 3 years ago
google calmcode
@randomguy75 3 years ago
you're welcome
@AcidiFy574 3 years ago
Awesome tutorial, I have some suggestions regarding your content:
1. Tutorial on RUST
2. Tutorial on JULIA
3. Tutorial on AWK & SED (Especially AWK)
4. Tutorial on LUA
What do you guys think????
@cientifiko 1 year ago
Very useful... I ran the code in IDLE but it didn't work well; some things need revising, such as a library being imported after a variable that uses it.
@sonalkudva1839 5 months ago
I am trying to learn from this course but it says that the Boston dataset has been removed from scikit-learn. What should I do?
@juaningo24 2 months ago
You can still downgrade your scikit-learn version to 1.0.2 and it should be fine. If you don't want to, you can use fetch_california_housing instead.
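A minimal sketch of swapping the dataset, following the suggestion above. To stay runnable without a download it uses `load_diabetes`, which ships with scikit-learn; `fetch_california_housing(return_X_y=True)` returns data of the same shape of interface and can be substituted on the marked line (it downloads the data on first use). The scaler plus k-nearest-neighbours pipeline is an assumption about the kind of model built in the first section, not a copy of the notebook.

```python
from sklearn.datasets import load_diabetes
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Bundled regression dataset used as a stand-in for the removed Boston data.
# To follow the comment above instead:
#   from sklearn.datasets import fetch_california_housing
#   X, y = fetch_california_housing(return_X_y=True)
X, y = load_diabetes(return_X_y=True)

pipe = Pipeline([
    ("scale", StandardScaler()),
    ("model", KNeighborsRegressor()),
])
grid = GridSearchCV(pipe, param_grid={"model__n_neighbors": [1, 5, 10]}, cv=3)
grid.fit(X, y)

predictions = grid.predict(X)
print(grid.best_params_, predictions.shape)
```

Any regression dataset with numeric features should work the same way; only the plots and scores will differ from those in the video.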
@khal7994 2 years ago
00:19 I did not understand why the prediction plot changed after changing the k value from 5 to 1. KNN is a classification algorithm, but here it was used like a regression.
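On the question above: nearest neighbours can be used for regression as well as classification. scikit-learn's `KNeighborsRegressor` predicts the average target of the k closest training points, so the choice of k changes the predictions. The data below is made up purely for illustration.

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

# Tiny made-up 1-D dataset with distinct x values.
X = np.arange(10, dtype=float).reshape(-1, 1)
y = np.array([0.0, 1.0, 0.5, 3.0, 2.5, 4.0, 3.5, 6.0, 5.0, 7.0])

k1 = KNeighborsRegressor(n_neighbors=1).fit(X, y)
k5 = KNeighborsRegressor(n_neighbors=5).fit(X, y)

pred_k1 = k1.predict(X)   # each training point is its own nearest neighbour
pred_k5 = k5.predict(X)   # mean target of the 5 nearest training points

memorised = np.allclose(pred_k1, y)            # k=1 reproduces the training targets
smoother = np.var(pred_k5) < np.var(pred_k1)   # k=5 averages, so predictions vary less
print(memorised, smoother)
```

With k=1 the model reproduces the training data exactly when evaluated on it, which is why a prediction plot made on the training set can look perfect; with k=5 each prediction is an average and the plot looks smoother.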
@hassanhijazi4757 1 year ago
I did not succeed in reproducing the figure @ 1:16:56. I always get the same figure as the one just before, even though I did the log transformation of the "Amount" column. Has anyone had the same problem?
@juanete69 1 year ago
Is GridSearchCV(... ,cv=3) doing a nested crossvalidation?
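On the question above: `GridSearchCV(..., cv=3)` on its own runs a single 3-fold cross-validation per parameter setting; it is not nested. A nested scheme wraps the search in an outer cross-validation loop, for example with `cross_val_score`. The sketch below shows that arrangement on the bundled `load_diabetes` data, which is a stand-in and not the video's dataset.

```python
from sklearn.datasets import load_diabetes
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.neighbors import KNeighborsRegressor

X, y = load_diabetes(return_X_y=True)

# Inner loop: 3-fold CV used only to pick n_neighbors.
inner = GridSearchCV(KNeighborsRegressor(), {"n_neighbors": [1, 5, 10]}, cv=3)

# Outer loop: 5-fold CV that scores the whole "search, then refit" procedure.
# Each outer training fold runs its own inner grid search.
outer_scores = cross_val_score(inner, X, y, cv=5)
print(outer_scores.shape)
```

The outer scores estimate how well the tuned model generalises, while the inner search's `best_score_` tends to be optimistic because it was used to choose the parameters.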
@JoshKonoff1 2 years ago
Where are the datasets for the sklearn metric tutorial (credit card dataset, etc)? Thank you!
@xnalebb 4 months ago
In the metrics part, when you plot mean recall and mean precision, how is it that I got the same results for the train and test sets?
@abdougadrydiallo1318 3 years ago
Where can we find the dataset ?
@wexwexexort 3 years ago
I like this guy.
@parzynamea4701 2 years ago
Where is that make_plots function from, at 1:31:00?
@wiktorm9858 9 months ago
Time series needed these Polynomial parameters, I think. Cool tutorial though!
@aminghafoori6496 3 years ago
thank you
@olhaklishchuk 1 year ago
I have one question about the running time of the GridSearchCV pipeline: how can I reduce the time it takes to run the code? My model had a mean fit time of at least 9 min. My processor is an AMD Ryzen 5 5500U with Radeon Graphics, 2.10 GHz, 6 cores. Thank you in advance!
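Two common ways to shorten a hyperparameter search, sketched under assumptions since the commenter's pipeline is not shown: run the fits in parallel with `n_jobs=-1`, and sample a few settings with `RandomizedSearchCV` instead of trying every combination. The data and pipeline below are stand-ins.

```python
from sklearn.datasets import load_diabetes
from sklearn.model_selection import RandomizedSearchCV
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Stand-in data and pipeline (assumptions, not the commenter's setup).
X, y = load_diabetes(return_X_y=True)
pipe = Pipeline([("scale", StandardScaler()), ("model", KNeighborsRegressor())])

search = RandomizedSearchCV(
    pipe,
    param_distributions={"model__n_neighbors": list(range(1, 21))},
    n_iter=5,        # try 5 sampled settings instead of all 20
    cv=3,
    n_jobs=-1,       # spread the fits over all available cores
    random_state=0,
)
search.fit(X, y)

# 5 settings x 3 folds = 15 fits, plus one final refit on all the data.
n_fits = len(search.cv_results_["params"]) * search.n_splits_
print(search.best_params_, n_fits)
```

`GridSearchCV` accepts the same `n_jobs` argument. Reducing the number of folds or the size of the grid cuts the fit count directly, and when a single fit takes minutes, a smaller sample of the data during tuning may also help.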
@abdulwahab182 2 years ago
Great 👍
@edwardramirez824 1 year ago
What editor do you use? I'm new to this
@kevindandrade5307 3 years ago
The section on Metrics gets confusing for me. Any easy to understand books I can read for understanding metrics?
@saptarshisanyal4869 2 years ago
The metrics section was overwhelming for me as well. There has to be some prerequisite groundwork before going for this.
@mugumyavicent2803 2 years ago
thanks my co name --- vicent, you inspire me to do machine learning
@rodionraskolnikov6989 1 year ago
great tutorial! one question: how do you make the plots at 1:29? the 'make_plots' function
@baka6884 8 months ago
he imported matplotlib.pyplot and used a scatter plot, I think
@NotEnoughTime-cf2pi 4 months ago
When was the data split into test/train?
@shajidmughal3386 1 year ago
So far into the video, I don't see the data split into train and test samples. Does that mean the model is testing on seen data? If yes, how reliable are these metrics? Someone shed some light, please.
@_seeker423 6 months ago
@43:00 where you perform the QuantileTransformer step and plot it...shouldn't the scatter plot fn take X (non transformed) and X_new (transformed) data as params? Little confused why we passed X_new[:, 0] X_new[:, 1]. It seems like we plotted 2 different features (indexed by 0, 1) after transformation step?
@vignatej663 5 months ago
No, it is actually NumPy indexing syntax: X[rows, cols] chooses the given rows and the given columns. So X_new[:, 0] chooses all rows of column 0, and X_new[:, 1] chooses all rows of column 1. Hope this helps
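A short sketch of the indexing discussed above. `QuantileTransformer.fit_transform` returns a NumPy array when given one, so `X_new[:, 0]` and `X_new[:, 1]` are the first and second transformed features, and the scatter plot shows one against the other. The data here is randomly generated and only stands in for the CSV used in the video.

```python
import numpy as np
from sklearn.preprocessing import QuantileTransformer

rng = np.random.default_rng(0)
# Made-up two-feature data with a long tail, standing in for the video's CSV.
X = rng.exponential(size=(200, 2))

X_new = QuantileTransformer(n_quantiles=100).fit_transform(X)

col0 = X_new[:, 0]   # every row, first transformed feature  -> x-axis
col1 = X_new[:, 1]   # every row, second transformed feature -> y-axis
print(type(X_new).__name__, X_new.shape, col0.shape, col1.shape)
```

To compare before and after, one would plot `X[:, 0]` against `X[:, 1]` in one figure and `X_new[:, 0]` against `X_new[:, 1]` in another.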
@mrrishiraj88 3 years ago
Thanks
@cientifiko 1 year ago
this has an awesome didactics
@vigneshpadmanabhan 3 years ago
Thanks!
@vigneshpadmanabhan 3 years ago
this is one of the best videos I have seen covering sklearn so well. Thanks a lot! would love to learn sklearn in more depth for different scenarios ..
@saptarshisanyal4869 2 years ago
Hi Vignesh, could you suggest a book which covers the metrics section?
@johnmito100 3 years ago
What is the difference between this and the last?
@louisshengliu 2 years ago
Could you please explain why the min of recall and precision is lower than both? Could not find appendix.
@adrienpyb1611 2 years ago
+1, anyone knows where to find the appendix?
@ANONIM9123 2 years ago
hint: min_both is calculated separately at every train/test split in the cross-validation
@user-wr6rb5eb5g 5 months ago
+1, same, could not find appendix
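The hint above can be checked with a few lines of arithmetic. If the minimum of recall and precision is taken inside each cross-validation split and then averaged, the result can sit below both the mean recall and the mean precision, because different splits can have a different metric as the weaker one. The per-fold numbers below are made up for illustration and are not taken from the video.

```python
import numpy as np

# Made-up per-fold scores for three CV splits (not the video's numbers).
recall = np.array([0.9, 0.5, 0.7])
precision = np.array([0.5, 0.9, 0.7])

mean_recall = recall.mean()
mean_precision = precision.mean()

# The minimum is taken per fold first, then averaged across folds.
mean_of_min = np.minimum(recall, precision).mean()

print(mean_recall, mean_precision, mean_of_min)
```

In general the mean of per-fold minima can never exceed the smaller of the two means, and it is strictly lower whenever the folds disagree about which metric is worse.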
@xuyi2893 1 year ago
Do you guys know where I can download that csv file used in pre-processing part? Thanks!
@xuyi2893 1 year ago
Sorry....nvm...I think I know where I can get that data. Thanks though!
@ninadkawade4681 15 days ago
What are the prerequisites for scikit-learn?
@AlmogYosef520 3 years ago
Hi, what do you guys suggest me to watch if I'm totally new to ML? I find this course a little bit beyond my knowledge, I thought because I've got the foundation of DS I can jump on this course but I think I'll need some intro to ML videos.
@Caradaoutradimensao 3 years ago
StatQuest
@AlmogYosef520 3 years ago
@Caradaoutradimensao Awesome looks good! Thanks a lot!
@spiritech7162 2 years ago
@Caradaoutradimensao thanks bro
@Ingles.con.peliculas 1 year ago
it's great...
@messedinsaan 7 days ago
"Building dependencies failed" error: subprocess-exited-with-error Cannot import boston housing price dataset.
@huynguyen-to9wm 1 year ago
ok thanks you bro
@srichaidiamond1032 2 years ago
Hello, I run into an attribute error exception when I try to access .cv_results_ on my model: 'GridSearchCV' object has no attribute 'cv_results_'. The line of code is df = pd.DataFrame(mod1.cv_results_), where mod1 is my model. Does anyone know if there is a bug? I am using version 1.1.1 of scikit-learn
@bonettimauricio 1 year ago
I'm having the very same error here as well, I have installed the specific version scikit-learn=0.23.0.
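One common cause of the error above, offered as an assumption since the commenters' full code is not shown: `cv_results_` is only created by `fit`, so reading it from a `GridSearchCV` object that has not been fitted raises exactly this `AttributeError`. The sketch below reuses the name `mod1` from the comment; the data and parameter grid are stand-ins.

```python
import pandas as pd
from sklearn.datasets import load_diabetes
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsRegressor

X, y = load_diabetes(return_X_y=True)
mod1 = GridSearchCV(KNeighborsRegressor(), {"n_neighbors": [1, 5, 10]}, cv=3)

before_fit = hasattr(mod1, "cv_results_")   # attribute does not exist yet
mod1.fit(X, y)                              # the search runs here
after_fit = hasattr(mod1, "cv_results_")    # attribute exists once fit has run

df = pd.DataFrame(mod1.cv_results_)         # one row per parameter setting
print(before_fit, after_fit, df.shape[0])
```

In a notebook, the same error can appear if the cell that calls `fit` was skipped or failed, or if `mod1` was redefined afterwards.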
@mwaikul 2 years ago
Is there a way for KNN to skip the closest nearest neighbor?
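One reading of the question above is "how do I find a training point's neighbours without the point itself". For that case, scikit-learn's `kneighbors()` called with no argument queries the training points and does not count a point as its own neighbour. The sketch below only shows that lookup on made-up data; it does not change how `predict` works on a regressor or classifier.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

# Made-up 1-D points for illustration.
X = np.array([[0.0], [1.0], [3.0], [6.0]])
nn = NearestNeighbors(n_neighbors=1).fit(X)

# Passing the training data explicitly: each point finds itself (distance 0).
_, with_self = nn.kneighbors(X)

# Calling kneighbors() with no argument queries the training points
# but does not count a point as its own neighbour.
_, without_self = nn.kneighbors()

print(with_self.ravel(), without_self.ravel())
```

For new query points, another option is to ask for one extra neighbour and drop the first column of the returned indices.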
@ayanah4821 1 month ago
awesome! continue at 46:05
@tilakrajchoubey5534 2 years ago
Does this video contain anything about ML algorithms?