No video

Scikit-Learn Model Pipeline Tutorial

  Рет қаралды 26,018

Greg Hogg

Greg Hogg

2 жыл бұрын

Thank you for watching the video!
Learn Python, SQL, & Data Science for free at mlnow.ai/ :)
Subscribe if you enjoyed the video!
Best Courses for Analytics:
---------------------------------------------------------------------------------------------------------
+ IBM Data Science (Python): bit.ly/3Rn00ZA
+ Google Analytics (R): bit.ly/3cPikLQ
+ SQL Basics: bit.ly/3Bd9nFu
Best Courses for Programming:
---------------------------------------------------------------------------------------------------------
+ Data Science in R: bit.ly/3RhvfFp
+ Python for Everybody: bit.ly/3ARQ1Ei
+ Data Structures & Algorithms: bit.ly/3CYR6wR
Best Courses for Machine Learning:
---------------------------------------------------------------------------------------------------------
+ Math Prerequisites: bit.ly/3ASUtTi
+ Machine Learning: bit.ly/3d1QATT
+ Deep Learning: bit.ly/3KPfint
+ ML Ops: bit.ly/3AWRrxE
Best Courses for Statistics:
---------------------------------------------------------------------------------------------------------
+ Introduction to Statistics: bit.ly/3QkEgvM
+ Statistics with Python: bit.ly/3BfwejF
+ Statistics with R: bit.ly/3QkicBJ
Best Courses for Big Data:
---------------------------------------------------------------------------------------------------------
+ Google Cloud Data Engineering: bit.ly/3RjHJw6
+ AWS Data Science: bit.ly/3TKnoBS
+ Big Data Specialization: bit.ly/3ANqSut
More Courses:
---------------------------------------------------------------------------------------------------------
+ Tableau: bit.ly/3q966AN
+ Excel: bit.ly/3RBxind
+ Computer Vision: bit.ly/3esxVS5
+ Natural Language Processing: bit.ly/3edXAgW
+ IBM Dev Ops: bit.ly/3RlVKt2
+ IBM Full Stack Cloud: bit.ly/3x0pOm6
+ Object Oriented Programming (Java): bit.ly/3Bfjn0K
+ TensorFlow Advanced Techniques: bit.ly/3BePQV2
+ TensorFlow Data and Deployment: bit.ly/3BbC5Xb
+ Generative Adversarial Networks / GANs (PyTorch): bit.ly/3RHQiRj

Пікірлер: 49
@GregHogg
@GregHogg Жыл бұрын
Take my courses at mlnow.ai/!
@TheCsePower
@TheCsePower Жыл бұрын
Thanks Greg. This made me realise how non-standard my code is. I learnt: - Use copy or deepcopy and not assignment. - Always perform preprocessing on the train and test separately. - sklearn pipelines have nothing to do with ETL pipelines from Data Engineering. - sklearn transfers have nothing to do with NLP Transformers. - sk elarn estimators have nothing to do with Statistics estimators.
@GregHogg
@GregHogg Жыл бұрын
Super glad you got some useful pointers!!
@crepantherx
@crepantherx 2 жыл бұрын
Keep Posting Greg, I am Data Analyst by profession and your video certainly helps a lot
@GregHogg
@GregHogg 2 жыл бұрын
That's awesome! Thank you 😄
@kyleGrealis
@kyleGrealis 20 күн бұрын
thanks, Greg. really good explanation and structured example. this makes it easy to create a template for easy reuse!
@hansenmarc
@hansenmarc Жыл бұрын
Great stuff! I’m curious why you used FunctionTransformer instead of ColumnTransformer, which could run the two scalers in parallel? Also, since FunctionTransformer is stateless, the documentation says that fit just checks the input rather than actually fitting the scaling parameters. Doesn’t that lead to data leakage since applying transform to test data won’t use parameters learned from fitting on the training data?
@AmitabhSuman
@AmitabhSuman Жыл бұрын
A very practical video, that I came across on Pipelines. Thank you for this video!
@GregHogg
@GregHogg Жыл бұрын
Awesome that's great to hear. You're very welcome ☺️☺️
@JJGhostHunters
@JJGhostHunters 2 жыл бұрын
Great tutorial! I use the MinMaxScaler with the option to scale from -1 to 1 instead of 0 to 1 when I am dealing with values that can be positive and negative. Seems to be fine, but I may need to reconsider going forward. I have never noticed any issues though.
@ilanyutsis9653
@ilanyutsis9653 21 күн бұрын
When you do the StandardScaler().fit on the dataframe, what is the meaning of this operation? what is happening?
@alexrook5604
@alexrook5604 Жыл бұрын
I undstand what you are doing here but I have two questions that I think would be helpful and would make it easier to follow along and replicate you steps. 1) Where did you get the data. I can't the california_housing dataset that is already in the train/test form. 2) Why not use scikit-learn tooling rather than doing it yourself? Like you could have used train/test split or pipelines (or column transformer... or similar stuff). That just has me confused.
@JJGhostHunters
@JJGhostHunters 2 жыл бұрын
I would love to see a tutorial that covers using pipelines with multilayer perceptron models (MLPs), CNNs and LSTMS.
@lythien390
@lythien390 2 жыл бұрын
Thank you Greg! It's a great video!
@GregHogg
@GregHogg 2 жыл бұрын
Glad to hear it!
@rahiiqbal1294
@rahiiqbal1294 6 ай бұрын
This was very helpful, thank you :)
@brandonn8166
@brandonn8166 Жыл бұрын
Just out of curiosity, is there a reason you don't use train_test_split to get X and y values?
@user-dh6wx3fe6y
@user-dh6wx3fe6y 9 ай бұрын
yes, why he uses X_train for train_predictions instead of another dataset X_valid
@marcofogale9719
@marcofogale9719 6 ай бұрын
Perfect explanation. Thanks a lot
@GregHogg
@GregHogg 6 ай бұрын
Very welcome 😁
@allanmachado2011
@allanmachado2011 4 ай бұрын
Thank you!
@Nadia-db6nb
@Nadia-db6nb Жыл бұрын
Thanks for the great tutorial. Can you make a video on how to combine multiple feature selection methods and feature extraction using python?
@nabanitadasgupta
@nabanitadasgupta 8 ай бұрын
Thank you for the video!
@00SeijiHan00
@00SeijiHan00 8 ай бұрын
TYSM bro really appreciate this
@GregHogg
@GregHogg 8 ай бұрын
Very welcome!!
@krzysztofzaucha3592
@krzysztofzaucha3592 4 ай бұрын
nice video Greg
@GregHogg
@GregHogg 4 ай бұрын
Thanks so much!!
@junaidlatif2881
@junaidlatif2881 Жыл бұрын
How to transform y variable and then fit model. And after how to reverse transform for the scatter plotting
@TheFrankyguitar
@TheFrankyguitar 9 ай бұрын
Thanks for this amazing video! Would that work also with a statsmodels model?
@GregHogg
@GregHogg 8 ай бұрын
Thanks so much!! And I'm not sure, haven't tried :)
@adriandiaz5688
@adriandiaz5688 Жыл бұрын
Great Video!
@GregHogg
@GregHogg Жыл бұрын
Thank you Adrian!
@talyb7383
@talyb7383 Жыл бұрын
Thanks for the great tutorial! what do I need to change to create a pipeline for an image classification model? like the cifar10 model?
@GregHogg
@GregHogg Жыл бұрын
Well, everything. You probably won't be using scikit for that. And you're very welcome!
@talyb7383
@talyb7383 Жыл бұрын
@@GregHogg I didnt explained myself clearly... I want to create a pipeline that receives a trained cifar10 model an also make preprocessing on the e data set ? so I cant use your way?
@juampaaa90
@juampaaa90 Жыл бұрын
awesome ty
@Supernyv
@Supernyv 9 ай бұрын
Awesome !
@GregHogg
@GregHogg 9 ай бұрын
Thank you!
@tareq8109
@tareq8109 2 жыл бұрын
Bro can you show how to make youtube and any video downloader make by python
@fabio336ful
@fabio336ful 2 жыл бұрын
Did you say pipelines doesn't function for classifications problems? Min: 1:07
@GregHogg
@GregHogg 2 жыл бұрын
Does, not doesn't
@fabio336ful
@fabio336ful 2 жыл бұрын
@@GregHogg thanks 🙏🏼
@AceOnBase1
@AceOnBase1 7 ай бұрын
Bro you literally just copied this out of a textbook lmao but I respect the grind.
@m18293
@m18293 Жыл бұрын
Can you share this notebook?
@GregHogg
@GregHogg Жыл бұрын
dang i think i lost it, sorry
@MrAhsan99
@MrAhsan99 2 жыл бұрын
you are ❤
@GregHogg
@GregHogg 2 жыл бұрын
❤️
@user-xn8wg6yw7g
@user-xn8wg6yw7g 8 ай бұрын
Too confusing. Too many tangents, doesn't cover the main idea clearly. Downvoted.
@GregHogg
@GregHogg 8 ай бұрын
Well I upvoted it to counter you
How do I encode categorical features using scikit-learn?
27:59
Data School
Рет қаралды 138 М.
A little girl was shy at her first ballet lesson #shorts
00:35
Fabiosa Animated
Рет қаралды 22 МЛН
Can This Bubble Save My Life? 😱
00:55
Topper Guild
Рет қаралды 53 МЛН
How I Did The SELF BENDING Spoon 😱🥄 #shorts
00:19
Wian
Рет қаралды 35 МЛН
SPILLED CHOCKY MILK PRANK ON BROTHER 😂 #shorts
00:12
Savage Vlogs
Рет қаралды 45 МЛН
Feature Engineering Techniques For Machine Learning in Python
47:58
Data Pipelines Explained
8:29
IBM Technology
Рет қаралды 148 М.
LSTM Time Series Forecasting Tutorial in Python
29:53
Greg Hogg
Рет қаралды 204 М.
DBSCAN Clustering Coding Tutorial in Python & Scikit-Learn
40:31
R vs Python
7:07
IBM Technology
Рет қаралды 317 М.
Pandas for Data Science in 20 Minutes | Python Crash Course
23:06
Nicholas Renotte
Рет қаралды 120 М.
Complete Guide to Cross Validation
29:49
Rob Mulla
Рет қаралды 53 М.
A little girl was shy at her first ballet lesson #shorts
00:35
Fabiosa Animated
Рет қаралды 22 МЛН