No video

Standardization Vs Normalization- Feature Scaling

  Рет қаралды 296,116

Krish Naik

Krish Naik

Күн бұрын

Please join as a member in my channel to get additional benefits like materials in Data Science, live streaming for Members and many more
/ @krishnaik06 If you are looking for Career Tansition Advice and Real Life Data Scientist Journey. Please check the below link
Spring board India KZfaq url: / channel
Connect with me here:
Twitter: / krishnaik06
Facebook: / krishnaik06
instagram: / krishnaik06

Пікірлер: 278
@srikrithibhat6220
@srikrithibhat6220 2 жыл бұрын
One of the best and detailed explanation on Scaling Techniques. Thank You so much Krish ji.
@anujashinde5717
@anujashinde5717 4 жыл бұрын
Hi sir. I have seen lots of videos on machine learning but I personally feel like u r d only one who’s making the videos in very fantastic way. u explains all the things in such a way that even the person who is from non technical background can understand it. Just a small req for you. Can u pls make video on all the techniques that can be apply on single data set. Like when to scale the data & apply PCA, clusters, algorithms, when to do label encoding instead of one hot. Can u pls apply all these things on any dataset so that i can have clear insight on model building. Can u pls make video on this for end to end model building
@nakul469
@nakul469 3 ай бұрын
Hi, I just read your comment and I wanted to know how's your data science career going? I just completed ML and going to create an ML project for resume. Can you please give me any kind of suggestion if you are reading this comment.
@vaibhavnakrani2983
@vaibhavnakrani2983 9 ай бұрын
You explain it very simply. I love it. I even even recommend your videos to other guyz in ML.
@gc-0377
@gc-0377 3 жыл бұрын
I love you dude, thanks for the explaining you saved me, greetings from México wey you have a new sub
@sagaryadav3473
@sagaryadav3473 4 жыл бұрын
I'm really in love with the way you explain. So nice :)
@aryanchauhan9086
@aryanchauhan9086 3 жыл бұрын
I hope one day I will become data scientist like you , you are really helpful for aspiring data scientist like me
@deeptipancholi8814
@deeptipancholi8814 6 ай бұрын
Hey Have you become data scientist ? If yes please suggest me something
@tymothylim6550
@tymothylim6550 3 жыл бұрын
Thank you Krish for this video! It was fantastic in helping me understand the difference between these 2 things and some additional advice regarding how it helps with some other things (e.g. helping some kinds of models optimize faster!)
@baharehghanbarikondori1965
@baharehghanbarikondori1965 3 жыл бұрын
the best video on Standardization & Normalization
@123anandik
@123anandik 4 жыл бұрын
Good one 🤘🤘🤘 Actually z score is much widely used for most of the algorithms as i have seen. And I do practice the same all the time. The reason is the affect of the outliers. Outliers can be easily detected by z score. Normalistion between 0 to 1 just shrinks curves.
@crackthecode1372
@crackthecode1372 4 жыл бұрын
can u please explain ur outliers point
@lars1597
@lars1597 3 жыл бұрын
@@crackthecode1372 outliers are just noise
@TheMaverickanupam
@TheMaverickanupam 3 жыл бұрын
@@lars1597 Sometimes outliers are important noise. Outliers can tell a lot about data. They can't simply be dropped.
@bhaskartripathi
@bhaskartripathi 3 жыл бұрын
Minmax scaler is the most widely used in forecasting research papers. Z-score is not very good in time series forecasting
@santoshandawarapu340
@santoshandawarapu340 6 ай бұрын
I have gone through other speakers videos but they are hard to follow. I really liked the way of explanation in a very simple way with great examples. Thank you brother.
@raziekhairy5799
@raziekhairy5799 3 жыл бұрын
Thank you. I wish this world will be fulled of people like you!
@rohitkamra1628
@rohitkamra1628 4 жыл бұрын
I have completed Statistics Playlist. You explained in a very good way. Thanks for this. :)
@dungtran-vk3ed
@dungtran-vk3ed 4 жыл бұрын
Here you go. Hope it can help you guys df = pd.read_csv('raw.githubusercontent.com/rasbt/pattern_classification/master/data/wine_data.csv', header= None, usecols=[0,1,2])
@maruthiprasad8184
@maruthiprasad8184 2 жыл бұрын
Thanks
@akshatabm4491
@akshatabm4491 10 ай бұрын
Great content. Thank you for explaining in the best way possible. However a small suggestion, please include the links of dataset your are using in the description box. It will be helpful to practice along while watching the video. Thanks again, cheers!!
@imamamansoor5174
@imamamansoor5174 11 ай бұрын
Krish whenever i get confused for any Data Science topic, i search it on YT, if your video pops up for it, i definitely select your explanation for that topic.
@vgaurav3011
@vgaurav3011 4 жыл бұрын
Finally completed your statistics playlist and can definitely say learned much more than other online courses
@amanpatra8092
@amanpatra8092 2 жыл бұрын
hello , i want to learn statistics for data science i don't have prior knowledge, will this cover the basics as i want to start from scratch
@donaldngwira
@donaldngwira 2 жыл бұрын
One of the best teachings on this subject. Thanks Krish
@GuitarreroDaniel
@GuitarreroDaniel 3 жыл бұрын
Incredible explanation, thank you very much!
@vikeshgiri2369
@vikeshgiri2369 3 жыл бұрын
Today only started this playlist and today only completed, it is possible because the way sir❤️ explain is just amazing..❤️ Now I move to next part.
@aashishsahni90
@aashishsahni90 3 жыл бұрын
Great way of teaching...really helpful!! :)
@owoeyebabatope2425
@owoeyebabatope2425 2 жыл бұрын
This short video has helped me understand a great deal of feature engineering. God bless you. I wish to learn more from you. I recommend you do a video on a full data science project and focus more on the thought process. While you also do a soft touch on various alternatives to whatever method you have used. This is Great!
@user-uz5ld4oi6r
@user-uz5ld4oi6r 2 жыл бұрын
Clear message, clear structure, easy to understand, thank you
@anshukaurav2896
@anshukaurav2896 7 ай бұрын
Awesome sir, you are explaining very easy way .
@hasibullahaman50
@hasibullahaman50 2 жыл бұрын
Thanks to you Chanel... it's so helpful for my UNI Lesson
@vagheeshmk3156
@vagheeshmk3156 9 ай бұрын
Krish. You ARE the Guru of DataScience for aspirants stuck in the Dark...... #KingKrish
@ahmeddhiael-euch8105
@ahmeddhiael-euch8105 2 жыл бұрын
Very informative and helpful, thanks a lot Krish
@Kim-bn4ub
@Kim-bn4ub 3 жыл бұрын
HI, can you please add the github link in the description? the github address is missing.
@mapytekh
@mapytekh 4 жыл бұрын
Great!...very good explanation...plz keep posting...thanks
@KrishnaChaitanyakosaraju
@KrishnaChaitanyakosaraju Жыл бұрын
In linear regression, one common assumption is that all the features have 0 mean same variance. Which is similar to standardization. Hence it works.
@georgekokkinakis7288
@georgekokkinakis7288 6 ай бұрын
Have to say that your presentations really stand out , basically because of the distilled informations and to the point suggestions you make. One question though. At some point you talk about CNNs and that we have to use MinMax scaler. I am using CNN but on non image data, basically I see my data as an image of point values. Should I go with MinMax scaler or I could also use Standard scaler? And in order to be more specific lets say that I have an image of 7x7 where I want to keep the relative value differences between a pixel and its neighbours. Which scaling should I use in your opinion? Can we use standarization on the dataset in order to train a CNN or the values should be in [0,1] so we have to use minmax scaler. I am really interested to hear your opinion based on you experience.
@varshapatil
@varshapatil 4 жыл бұрын
Great explanation. Very well conveyed with proper examples
@spiderman8340
@spiderman8340 4 ай бұрын
KZfaq hid your Videos from My Feed Bro!!! Thanks for the Explaination!!!
@momaalim3086
@momaalim3086 4 жыл бұрын
Brilliant explanation. Thank you sir!
@AbcAbc-kx3xm
@AbcAbc-kx3xm 3 жыл бұрын
So clear explanation, thanks Krish
@nothing8919
@nothing8919 3 жыл бұрын
Now I completed the statistics playlist and finally can move to the third part
@matinpathan5186
@matinpathan5186 3 жыл бұрын
Me too moving for the next Playlist that is Feature Engineering... Good Luck
@nothing8919
@nothing8919 3 жыл бұрын
@@matinpathan5186 Good luck to you too
@josealjndro
@josealjndro 4 жыл бұрын
In most of the cases I reproduce this kind of videos at 1.25x velocity, this one 0.75x haha nice videos Krish!
@visualvalidator5384
@visualvalidator5384 2 жыл бұрын
Commenting exactly on the same date, such a coincidence though, Thank you Krish for this video!
@GaviniLok
@GaviniLok Жыл бұрын
Can you send the Github link for this code
@nonamenoname1942
@nonamenoname1942 3 жыл бұрын
Thank you! Perfectly explained.
@vishalk8905
@vishalk8905 2 жыл бұрын
Normalization is good to use when you know that the distribution of your data does not follow a Gaussian distribution. Standardization, on the other hand, can be helpful in cases where the data follows a Gaussian distribution. However, this does not have to be necessarily true. Also, unlike normalization, standardization does not have a bounding range. So, even if you have outliers in your data, they will not be affected by standardization.
@mujeebrahman5282
@mujeebrahman5282 4 жыл бұрын
Normalization has a problem with outliers whereas standardization eradicates the impact of outliers
@crackthecode1372
@crackthecode1372 4 жыл бұрын
can u please explain ur point?
@murmupk
@murmupk 3 жыл бұрын
Sir, you should also mention the dataset link in the description. This will help us to follow you.
@lolwhatyesme651
@lolwhatyesme651 2 жыл бұрын
You're a great teacher. Thank you.
@ashimmaity64
@ashimmaity64 4 жыл бұрын
sir, please make video on difference between GD,SGD,SGD (mini batch),SGD with momentum.
@keerthanpu808
@keerthanpu808 5 ай бұрын
super guru! U made it a cake walk
@aditisrivastava7079
@aditisrivastava7079 4 жыл бұрын
Thanks for your suggestion 🙏
@bonikapp6504
@bonikapp6504 3 жыл бұрын
I'm always humble and greatfull to you.... your teaching is wonderful ❤️..And your selfless attitude and practical approach is truly amazing ...❤️❤️❤️🙏 I just stuck to your Chanel Thank you so so much...🙏
@rasha9462
@rasha9462 2 жыл бұрын
wow! great explanation .. Thank you 🙏
@ashabhumza3394
@ashabhumza3394 2 жыл бұрын
I had been watching all your previous statistics videos and understood each concept well. Since I am not from mathematics background, In this video I couldn't understand what you explained in the part while telling what process to use when. Will this be a matter to bother in my data science learning journey?
@najmeh5707
@najmeh5707 4 жыл бұрын
Thanks for the intuitive explaining.
@ShahnawazKhan-xy1ll
@ShahnawazKhan-xy1ll Жыл бұрын
Great Job very well explained
@alexeivodopianov5440
@alexeivodopianov5440 2 жыл бұрын
Absolutely excellent explanation
@fasttimeboy
@fasttimeboy 3 жыл бұрын
You have to put more emphasis when you are explaining the property of standard normalization. It transform values between -3 to +3. The property 0 mean and standard deviation 1 means the 68 % of data points fall below with 1 std deviation, 95 % of data points fall below 2 std deviation and 99.7 % data points fall below 3 std deviation.
@TheMaverickanupam
@TheMaverickanupam 3 жыл бұрын
In Standardization, values can theoretically go outside -3 and + 3 also. They will lie in .15% of values in each tail. Basically, the outliers.
@user-es3wr6uf2l
@user-es3wr6uf2l Жыл бұрын
Thank you. Great explanation.
@shindepratibha31
@shindepratibha31 4 жыл бұрын
Thank you for the video. It was useful. Can you please provide the github link?
@igechasamuel3573
@igechasamuel3573 Жыл бұрын
This channel is a goldmine
@chinmaybhat9636
@chinmaybhat9636 4 жыл бұрын
@Krish Naik Sir Github Link is not there in the description link, for the Jupyter Notebook shown in this video, Can you Share the Same ?? Thanks & Regards, CHINMAY N BHAT
@YourDailyBits
@YourDailyBits 4 жыл бұрын
9:20 when to use Std and Nrm
@marcoreiter2795
@marcoreiter2795 4 жыл бұрын
Thank you for your video Krish, it was really helpful!
@idrissjairi
@idrissjairi 2 жыл бұрын
Great Explanation, Thank you!
@sheelstera
@sheelstera Жыл бұрын
it needs to be mentioned that Standardization *DOES NOT* transform a non-normally distributed column into a normally distributed column although the mean becomes 0 and SD becomes 1.
@nirmalpatil5370
@nirmalpatil5370 2 жыл бұрын
Thank you so much !!!
@purnimaps9819
@purnimaps9819 Жыл бұрын
Very informative thank u
@uchchwasdas2675
@uchchwasdas2675 4 жыл бұрын
best explanation, keep it up
@lokeshgaikwad4337
@lokeshgaikwad4337 2 ай бұрын
thanks for the great explanation sir .the link of channel you are talking about is not working pls help with that
@kamaladey2442
@kamaladey2442 2 жыл бұрын
Thanks Sir for sharing all wonderful videos, kindly provide the github link to download the dataset ,not getting from description box
@akshaykrishnan7985
@akshaykrishnan7985 4 жыл бұрын
Good evening sir.. I have a doubt.. When you say collecting data, how exactly is it done? Is it done through market research methods or any other methods are there for collecting it? Please do elaborate..
@RitwikDandriyal
@RitwikDandriyal 4 жыл бұрын
Hey. Data collection or Data Acquisition is one of the very first steps in designing a data science pipeline. Often Data Scientists working in tech companies get easy access to this data but if you want to collect data for yourself (on a small scale) then stuff like web scraping comes into play. Web scraping is nothing but collecting data from a website. Web Scraping is a popular means of collecting data. Often companies survey people to collect data. It can be an online survey or an offline one. The means of collecting data are endless.
@akshaykrishnan7985
@akshaykrishnan7985 4 жыл бұрын
@@RitwikDandriyal thank you for the reply.. 😊
@manojkumarpalaparti397
@manojkumarpalaparti397 4 жыл бұрын
Bro video about difference between fit, transform and fit_transform !!
@utkarshpandya3155
@utkarshpandya3155 2 жыл бұрын
Hi Krishna.You're saviour.Apologies in advance if it is already asked question.(please advise if you have already answered and will find out the video). 1.Do you have any usecase where you do standardisation (with mean & std ) followed by min-max normalization so that you can compare same scalled features and then fit them into 0 to 1 or let's say 0 to 100 or -50 to +50 etc ? 2. any pros and cons of standardisation followed by min-max normalization ? 3. am i missing any logic by asking ? is there any solution for a scanario where you have more than 5 + features and user want it to scale in a single number so that instead of viewing the movement or change of 5 features,you will only focus on final score by means of min-max norm....hope it's clear out my question looking forward to see your answer.Regards & TIA + Thanks for this video.
@user-qz1hd4xp1p
@user-qz1hd4xp1p 4 жыл бұрын
thank you so much you are the man !!
@AugustNocturne
@AugustNocturne 3 жыл бұрын
Thank you this was very helpful.
@AlbertRyanstein
@AlbertRyanstein 3 жыл бұрын
Hi, I really enjoyed the video. I was wondering is this the same as normalisation on keras.
@saltanatkhalyk3397
@saltanatkhalyk3397 3 жыл бұрын
always the best explanation!
@username3543
@username3543 4 жыл бұрын
krish bro, you forgot to put the git hub link in the description.
@sandipansarkar9211
@sandipansarkar9211 4 жыл бұрын
yes i was also searching for the GitHub link .Why is it missing??
@merveozdas1193
@merveozdas1193 2 жыл бұрын
You need to tell about all lessons, I like your way of telling :) but I couldn't understand just eucledian distance.
@belimmohsin
@belimmohsin 3 жыл бұрын
Thank you sir..nice explanation :)
@ahmedelsabagh6990
@ahmedelsabagh6990 2 жыл бұрын
Great explanation
@bhabanisankardash7387
@bhabanisankardash7387 3 жыл бұрын
Nice and useful information 👍
@durgasthan
@durgasthan 4 жыл бұрын
I do not think for linear regression you need to do the normalization or standardization. because OLS will take of that.
@krishnaik06
@krishnaik06 4 жыл бұрын
Yes u r right OLS takes care of it
@adhipathis12
@adhipathis12 3 жыл бұрын
Thanks a lot Krish :)
@sayantansinha4545
@sayantansinha4545 4 жыл бұрын
Thanks for the awesome explanation
@andresherrera4023
@andresherrera4023 4 жыл бұрын
Great explanation !! Thanks
@MB-sh9ur
@MB-sh9ur 6 ай бұрын
this man is the god of data science
@latabisht3591
@latabisht3591 3 жыл бұрын
Great explanation Sir
@slappy703
@slappy703 3 ай бұрын
when i click the channel link which you provided in description, it shows the channel doesn't exist
@tosinlitics949
@tosinlitics949 3 жыл бұрын
Excellent explanation!
@jedidiaholadele2086
@jedidiaholadele2086 2 жыл бұрын
Your videos are 🔥🔥
@maximilianovazquez59
@maximilianovazquez59 3 жыл бұрын
I love you mate, thx Cheers!
@shadiyapp5552
@shadiyapp5552 Жыл бұрын
Thank you sir ♥️
@techstackgochannel
@techstackgochannel 3 жыл бұрын
In Logistic Regression Normalization is not required as there only decision boundary need to be created
@Pierluigi-ns4ms
@Pierluigi-ns4ms 2 жыл бұрын
The information related to the title of the video (Standardization Vs Normalization) starts at 11:00 out of this 13 minutes of video. Also, at that part, you provided very weak explanations; there is no a true cause effect relationship; I kept asking "Ok, but why?"
@csprusty
@csprusty 4 жыл бұрын
if mu = 0 and sd =1, then will z not be equal to x? i am confused how values appear as -.0125 etc
@illiagerasimenko4793
@illiagerasimenko4793 4 жыл бұрын
Had the same question. Turned out he made a mistake, StandardScaler calculates mu and sd for the dataset you fit into it. Next time have similar question - go right to the documentation. StandardScaler - scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html.
@princemensah1866
@princemensah1866 3 жыл бұрын
Great explanation!
@abhinavmahajan448
@abhinavmahajan448 3 жыл бұрын
Paji tusi great ho :)
@louerleseigneur4532
@louerleseigneur4532 3 жыл бұрын
Thanks Krish
@danielagbaniyaka6196
@danielagbaniyaka6196 3 жыл бұрын
Clean explanation Thanks
@nutsom
@nutsom 2 жыл бұрын
@krish - i didn't quite understand when to use Normalisation and when to use Standard Scaler. Can u share with an example why standard scaler was used and another one where normalizer or min max scaler was used, and why.
@sejalchandra2114
@sejalchandra2114 4 жыл бұрын
Sir as you said in case of bagging boosting etc we need not to use scaling, but what if we have many one hot encoded columns in the dataset?
@bonalareddy5339
@bonalareddy5339 4 жыл бұрын
Hi Sejal, the topic you are asking will not fall under feature scaling. As you are using the one hot encoding it will result in large no.of columns, instead of using one hot encoding you can use some other encoding technique such as feature encoding,target encoding,binary encoding,k-fold target encoding. In these encoding techniques every one has its own pros and cons, you have to compromise in some of the things and you have to do the modelling
@sejalchandra2114
@sejalchandra2114 4 жыл бұрын
@@bonalareddy5339 okay so you mean instead of feature encoding i apply other encoding techniques and then scale the dataset?
@bonalareddy5339
@bonalareddy5339 4 жыл бұрын
@@sejalchandra2114, every feature not needed to be scaled. The feature that are to be scaled are the numerical one's which doesn't fall in the same range and same unit, as categorical features are converted into some meaningful numerical feature with the help encoding techniques and those numerical formats(in general) doesn't affect the model that much. For suppose there are 2 numerical features f1, f2. f1 has a range btw 10 to 1000 and f2 has a range btw 50 to 300 in this case the features are needed to be scaled as those both features are different and the units f1, f2 are talking might be different. f1 might be talking weight in kgs, f2 might be talking about height in cms, if you are using anyone of the modelling algorithm such as linear regression, logistic regression, knn, k means, cnn, ann(as mentioned in the video)you are supposed to do the feature scaling. If you are using some other techniques such as decision trees, random forest or anyother bagging/boosting techniques there is no point in scaling the features as it doesn't affect the performance of the algorithm(in case of tuning or faster output, also mentioned in the video). If you feel that for dealing with the categorical variables one hot encoding is producing a large no.of columns you can use some other encoding (that's what i meant in my previous reply). I hope this helps.
@sejalchandra2114
@sejalchandra2114 4 жыл бұрын
@@bonalareddy5339 okay thanks I got the point. Really appreciate your detailed response.
@Sarasara-dg8gb
@Sarasara-dg8gb 3 жыл бұрын
What is the most adequate way of features scaling for ANFIS algorithm, normalization or standardization?
@KausthabDutta
@KausthabDutta 3 жыл бұрын
Here is the dataset raw.githubusercontent.com/rasbt/pattern_classification/master/data/wine_data.csv
@bhavanasaraswat7808
@bhavanasaraswat7808 3 жыл бұрын
Thanks sir for explanation
What Is P Value In Statistics In Simple Language?
11:18
Krish Naik
Рет қаралды 296 М.
Normalization Vs. Standardization (Feature Scaling in Machine Learning)
19:48
7 Days Stranded In A Cave
17:59
MrBeast
Рет қаралды 97 МЛН
Glow Stick Secret Pt.4 😱 #shorts
00:35
Mr DegrEE
Рет қаралды 11 МЛН
What will he say ? 😱 #smarthome #cleaning #homecleaning #gadgets
01:00
Zombie Boy Saved My Life 💚
00:29
Alan Chikin Chow
Рет қаралды 29 МЛН
Standardization vs Normalization Clearly Explained!
5:48
Normalized Nerd
Рет қаралды 133 М.
Stanford's FREE data science book and course are the best yet
4:52
Python Programmer
Рет қаралды 693 М.
Feature Scaling  (How it really works?) Explained !!
6:57
Priyanshu Vats
Рет қаралды 12 М.
Batch normalization | What it is and how to implement it
13:51
AssemblyAI
Рет қаралды 59 М.
Normalization & Standardization
15:36
DataR Labs
Рет қаралды 9 М.
Python Feature Scaling in SciKit-Learn (Normalization vs Standardization)
11:59
Ryan & Matt Data Science
Рет қаралды 9 М.
Why Do We Need to Perform Feature Scaling?
8:01
Krish Naik
Рет қаралды 134 М.
7 Days Stranded In A Cave
17:59
MrBeast
Рет қаралды 97 МЛН