Understanding Cooks Distance to detect influential observations

  Рет қаралды 1,658

Selva Prabhakaran (ML+)

Selva Prabhakaran (ML+)

Күн бұрын

Welcome to the fourteenth video of the series "Build your First Machine Learning Project". In this, we'll see cook's distance for outlier detection.
Notebook Link: github.com/machinelearningplu...
Cook’s distance, is used in Regression Analysis to find influential outliers in a set of predictor variables. In other words, it’s a way to identify points that largely impacts the observation values / predictions.
The measurement is a combination of each observation’s leverage and residual values; the higher the leverage and residuals, the higher the Cook’s distance.
So let's understand it in brief.
Chapters
0:00 Intro to Cooks Distance
1:58 First step
4:15 Second Step
9:22 Implementing in Python
14:58 Conclusion
In order to make the best out of this, please watch this series in the order in playlist: Build Your First ML Model Playlist: • Build Your FIRST Machi...
Checkout Complete Machine Learning Plus Self Paced Online Courses here:
edu.machinelearningplus.com/s...
Join ML+ membership for exclusive Data science content
Previous Lesson:
Why mahalanobis distance is incredibly powerful for outlier detection : • Why mahalanobis distan...
Earlier Lessons:
1. Build your first ML Project: • Build Your FIRST Machi...
2. How to Formulate ML Problem: • Build Your First ML Pr...
3. Setup Python Environment: • Setup Python Environme...
4. Jupyter Notebook Tutorial: • Jupyter Notebook Tutor...
5. What is ML Modeling: • What is ML Modeling? (...
6. Reduce the size of Pandas Dataframe: • Reduce the memory size...
7. What is EDA: • Exploratory Data Analy...
8. How to impute missing Data: • How to handle missing ...
9. Mice Imputation Algorithm: • Multiple Imputation by...
10. How to impute missing data in categorical Variables: • How to impute missing ...
11. How to Detect Outliers with Z Score: • How to Detect Outliers...
Let me know in the comments section if you have any questions!
🤝 Like, Share, Subscribe for more!
Follow us on our social media handles for all updates, events and live sessions-
✅ Instagram: / machinelearningplus
✅ LinkedIn: / machine-learning-plus
✅ KZfaq: / @machinelearningplus
✅ Twitter: / r_programming
✅ Website: www.machinelearningplus.com/
If you enjoyed this video, be sure to throw it a like and make sure to subscribe to not miss any future videos!
Thanks for watching!
#mlmodeling, #python, #machinelearning, #artificialintelligence, #pandas, #datascience

Пікірлер: 4
@kaido453
@kaido453 4 ай бұрын
Amazing video! thank you
@machinelearningplus
@machinelearningplus 4 ай бұрын
Thank you :)
@jamalnuman
@jamalnuman 4 ай бұрын
Great. How the Cook's D can be computed within the GWR?
@machinelearningplus
@machinelearningplus Жыл бұрын
Want to learn more ML? Checkout edu.machinelearningplus.com/s/pages/ds-career-path - Become fundamentally strong in Data Science and ML!
Isolation Forest: A Tree based approach for Outlier Detection (Clearly Explained)
18:02
Llegó al techo 😱
00:37
Juan De Dios Pantoja
Рет қаралды 60 МЛН
A little girl was shy at her first ballet lesson #shorts
00:35
Fabiosa Animated
Рет қаралды 17 МЛН
ОБЯЗАТЕЛЬНО СОВЕРШАЙТЕ ДОБРО!❤❤❤
00:45
ЧУТЬ НЕ УТОНУЛ #shorts
00:27
Паша Осадчий
Рет қаралды 10 МЛН
Leverage and Influential Points in Simple Linear Regression
7:14
jbstatistics
Рет қаралды 116 М.
(Simplified) Linear Mixed Model in R with lme()
26:17
LiquidBrain Bioinformatics
Рет қаралды 66 М.
Isolation Forests: Identify Outliers in Data
4:41
Elder Research
Рет қаралды 11 М.
Learn Statistical Regression in 40 mins! My best video ever. Legit.
40:25
Linear Regression in Python - Full Project for Beginners
50:52
Alejandro AO - Software & Ai
Рет қаралды 15 М.
Understanding Cook's Distance in SPSS
8:48
Dr. Todd Grande
Рет қаралды 61 М.
Time Series Forecasting with XGBoost - Advanced Methods
22:02
Rob Mulla
Рет қаралды 117 М.
Master Qualitative Data Analysis with ChatGPT: An 18-Minute Guide
18:02
Philip Adu, Ph.D
Рет қаралды 73 М.
Llegó al techo 😱
00:37
Juan De Dios Pantoja
Рет қаралды 60 МЛН