Lecture21 (Data2Decision) Leverage in Regression

  Рет қаралды 13,222

Chris Mack

Chris Mack

8 жыл бұрын

Leverage, the hat matrix, internally and externally studentized residuals, the Williams graph.
Course Website: www.lithoguru.com/scientist/st...

Пікірлер: 14
@looollol7910
@looollol7910 5 жыл бұрын
Thank you so much!!! you are very clear and helpful!!
@zikviewsdotcom5827
@zikviewsdotcom5827 5 жыл бұрын
Count Dooku is the best statistics teacher
@massimo8740
@massimo8740 3 жыл бұрын
Thanks!!!
@krishnaiyer2556
@krishnaiyer2556 2 жыл бұрын
sir difference between multicollinearity and leverage vs perfect collinearity in x variables?
@krishnaiyer2556
@krishnaiyer2556 2 жыл бұрын
defining leverage as distance between x's, but formula says cov (that,actual)/ var(y) why so?
@krishnaiyer2556
@krishnaiyer2556 2 жыл бұрын
cov can be negative, so do we take absolute values?
@empaulstube6947
@empaulstube6947 4 жыл бұрын
What is basically the difference between Standardized and Studentized residuals?
@ChrisMack
@ChrisMack 3 жыл бұрын
See slides 9 and 10: "standardized" is the same as "internally studentized", which is different from "externally studentized".
@muonneutrino
@muonneutrino 4 жыл бұрын
Great lectures! Very clear. Too bad I haven't learned outlier detection in regression models, although studied B.Sc. in computer engineering. I have some questions, hope you can answer them. 1) Why residuals should be normally distributed? 2) In Williams Graph, why do we use 2 means as a threshold? I would expect to see a multiply of stdev(lev * n/p). I watched the following lecture and I saw you calculated Cook's Distance as well, but you didn't use it for filtering outliers, or I missed it? Thank you so much for this quality content!
@muonneutrino
@muonneutrino 4 жыл бұрын
Oh, sorry you have a dedicated lecture about residuals distribution. So it's pretty much empirical, as I understood it.
@chrismack783
@chrismack783 4 жыл бұрын
1) residuals are often non-normally distributed, but sometime they are normal. You should always check if the assumption of normality makes a difference in your statistical analysis. 2) The choice of twice the average leverage as a threshold is arbitrary, but a convenient rule of thumb.
@muonneutrino
@muonneutrino 4 жыл бұрын
Chris Mack thank you Chris! Just now I saw that you have experience in semiconductors industry :) what a coincidence! I’m analyzing correlation in CD measured on wafer in different fields. It looks like CD distribution is normal. At least sometimes Jacque-Bera test confirms it, sometimes not. Sometimes Shapiro-Wilk confirms it sometimes not. Thank you so much for your great lectures! They are very helpful!
@chrismack783
@chrismack783 4 жыл бұрын
@@muonneutrino Good luck - I've worked a lot in mapping CD across the wafer.
@muonneutrino
@muonneutrino 4 жыл бұрын
Chris Mack very interesting. While there are many factors contributing to CD, you can tune the mask to compensate for them, or at least most of them. From your experience there is good correlation between different fields if CD is measured on the same locations?
Lecture22 (Data2Decision) Influence in Regression
20:07
Chris Mack
Рет қаралды 8 М.
Lecture56 (Data2Decision) Robust Regression
21:52
Chris Mack
Рет қаралды 22 М.
Nastya and SeanDoesMagic
00:16
Nastya
Рет қаралды 42 МЛН
Statistics 101: Linear Regression, Residual Analysis
19:56
Brandon Foltz
Рет қаралды 95 М.
Lecture26 (Data2Decision) Correcting for Heteroscedasticity
16:10
Introduction to the Hat matrix in regression
7:24
Phil Chan
Рет қаралды 31 М.
Lecture53 (Data2Decision) Principal Component Analysis
25:18
Chris Mack
Рет қаралды 5 М.
Lecture5 (Data2Decision) Regression part 1
26:35
Chris Mack
Рет қаралды 6 М.
Lecture74 (Data2Decision) Bayesian Regression, part 1
25:45
Chris Mack
Рет қаралды 8 М.
Finite Mixture Regression With R
10:51
Regorz Statistik
Рет қаралды 611
Leverage and Cook's distance
26:39
Edward Malthouse
Рет қаралды 3,2 М.