Beyond Empirical Risk Minimization: the lessons of deep learning

  Рет қаралды 6,410

MITCBMM

MITCBMM

Күн бұрын

Mikhail Belkin, Professor, The Ohio State University - Department of Computer Science and Engineering, Department of Statistics, Center for Cognitive Science
Abstract: "A model with zero training error is overfit to the training data and will typically generalize poorly" goes statistical textbook wisdom. Yet, in modern practice, over-parametrized deep networks with near perfect fit on training data still show excellent test performance. This apparent contradiction points to troubling cracks in the conceptual foundations of machine learning. While classical analyses of Empirical Risk Minimization rely on balancing the complexity of predictors with training error, modern models are best described by interpolation. In that paradigm a predictor is chosen by minimizing (explicitly or implicitly) a norm corresponding to a certain inductive bias over a space of functions that fit the training data exactly. I will discuss the nature of the challenge to our understanding of machine learning and point the way forward to first analyses that account for the empirically observed phenomena. Furthermore, I will show how classical and modern models can be unified within a single "double descent" risk curve, which subsumes the classical U-shaped bias-variance trade-off.
Finally, as an example of a particularly interesting inductive bias, I will show evidence that deep over-parametrized auto-encoders networks, trained with SGD, implement a form of associative memory with training examples as attractor states.

Пікірлер: 4
@JasminUwU
@JasminUwU 2 жыл бұрын
Very high quality talk, thank you very much!
@egorpanfilov
@egorpanfilov 3 жыл бұрын
Great talk! Thank you Mikhail for wonderful explanations!
@straumits
@straumits 2 жыл бұрын
Fascinating!
@selfhelp119
@selfhelp119 3 жыл бұрын
Good presentation! thank you for sharing.
бесит старшая сестра!? #роблокс #анимация #мем
00:58
КРУТОЙ ПАПА на
Рет қаралды 3,3 МЛН
когда повзрослела // EVA mash
00:40
EVA mash
Рет қаралды 3,3 МЛН
Haha😂 Power💪 #trending #funny #viral #shorts
00:18
Reaction Station TV
Рет қаралды 15 МЛН
From Classical Statistics to Modern Machine Learning
49:47
Simons Institute
Рет қаралды 7 М.
Reconciling modern machine learning and the bias-variance trade-off
18:54
Empirical Risk Minimization
12:27
Alexander Jung
Рет қаралды 23 М.
From Classical Statistics to Modern ML: the Lessons of Deep Learning - Mikhail Belkin
37:29
Deep Networks Are Kernel Machines (Paper Explained)
43:04
Yannic Kilcher
Рет қаралды 59 М.
CBMM10 Panel: Research on Intelligence in the Age of AI
1:27:21
Lecture 3 | Learning, Empirical Risk Minimization, and Optimization
1:18:43
Carnegie Mellon University Deep Learning
Рет қаралды 16 М.
GamePad İle Bisiklet Yönetmek #shorts
0:26
Osman Kabadayı
Рет қаралды 67 М.
Спутниковый телефон #обзор #товары
0:35
Product show
Рет қаралды 1,8 МЛН
iPhone 16 с инновационным аккумулятором
0:45
ÉЖИ АКСЁНОВ
Рет қаралды 737 М.
Will the battery emit smoke if it rotates rapidly?
0:11
Meaningful Cartoons 183
Рет қаралды 38 МЛН