Peter Norvig - The Unreasonable Effectiveness of Data

  Рет қаралды 70,111

UBC Computer Science

UBC Computer Science

12 жыл бұрын

How Billions of Trivial Data Points can Lead to Understanding
Peter Norvig (Director of Research, Google) presents as part of the UBC Department of Computer Science's Distinguished Lecture Series, September 23, 2010.
In decades past, models of human language were wrought from the sweat and pencils of linguists. In the modern day, it is more common to think of language modeling as an exercise in probabilistic inference from data: we observe how words and combinations of words are used, and from that build computer models of what the phrases mean. This approach is hopeless with a small amount of data, but somewhere in the range of millions or billions of examples, we pass a threshold, and the hopeless suddenly becomes effective, and computer models sometimes meet or exceed human performance. This talk gives examples of the data available in large repositories of text, images, and videos, and shows some tasks that can be accomplished with the resulting models.

Пікірлер: 33
@KaplaBen
@KaplaBen 4 жыл бұрын
0:11 "you'll have to make do with me". Love the humility and self-deprecating humor
@user-kj8qy6ek6n
@user-kj8qy6ek6n 5 ай бұрын
came across this in my junior year in college, was really inspired and I actually decided to do a master in machine learning because of this. Thank you Mr.Norvig!
@pablo_brianese
@pablo_brianese 3 жыл бұрын
Beautiful work!
@saifuddinraja
@saifuddinraja 2 жыл бұрын
this is so underrated , needs more views
@gabrieltenenbaum
@gabrieltenenbaum 8 жыл бұрын
It's funny because the video is amazing, there are great insights about data science and all, however, if one turns on the automatic subtitles, it has mostly nothing to do with what he's actually saying.
@datalicious43
@datalicious43 7 жыл бұрын
Caused they trained the base model using neutral accent , not American . :):)
@pleabargain
@pleabargain 11 жыл бұрын
15:27 predicting the present... very powerful
@rangjungyeshe
@rangjungyeshe 12 жыл бұрын
Very interesting insights in how comp sci is now using inference from data to solve problems previously tackled by rules. Maybe this is closer to how humans learn languages....it sure isn't by learning all that grammar...
@pablo_brianese
@pablo_brianese 3 жыл бұрын
39:45 You can bet Borges understood exponential growth, given he knew enough mathematics to understand the theory of transfinite numbers.
@pleabargain
@pleabargain 11 жыл бұрын
28:30 the spelling corrector function described
@pleabargain
@pleabargain 11 жыл бұрын
25:50 a bit of levity with word play
@JGunlimited
@JGunlimited 9 жыл бұрын
o_0! Norvig is amazing!
@pleabargain
@pleabargain 11 жыл бұрын
19:07 define the function that will read smashed words
@pleabargain
@pleabargain 11 жыл бұрын
26:45 harder problems spelling corrections
@pleabargain
@pleabargain 11 жыл бұрын
32:30 the python code for his spell correction app
@pleabargain
@pleabargain 11 жыл бұрын
12:42 how to do orbital mechanics just using text!
@pleabargain
@pleabargain 11 жыл бұрын
21:29 the code for his parser. It looks like Python but not commented correctly.
@pleabargain
@pleabargain 11 жыл бұрын
49:00 questions from the audience
@AnimeshSharma1977
@AnimeshSharma1977 9 жыл бұрын
brain behind the Google translate ;)
@57v60n85t
@57v60n85t 12 жыл бұрын
Although it is an enlightening talk, the graph at 52:00 is slightly misleading. If you set the lower limit of y-axis to zero, you will see what I mean.
@DominickGuzzo
@DominickGuzzo 6 жыл бұрын
Aren't you assuming that there's little value in the difference of ~48% vs ~53% accuracy? Depending upon the scenario, incremental improvement like that can be quite valuable. Imagine a similar chart for, say, website uptime: What would it look like with 99.9% plotted against 99.999999% when the y-axis starts at zero? Is that a useful visual?
@pleabargain
@pleabargain 11 жыл бұрын
4:15 expert system approach to learning
@pleabargain
@pleabargain 11 жыл бұрын
10:20 Text models
@pleabargain
@pleabargain 11 жыл бұрын
11:22 Google's corpus of N grams... 13M+ unique words...
@pleabargain
@pleabargain 11 жыл бұрын
6:50 first problem is scene completion
@pleabargain
@pleabargain 11 жыл бұрын
4:44 Statistical machine learning Judea Pearl see his wikipedia page
@pleabargain
@pleabargain 11 жыл бұрын
5:03 Image models and then text models. Image models first.
@pleabargain
@pleabargain 11 жыл бұрын
10:36 props to Frederick Jelinek see wikipedia.
@thinley108
@thinley108 12 жыл бұрын
part starting 24:50 is very funny
@pleabargain
@pleabargain 11 жыл бұрын
8:05 Old CS school method of solving problems...
@pleabargain
@pleabargain 11 жыл бұрын
18:13 let's read Chinese!
"The Mess We're In" by Joe Armstrong
45:50
Strange Loop Conference
Рет қаралды 379 М.
Jan Vondrak - Submodular Functions and Their Applications
1:18:53
UBC Computer Science
Рет қаралды 8 М.
39kgのガリガリが踊る絵文字ダンス/39kg boney emoji dance#dance #ダンス #にんげんっていいな
00:16
💀Skeleton Ninja🥷【にんげんっていいなチャンネル】
Рет қаралды 8 МЛН
Inside Out Babies (Inside Out Animation)
00:21
FASH
Рет қаралды 16 МЛН
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 816 М.
Generative AI And Programming, Peter Norvig, Director of Research, Google
15:44
Machine Learning:  The Basics, with Ron Bekkerman
56:34
LinkedInTechTalks
Рет қаралды 167 М.
Edward Witten - Why the ‘Unreasonable Effectiveness’ of Mathematics
7:32
Bret Victor - Inventing on Principle
54:20
Rui Oliveira
Рет қаралды 292 М.
"Lawrence Krauss - Life, the Universe, and Nothing: A Cosmic Mystery Story "
1:17:35
The University of British Columbia
Рет қаралды 163 М.
Colin Ware - Visual Thinking Algorithms
1:17:33
UBC Computer Science
Рет қаралды 2,9 М.
39kgのガリガリが踊る絵文字ダンス/39kg boney emoji dance#dance #ダンス #にんげんっていいな
00:16
💀Skeleton Ninja🥷【にんげんっていいなチャンネル】
Рет қаралды 8 МЛН