Peter Norvig - The Unreasonable Effectiveness of Data

Рет қаралды 70,111

12 жыл бұрын

How Billions of Trivial Data Points can Lead to Understanding
Peter Norvig (Director of Research, Google) presents as part of the UBC Department of Computer Science's Distinguished Lecture Series, September 23, 2010.
In decades past, models of human language were wrought from the sweat and pencils of linguists. In the modern day, it is more common to think of language modeling as an exercise in probabilistic inference from data: we observe how words and combinations of words are used, and from that build computer models of what the phrases mean. This approach is hopeless with a small amount of data, but somewhere in the range of millions or billions of examples, we pass a threshold, and the hopeless suddenly becomes effective, and computer models sometimes meet or exceed human performance. This talk gives examples of the data available in large repositories of text, images, and videos, and shows some tasks that can be accomplished with the resulting models.

Пікірлер: 33

@KaplaBen 4 жыл бұрын

0:11 "you'll have to make do with me". Love the humility and self-deprecating humor

@user-kj8qy6ek6n 5 ай бұрын

came across this in my junior year in college, was really inspired and I actually decided to do a master in machine learning because of this. Thank you Mr.Norvig!

@pablo_brianese 3 жыл бұрын

Beautiful work!

@saifuddinraja 2 жыл бұрын

this is so underrated , needs more views

@gabrieltenenbaum 8 жыл бұрын

It's funny because the video is amazing, there are great insights about data science and all, however, if one turns on the automatic subtitles, it has mostly nothing to do with what he's actually saying.

@datalicious43 7 жыл бұрын

Caused they trained the base model using neutral accent , not American . :):)

@pleabargain 11 жыл бұрын

15:27 predicting the present... very powerful

@rangjungyeshe 12 жыл бұрын

Very interesting insights in how comp sci is now using inference from data to solve problems previously tackled by rules. Maybe this is closer to how humans learn languages....it sure isn't by learning all that grammar...

@pablo_brianese 3 жыл бұрын

39:45 You can bet Borges understood exponential growth, given he knew enough mathematics to understand the theory of transfinite numbers.

@pleabargain 11 жыл бұрын

28:30 the spelling corrector function described

@pleabargain 11 жыл бұрын

25:50 a bit of levity with word play

@JGunlimited 9 жыл бұрын

o_0! Norvig is amazing!

@pleabargain 11 жыл бұрын

19:07 define the function that will read smashed words

@pleabargain 11 жыл бұрын

26:45 harder problems spelling corrections

@pleabargain 11 жыл бұрын

32:30 the python code for his spell correction app

@pleabargain 11 жыл бұрын

12:42 how to do orbital mechanics just using text!

@pleabargain 11 жыл бұрын

21:29 the code for his parser. It looks like Python but not commented correctly.

@pleabargain 11 жыл бұрын

49:00 questions from the audience

@AnimeshSharma1977 9 жыл бұрын

brain behind the Google translate ;)

@57v60n85t 12 жыл бұрын

Although it is an enlightening talk, the graph at 52:00 is slightly misleading. If you set the lower limit of y-axis to zero, you will see what I mean.

@DominickGuzzo 6 жыл бұрын

Aren't you assuming that there's little value in the difference of ~48% vs ~53% accuracy? Depending upon the scenario, incremental improvement like that can be quite valuable. Imagine a similar chart for, say, website uptime: What would it look like with 99.9% plotted against 99.999999% when the y-axis starts at zero? Is that a useful visual?

@pleabargain 11 жыл бұрын

4:15 expert system approach to learning

@pleabargain 11 жыл бұрын

10:20 Text models

@pleabargain 11 жыл бұрын

11:22 Google's corpus of N grams... 13M+ unique words...

@pleabargain 11 жыл бұрын

6:50 first problem is scene completion

@pleabargain 11 жыл бұрын

4:44 Statistical machine learning Judea Pearl see his wikipedia page

@pleabargain 11 жыл бұрын

5:03 Image models and then text models. Image models first.