tinyML Talks: A Practical Guide to Neural Network Quantization

  Рет қаралды 23,554

The tinyML Foundation

The tinyML Foundation

2 жыл бұрын

"A Practical Guide to Neural Network Quantization"
Marios Fournarakis
Deep Learning Researcher
Qualcomm AI Research, Amsterdam
Neural network quantization is an effective way of reducing the power requirements and latency of neural network inference while maintaining high accuracy. The success of quantization has led to a large volume of literature and competing methods in recent years, and Qualcomm has been at the forefront of this research. This talk aims to cut through the noise and introduce a practical guide for quantizing neural networks inspired by our research and expertise at Qualcomm. We will begin with an introduction to quantization and fixed-point accelerators for neural network inference. We will then consider implementation pipelines for quantizing neural networks with near floating-point accuracy for popular neural networks and benchmarks. Finally, you will leave this talk with a set of diagnostic and debugging tools to address common neural network quantization issues.
You can find more information about the theory and algorithms we will discuss in this talk in our White Paper on Neural Network Quantization at the following arXiv link: arxiv.org/abs/2106.08295

Пікірлер
MIT Introduction to Deep Learning | 6.S191
1:09:58
Alexander Amini
Рет қаралды 324 М.
Quantization vs Pruning vs Distillation: Optimizing NNs for Inference
19:46
THEY WANTED TO TAKE ALL HIS GOODIES 🍫🥤🍟😂
00:17
OKUNJATA
Рет қаралды 13 МЛН
Khóa ly biệt
01:00
Đào Nguyễn Ánh - Hữu Hưng
Рет қаралды 21 МЛН
ROCK PAPER SCISSOR! (55 MLN SUBS!) feat @PANDAGIRLOFFICIAL #shorts
00:31
ОСКАР ИСПОРТИЛ ДЖОНИ ЖИЗНЬ 😢 @lenta_com
01:01
tinyML Talks: Constrained Object Detection on Microcontrollers with FOMO
47:22
The tinyML Foundation
Рет қаралды 10 М.
But what is a neural network? | Chapter 1, Deep learning
18:40
3Blue1Brown
Рет қаралды 16 МЛН
Model Quantization for Edge Devices with AIMET
42:25
The TWIML AI Podcast with Sam Charrington
Рет қаралды 1,3 М.
Quantization of Neural Networks [in Russian]
1:09:49
BayesGroup.ru
Рет қаралды 1,4 М.
ML Was Hard Until I Learned These 5 Secrets!
13:11
Boris Meinardus
Рет қаралды 224 М.
Quantization in Deep Learning (LLMs)
13:04
AI Bites
Рет қаралды 4,3 М.
Tom Goldstein: "What do neural loss surfaces look like?"
50:26
Institute for Pure & Applied Mathematics (IPAM)
Рет қаралды 17 М.
Что не так с яблоком Apple? #apple #macbook
0:38
Не шарю!
Рет қаралды 345 М.
Игровой Комп с Авито за 4500р
1:00
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 1,7 МЛН