Рет қаралды 2,365
This tutorial explains the basics behind different quantization approaches explaining the math and the intuitions. Explains how the mapping is done from float32 precision to int8 precision.
Link to the Slides : drive.google.com/file/d/10kzMLVAVntXAr-lbOXc4RmGmX9iqjjIZ/view?usp=sharing
----------------------------------------------------------------------------------------------------------------
Reference materials for further reading.
A White Paper on Neural Network Quantization - arxiv.org/abs/2106.08295
Introduction to Quantization on PyTorch - pytorch.org/blog/introduction-to-quantization-on-pytorch/
Nvidia docs on Quantisation Basics - docs.nvidia.com/deeplearning/tensorrt/tensorflow-quantization-toolkit/docs/docs/intro_to_quantization.html
----------------------------------------------------------------------------------------------------------------
BGM Credits
🔻
Song: "Sappheiros - Falling (Ft. eSoreni) [Chill]" is under a Creative Commons license (CC-BY)
Music promoted by BreakingCopyright: bit.ly/Sappheiros-Falling
🔺