
model_int8_convert

source link: https://davidchan0519.github.io/2019/07/25/model-int8-convert/


Posted on 2019-07-25 | Edited on 2019-07-31


Guidelines followed:

For weights, int8 quantization is unsaturated: the FP32 values -|max| and |max| map to -127 and 127, and intermediate values are mapped linearly.

For input (activation) data, quantization is saturated: a threshold |T| is chosen and ±|T| is mapped to ±127, where |T| < |max|; values beyond ±|T| are clipped.

The |T| value differs for each layer's tensor.

The process of determining each layer's |T| is called calibration.
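The two mappings above can be sketched in a few lines of NumPy. This is a minimal illustration, not the post's code; the function names are mine, and per-tensor symmetric scaling is assumed:

```python
import numpy as np

def quantize_weights_unsaturated(w):
    # Unsaturated scheme for weights: map -|max|..|max| linearly onto
    # -127..127, so no value is clipped.
    scale = np.abs(w).max() / 127.0
    q = np.round(w / scale).astype(np.int8)
    return q, scale

def quantize_activations_saturated(x, T):
    # Saturated scheme for activations: choose a threshold T < |max|,
    # map +-T to +-127, and clip anything beyond the threshold.
    scale = T / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale
```

Dequantization in either scheme is just `q * scale`, which is why the saturated scheme can trade a little clipping error at the tails for finer resolution over the bulk of the distribution.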
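The post names calibration but does not say how |T| is chosen. TensorRT's published approach picks the threshold that minimizes the KL divergence between the FP32 activation histogram and its int8-quantized approximation, computed over activations collected on a calibration dataset. The sketch below is an illustrative reimplementation of that idea, not the post's code; all function names and bin counts are assumptions:

```python
import numpy as np

def kl_divergence(p, q):
    # KL(P || Q) over matching histogram bins; bins that are empty in
    # either distribution are skipped (a crude stand-in for smoothing).
    p = p / p.sum()
    q = q / q.sum()
    mask = (p > 0) & (q > 0)
    return np.sum(p[mask] * np.log(p[mask] / q[mask]))

def calibrate_threshold(activations, num_bins=2048, num_quant_bins=128):
    # Histogram of absolute activation values from the calibration data.
    hist, edges = np.histogram(np.abs(activations), bins=num_bins)
    best_T, best_kl = edges[-1], np.inf
    # Try each candidate threshold (a histogram edge) from 128 bins up.
    for i in range(num_quant_bins, num_bins + 1):
        clipped = hist[:i].astype(np.float64)
        p = clipped.copy()
        p[-1] += hist[i:].sum()  # reference folds the clipped tail back in
        # Quantize the clipped histogram down to 128 levels, then expand
        # it back to i bins, spreading each level over its nonzero bins.
        chunks = np.array_split(clipped, num_quant_bins)
        q = np.concatenate(
            [(c > 0) * (c.sum() / max((c > 0).sum(), 1)) for c in chunks]
        )
        kl = kl_divergence(p, q)
        if kl < best_kl:
            best_kl, best_T = kl, edges[i]
    return best_T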
