Quantization method is different from paper?

Hello, thanks for your excellent work, and this project is beneficial to me.

However, I found that the quantization method in this project is somewhat different from that mentioned in the [paper](http://openaccess.thecvf.com/content_cvpr_2018/papers/Zhuang_Towards_Effective_Low-Bitwidth_CVPR_2018_paper.pdf).

In the paper, the quantization function is:<a href="https://www.codecogs.com/eqnedit.php?latex=\inline&space;z_q&space;=&space;Q(z_r)&space;=&space;\frac{1}{2^k&space;-&space;1}round((2^k&space;-&space;1)z_r)" target="_blank"><img src="https://latex.codecogs.com/svg.latex?\inline&space;z_q&space;=&space;Q(z_r)&space;=&space;\frac{1}{2^k&space;-&space;1}round((2^k&space;-&space;1)z_r)" title="z_q = Q(z_r) = \frac{1}{2^k - 1}round((2^k - 1)z_r)" /></a>

Weights are quantized using:
<a href="https://www.codecogs.com/eqnedit.php?latex=\inline&space;w_q&space;=&space;Q(\frac{\tanh(w)}{2\max(|\tanh(w)|)}&space;&plus;&space;\frac{1}{2})" target="_blank"><img src="https://latex.codecogs.com/svg.latex?\inline&space;w_q&space;=&space;Q(\frac{\tanh(w)}{2\max(|\tanh(w)|)}&space;&plus;&space;\frac{1}{2})" title="w_q = Q(\frac{\tanh(w)}{2\max(|\tanh(w)|)} + \frac{1}{2})" /></a>
which I think may be a mistake in the paper.

Activations are quantized using:
<a href="https://www.codecogs.com/eqnedit.php?latex=\inline&space;x_q&space;=&space;Q(clip(x,&space;0,&space;1))" target="_blank"><img src="https://latex.codecogs.com/svg.latex?\inline&space;x_q&space;=&space;Q(clip(x,&space;0,&space;1))" title="x_q = Q(clip(x, 0, 1))" /></a>

In this project, weights are quantized using:
<a href="https://www.codecogs.com/eqnedit.php?latex=\inline&space;w_q&space;=&space;2Q(\frac{\tanh(w)}{2\max(|\tanh(w)|)}&space;&plus;&space;\frac{1}{2})-1" target="_blank"><img src="https://latex.codecogs.com/svg.latex?\inline&space;w_q&space;=&space;2Q(\frac{\tanh(w)}{2\max(|\tanh(w)|)}&space;&plus;&space;\frac{1}{2})-1" title="w_q = 2Q(\frac{\tanh(w)}{2\max(|\tanh(w)|)} + \frac{1}{2})-1" /></a>

Activations are quantized using: <a href="https://www.codecogs.com/eqnedit.php?latex=\inline&space;x_q&space;=&space;2Q(clip(x,&space;0,&space;1))-1" target="_blank"><img src="https://latex.codecogs.com/svg.latex?\inline&space;x_q&space;=&space;2Q(clip(x,&space;0,&space;1))-1" title="x_q = 2Q(clip(x, 0, 1))-1" /></a> which I think may be a mistake.

In fact, according to  [quantize_module_.py](https://github.com/nowgood/QuantizeCNNModel/blob/master/quantize/quantize_module_.py), weights and activations are quantized using `gemm` method(an asymmetric uniform quantization method)

Is there any comparison between the two methods?

Best wishes.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantization method is different from paper? #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Quantization method is different from paper? #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions