Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Shen, Sheng, Dong, Zhen, Ye, Jiayu, Ma, Linjian, Yao, Zhewei, Gholami, Amir, Mahoney, Michael W., Keutzer, KurtVolume:
34
Journal:
Proceedings of the AAAI Conference on Artificial Intelligence
DOI:
10.1609/aaai.v34i05.6409
Date:
April, 2020
File:
PDF, 1.24 MB
2020