Project author: vineeths96

Project description:
In this repository, we explore model compression for transformer architectures via quantization. Specifically, we explore quantization-aware training of the linear layers and report performance for 8-bit, 4-bit, 2-bit, and 1-bit (binary) quantization.
Language: Python
Repository URL: git://github.com/vineeths96/Compressed-Transformers.git
Created: 2020-11-07T12:28:13Z
Project community: https://github.com/vineeths96/Compressed-Transformers

License: MIT License