论文标题

双精度深神经网络

Dual Precision Deep Neural Network

论文作者

Park, Jae Hyun, Choi, Ji Sub, Ko, Jong Hwan

论文摘要

深神经网络(DNN)的在线精确可伸缩性是支持DNN推断期间准确性和复杂性权衡的关键特征。在本文中,我们提出了双精度DNN,其中包含单个模型中的两种不同的精度模式,从而支持在线精确开关而无需重新训练。提出的两阶段训练过程优化了低精度和高精度模式。

On-line Precision scalability of the deep neural networks(DNNs) is a critical feature to support accuracy and complexity trade-off during the DNN inference. In this paper, we propose dual-precision DNN that includes two different precision modes in a single model, thereby supporting an on-line precision switch without re-training. The proposed two-phase training process optimizes both low- and high-precision modes.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源