论文标题

三分获胜:通过启用输入自适应推断,提高准确性,鲁棒性和效率

Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference

论文作者

Hu, Ting-Kuei, Chen, Tianlong, Wang, Haotao, Wang, Zhangyang

论文摘要

最近,建议深层网络面对精度(在清洁的自然图像上)和鲁棒性(在对抗性扰动图像上)之间的几率(Tsipras等,2019)。这种困境被证明源于固有的样品复杂性(Schmidt等,2018)和/或模型容量(Nakkiran,2019),用于学习高临界性和鲁棒分类器。考虑到这一点,赋予分类任务,增强模型能力似乎有助于在准确性和鲁棒性之间取得双赢,但以模型大小和延迟为代价,因此对资源受限的应用程序构成了挑战。是否可以共同设计模型的准确性,鲁棒性和效率来实现三重胜利?本文研究了与输入自适应有效推断相关的多外观网络,表明了他们在达到模型的准确性,鲁棒性和效率方面实现“甜蜜点”方面的强烈希望。我们提出的解决方案被称为强大的动态推理网络(RDI-NETS),允许每个输入(干净或对抗性)自适应地选择多个输出层(早期分支或最终分支)之一,以输出其预测。多层次的适应性为对抗性攻击和防御增添了新的变化和灵活性,我们在其中提出了系统的研究。我们通过实验表明,通过将现有的骨架与如此强大的自适应推断装备,与卫冕的原始模型相比,由此产生的RDI-NET可以实现更好的准确性和鲁棒性,但具有超过30%的计算节省。

Deep networks were recently suggested to face the odds between accuracy (on clean natural images) and robustness (on adversarially perturbed images) (Tsipras et al., 2019). Such a dilemma is shown to be rooted in the inherently higher sample complexity (Schmidt et al., 2018) and/or model capacity (Nakkiran, 2019), for learning a high-accuracy and robust classifier. In view of that, give a classification task, growing the model capacity appears to help draw a win-win between accuracy and robustness, yet at the expense of model size and latency, therefore posing challenges for resource-constrained applications. Is it possible to co-design model accuracy, robustness and efficiency to achieve their triple wins? This paper studies multi-exit networks associated with input-adaptive efficient inference, showing their strong promise in achieving a "sweet point" in cooptimizing model accuracy, robustness and efficiency. Our proposed solution, dubbed Robust Dynamic Inference Networks (RDI-Nets), allows for each input (either clean or adversarial) to adaptively choose one of the multiple output layers (early branches or the final one) to output its prediction. That multi-loss adaptivity adds new variations and flexibility to adversarial attacks and defenses, on which we present a systematical investigation. We show experimentally that by equipping existing backbones with such robust adaptive inference, the resulting RDI-Nets can achieve better accuracy and robustness, yet with over 30% computational savings, compared to the defended original models.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源