三分获胜：通过启用输入自适应推断，提高准确性，鲁棒性和效率

论文标题

三分获胜：通过启用输入自适应推断，提高准确性，鲁棒性和效率

Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference

论文作者

Hu, Ting-Kuei, Chen, Tianlong, Wang, Haotao, Wang, Zhangyang

论文摘要

最近，建议深层网络面对精度（在清洁的自然图像上）和鲁棒性（在对抗性扰动图像上）之间的几率（Tsipras等，2019）。这种困境被证明源于固有的样品复杂性（Schmidt等，2018）和/或模型容量（Nakkiran，2019），用于学习高临界性和鲁棒分类器。考虑到这一点，赋予分类任务，增强模型能力似乎有助于在准确性和鲁棒性之间取得双赢，但以模型大小和延迟为代价，因此对资源受限的应用程序构成了挑战。是否可以共同设计模型的准确性，鲁棒性和效率来实现三重胜利？本文研究了与输入自适应有效推断相关的多外观网络，表明了他们在达到模型的准确性，鲁棒性和效率方面实现“甜蜜点”方面的强烈希望。我们提出的解决方案被称为强大的动态推理网络（RDI-NETS），允许每个输入（干净或对抗性）自适应地选择多个输出层（早期分支或最终分支）之一，以输出其预测。多层次的适应性为对抗性攻击和防御增添了新的变化和灵活性，我们在其中提出了系统的研究。我们通过实验表明，通过将现有的骨架与如此强大的自适应推断装备，与卫冕的原始模型相比，由此产生的RDI-NET可以实现更好的准确性和鲁棒性，但具有超过30％的计算节省。

Deep networks were recently suggested to face the odds between accuracy (on clean natural images) and robustness (on adversarially perturbed images) (Tsipras et al., 2019). Such a dilemma is shown to be rooted in the inherently higher sample complexity (Schmidt et al., 2018) and/or model capacity (Nakkiran, 2019), for learning a high-accuracy and robust classifier. In view of that, give a classification task, growing the model capacity appears to help draw a win-win between accuracy and robustness, yet at the expense of model size and latency, therefore posing challenges for resource-constrained applications. Is it possible to co-design model accuracy, robustness and efficiency to achieve their triple wins? This paper studies multi-exit networks associated with input-adaptive efficient inference, showing their strong promise in achieving a "sweet point" in cooptimizing model accuracy, robustness and efficiency. Our proposed solution, dubbed Robust Dynamic Inference Networks (RDI-Nets), allows for each input (either clean or adversarial) to adaptively choose one of the multiple output layers (early branches or the final one) to output its prediction. That multi-loss adaptivity adds new variations and flexibility to adversarial attacks and defenses, on which we present a systematical investigation. We show experimentally that by equipping existing backbones with such robust adaptive inference, the resulting RDI-Nets can achieve better accuracy and robustness, yet with over 30% computational savings, compared to the defended original models.

下载PDF全文

下载文献需遵守相关版权规定

论文标题