神经网络中的确切特征碰撞

论文标题

神经网络中的确切特征碰撞

Exact Feature Collisions in Neural Networks

论文作者

Ozbulak, Utku, Gasparyan, Manvel, Rao, Shodhan, De Neve, Wesley, Van Messem, Arnout

论文摘要

深度神经网络做出的预测被证明对在输入空间中所做的小变化高度敏感，在该空间中，这种恶意制作的包含小扰动的数据点被称为对抗性示例。另一方面，最近的研究表明，同一网络也可能对大小的变化极为不敏感，在两个很大程度上不同的数据点可以映射到大致相同的输出。在这种情况下，据说两个数据点的特征大致相撞，因此导致了很大相似的预测。我们的结果改善并扩展了Li等人（2019年）的工作，为具有神经网络权重的角度勾结特征的数据点阐明了理论基础，这表明神经网络不仅遭受了近似碰撞的特征，而且还遭受了恰好相撞的功能。我们确定存在此类情况的必要条件，从而研究了大量用于解决各种计算机视觉问题的DNN。此外，我们提出了不依赖启发式方法的数值方法的空空间搜索，以为任何输入和任何任务（包括但不限于分类，本地化和细分）创建数据点。

Predictions made by deep neural networks were shown to be highly sensitive to small changes made in the input space where such maliciously crafted data points containing small perturbations are being referred to as adversarial examples. On the other hand, recent research suggests that the same networks can also be extremely insensitive to changes of large magnitude, where predictions of two largely different data points can be mapped to approximately the same output. In such cases, features of two data points are said to approximately collide, thus leading to the largely similar predictions. Our results improve and extend the work of Li et al.(2019), laying out theoretical grounds for the data points that have colluding features from the perspective of weights of neural networks, revealing that neural networks not only suffer from features that approximately collide but also suffer from features that exactly collide. We identify the necessary conditions for the existence of such scenarios, hereby investigating a large number of DNNs that have been used to solve various computer vision problems. Furthermore, we propose the Null-space search, a numerical approach that does not rely on heuristics, to create data points with colliding features for any input and for any task, including, but not limited to, classification, localization, and segmentation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题