论文标题
通过结构化对抗扰动进行数据增强
Data Augmentation via Structured Adversarial Perturbations
论文作者
论文摘要
数据增强是许多机器学习方法具有最先进性能的主要组成部分。共同的增强策略通过从转换空间中绘制随机样本来起作用。不幸的是,这种采样方法的表现性有限,因为它们无法扩展到由于维度的诅咒而取决于许多参数的丰富变换。对抗性示例可以视为数据增强的替代方案。通过对输入最困难的修改进行培训,因此,结果模型有望可以处理其他更容易的修改。对抗性增强的优点是,它用使用单个,计算出的扰动取代了采样,从而最大程度地增加了损失。然而,不利的是,这些原始的对抗性扰动似乎相当非结构化。与理想的数据增强技术相反,应用它们通常不会产生自然转化。为了解决这个问题,我们提出了一种生成维持某些自然结构的对抗示例的方法。我们首先构建一个仅包含所需结构的扰动的子空间。然后,我们将原始对抗梯度投射到该空间上,以选择结构化转换,该变换将在应用时最大程度地增加损失。我们通过两种类型的图像转换来证明这种方法:光度法和几何形状。此外,我们表明对这种结构化对抗图像进行训练可改善概括。
Data augmentation is a major component of many machine learning methods with state-of-the-art performance. Common augmentation strategies work by drawing random samples from a space of transformations. Unfortunately, such sampling approaches are limited in expressivity, as they are unable to scale to rich transformations that depend on numerous parameters due to the curse of dimensionality. Adversarial examples can be considered as an alternative scheme for data augmentation. By being trained on the most difficult modifications of the inputs, the resulting models are then hopefully able to handle other, presumably easier, modifications as well. The advantage of adversarial augmentation is that it replaces sampling with the use of a single, calculated perturbation that maximally increases the loss. The downside, however, is that these raw adversarial perturbations appear rather unstructured; applying them often does not produce a natural transformation, contrary to a desirable data augmentation technique. To address this, we propose a method to generate adversarial examples that maintain some desired natural structure. We first construct a subspace that only contains perturbations with the desired structure. We then project the raw adversarial gradient onto this space to select a structured transformation that would maximally increase the loss when applied. We demonstrate this approach through two types of image transformations: photometric and geometric. Furthermore, we show that training on such structured adversarial images improves generalization.