切断虚假关联的数据修剪方法

Severing Spurious Correlations with Data Pruning

摘要 Abstract

深度神经网络已被证明会学习并依赖于训练数据中存在的虚假关联。对这些虚假关联的依赖可能导致这些网络在实际应用中失效,因为在实际环境中这些关联可能不再成立。为了克服对虚假关联的学习和依赖,近期研究提出了几种有效的方法。然而,这些方法主要研究了虚假信号强度远大于核心不变信号强度的场景,这使得更容易检测单个训练样本中的虚假特征并进行进一步处理。本文识别出新的场景,其中虚假信号的强度相对较弱,难以检测到任何虚假信息但仍会导致灾难性后果。我们还发现,虚假关联主要是由于训练数据中少量包含虚假特征的样本导致的,并提出了一种新颖的数据修剪技术,该技术能够识别并修剪包含这些样本的小型训练数据子集。我们的方法无需推断领域知识、关于样本级虚假信息的存在或性质的信息,也不需要人工干预。最后,我们展示了这种数据修剪技术在先前研究的场景中取得了最先进的性能,这些场景中的虚假信息是可以识别的。

Deep neural networks have been shown to learn and rely on spurious correlations present in the data that they are trained on. Reliance on such correlations can cause these networks to malfunction when deployed in the real world, where these correlations may no longer hold. To overcome the learning of and reliance on such correlations, recent studies propose approaches that yield promising results. These works, however, study settings where the strength of the spurious signal is significantly greater than that of the core, invariant signal, making it easier to detect the presence of spurious features in individual training samples and allow for further processing. In this paper, we identify new settings where the strength of the spurious signal is relatively weaker, making it difficult to detect any spurious information while continuing to have catastrophic consequences. We also discover that spurious correlations are learned primarily due to only a handful of all the samples containing the spurious feature and develop a novel data pruning technique that identifies and prunes small subsets of the training data that contain these samples. Our proposed technique does not require inferred domain knowledge, information regarding the sample-wise presence or nature of spurious information, or human intervention. Finally, we show that such data pruning attains state-of-the-art performance on previously studied settings where spurious information is identifiable.

切断虚假关联的数据修剪方法 - arXiv