Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
Abstract
Dataset distillation (DD) performs well on simple datasets such as CIFAR, MNIST, and TinyImageNet, but struggles to achieve comparable results in more complex scenarios. In this paper, we propose EDF (Emphasizing Discriminative Features), a dataset distillation method that uses Grad-CAM activation maps to enhance the key discriminative regions of synthetic images. Our approach is motivated by a key observation: in simple datasets, high-activation areas typically occupy most of the image, whereas in complex scenarios these areas are much smaller. Unlike previous methods, which treat all pixels equally when synthesizing images, EDF emphasizes the high-activation areas. From the supervision perspective, we downplay supervision signals with lower losses, as they contain common patterns. Additionally, to help the DD community better explore complex scenarios, we build the Complex Dataset Distillation (Comp-DD) benchmark by carefully selecting sixteen subsets of ImageNet-1K, eight easy and eight hard. EDF consistently outperforms state-of-the-art results in complex scenarios such as ImageNet-1K subsets. We hope this work inspires more researchers to improve the practicality and efficacy of DD. Our code and benchmark will be made public at https://github.com/NUS-HPC-AI-Lab/EDF.
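The two ideas named in the abstract, emphasizing high-activation pixels and downplaying low-loss supervision, can be illustrated with a minimal NumPy sketch. The abstract does not specify how either step is implemented, so everything here is an assumption made for illustration: the function names `emphasize_gradient` and `drop_low_loss`, the quantile-based threshold, the `boost` factor, and the choice to discard (rather than down-weight) the smallest losses. The activation map is taken as a precomputed array; computing it with Grad-CAM is outside the scope of the sketch.

```python
import numpy as np


def emphasize_gradient(pixel_grad, activation_map, top_fraction=0.2, boost=2.0):
    """Scale the pixel update inside the highest-activation region.

    pixel_grad:     array of shape (C, H, W), gradient w.r.t. a synthetic image.
    activation_map: array of shape (H, W), e.g. a precomputed Grad-CAM map.
    top_fraction and boost are hypothetical knobs, not values from the paper.
    """
    # Pixels at or above this quantile count as "high activation".
    threshold = np.quantile(activation_map, 1.0 - top_fraction)
    weights = np.where(activation_map >= threshold, boost, 1.0)
    # The (H, W) weights broadcast across the channel dimension.
    return pixel_grad * weights


def drop_low_loss(losses, drop_fraction=0.25):
    """Sum supervision losses after discarding the smallest ones.

    Assumption: "downplaying" is modeled as dropping the lowest
    drop_fraction of the losses; other weightings are equally plausible.
    """
    losses = np.asarray(losses, dtype=float)
    num_dropped = int(len(losses) * drop_fraction)
    keep = np.ones(len(losses), dtype=bool)
    if num_dropped > 0:
        keep[np.argsort(losses)[:num_dropped]] = False
    return losses[keep].sum()


if __name__ == "__main__":
    cam = np.arange(16, dtype=float).reshape(4, 4)  # toy activation map
    grad = np.ones((3, 4, 4))                       # toy image gradient
    boosted = emphasize_gradient(grad, cam, top_fraction=0.25, boost=2.0)
    print(boosted[0])
    print(drop_low_loss([0.1, 0.5, 0.2, 0.9], drop_fraction=0.25))
```

In this toy setup only the four pixels with the largest activation values receive the doubled update, and the smallest of the four losses is excluded from the total. A real pipeline would apply the same weighting to gradients produced by the distillation objective rather than to constant arrays.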