CFReID: Continual Few-shot Person Re-Identification

Abstract

Real-world surveillance systems evolve dynamically, so person re-identification (ReID) models must continuously handle newly incoming data from various domains. To cope with these dynamics, Lifelong ReID (LReID) has been proposed to learn and accumulate knowledge across multiple domains incrementally. However, LReID models need to be trained on large-scale labeled data for each unseen domain, and such data is typically inaccessible due to privacy and cost concerns. In this paper, we propose a new paradigm called Continual Few-shot ReID (CFReID), which requires models to be incrementally trained using few-shot data and tested on all seen domains. Under few-shot conditions, CFReID faces two core challenges: 1) learning knowledge from the few-shot data of unseen domains, and 2) avoiding catastrophic forgetting of seen domains. To tackle these two challenges, we propose a Stable Distribution Alignment (SDA) framework from a feature distribution perspective. Specifically, our SDA is composed of two modules, i.e., Meta Distribution Alignment (MDA) and Prototype-based Few-shot Adaptation (PFA). To support the study of CFReID, we establish an evaluation benchmark on five publicly available ReID datasets. Extensive experiments demonstrate that our SDA enhances both few-shot learning and anti-forgetting capabilities under few-shot conditions. Notably, our approach, using only 5% of the data, i.e., 32 IDs, significantly outperforms state-of-the-art LReID methods, which require 700 to 1,000 IDs.
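The CFReID protocol described above (incremental training on few-shot data from each new domain, then testing on all domains seen so far) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the names `sample_few_shot`, `run_cfreid_protocol`, `train_step`, and `evaluate` are hypothetical, the rule for choosing which identities to keep is an assumption, and the SDA modules (MDA and PFA) are not modeled here.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical sample representation: (image identifier, person ID).
Sample = Tuple[str, int]


def sample_few_shot(train_set: Sequence[Sample], num_ids: int) -> List[Sample]:
    """Keep only the samples belonging to `num_ids` identities.

    Assumption: the lowest-numbered IDs are kept. The abstract only states
    that 32 IDs are used, not how they are selected.
    """
    kept_ids = set(sorted({pid for _, pid in train_set})[:num_ids])
    return [sample for sample in train_set if sample[1] in kept_ids]


def run_cfreid_protocol(
    domains: Sequence[Tuple[str, Sequence[Sample], Sequence]],
    train_step: Callable[[str, List[Sample]], None],
    evaluate: Callable[[Sequence], float],
    num_ids: int = 32,
) -> List[Dict[str, float]]:
    """Train on each domain in order with few-shot data, then test on all seen domains."""
    seen: List[Tuple[str, Sequence]] = []
    history: List[Dict[str, float]] = []
    for name, train_set, test_set in domains:
        # Incremental update using only the few-shot subset of the new domain.
        train_step(name, sample_few_shot(train_set, num_ids))
        seen.append((name, test_set))
        # Scores on earlier domains expose forgetting; the newest one shows adaptation.
        history.append({seen_name: evaluate(seen_test) for seen_name, seen_test in seen})
    return history
```

Each entry of the returned history holds one score per domain seen up to that step, so a drop in an earlier domain's score across entries indicates forgetting.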