上海交通大学学报

• •    

面向跨区域场景的无监督域自适应行人重识别(网络首发)

  

  1. 昆明理工大学信息工程与自动化学院云南省人工智能重点实验室
  • 基金资助:
    国家自然科学基金资助项目(62276120); 云南省基础研究专项(202301AV070004)

Unsupervised Domain Adaptation for Cross-Regional Scenes Person Re-identification

  1. (Faculty of Information Engineering and Automation;Key Laboratory of Artificial Intelligence in Yunnan Province, Kunming University of Science and Technology, Kunming 650500, China)

摘要: 在大规模监控系统中,由于跨区域场景间的距离较远,从不同区域的相机中获取行人正样本变得极为困难,这限制了行人重识别模型在跨区域场景中的有效应用。为解决跨区域场景中跨相机缺乏正样本的问题,提出一种多粒度特征挖掘和域不变特征学习的无监督域自适应行人重识别方法。该方法主要包含多粒度特征学习模块和域分布对齐模块。在多粒度特征学习模块中,通过全局特征学习提取行人的全局判别性特征。为进一步提升所提取行人特征的判别性,提出了局部一致性特征学习模块来加强行人局部特征之间的交互。通过全局和局部特征的学习,促进网络提取行人多粒度的判别性特征来提升行人重识别模型的性能。此外,设计了域分布对齐模块,通过风格迁移为目标域数据样本构建跨相机不同风格的正样本,解决了跨区域场景中跨相机缺乏正样本的问题,同时提升了模型的域自适应能力。在Market-1501、DukeMTMC、CUHK03和MSMT17数据集上的实验表明,所提方法相较于当前先进的域自适应行人重识别方法具有明显优势。

关键词: 行人重识别, 域自适应, 多粒度特征挖掘, 域分布对齐

Abstract: In large-scale surveillance systems, the absence of positive cross-camera pedestrian samples in cross-regional scenes poses a limitation on the performance of person re-identification models. To tackle this challenge, an unsupervised domain adaptive person re-identification method incorporating multi-granularity feature mining and domain-invariant feature learning is proposed. The method comprises a multi-granularity feature learning module and a domain distribution alignment module. Within the multi-granularity feature learning module, global discriminant features of pedestrians are extracted through global features learning. To further enhance the discriminative features of pedestrians, a local consistency feature learning module is proposed to strengthen interactions among local features. Through the learning of both global and local features, the network is encouraged to extract multi-granularity discriminative features, thereby elevating the performance of the person re-identification model. Additionally, this study incorporates a domain distribution alignment module, conducting style transfer to construct positive samples with diverse styles across cameras for target domain. This not only addresses the issue of the lack of positive samples across cameras in cross-regional scenes but also enhances the domain adaptation capabilities of the model. Extensive experiments conducted on the Market-1501, DukeMTMC, CUHK03 and MSMT17 datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art person re-identification methods.

Key words: person re-identification, domain adaptation, multi-granularity feature mining, domain distribution alignment

中图分类号: