上海交通大学学报 ›› 2025, Vol. 59 ›› Issue (12): 1878-1890.doi: 10.16183/j.cnki.jsjtu.2023.635

• 电子信息与电气工程 • 上一篇    下一篇

面向跨区域场景的无监督域自适应行人重识别

毛彦嵋1,2, 李华锋1,2(), 张亚飞1,2   

  1. 1 昆明理工大学 信息工程与自动化学院
    2 云南省人工智能重点实验室, 昆明 650500
  • 收稿日期:2023-12-19 修回日期:2024-01-31 接受日期:2024-03-25 出版日期:2025-12-28 发布日期:2025-12-30
  • 通讯作者: 李华锋 E-mail:hfchina99@163.com
  • 作者简介:毛彦嵋(1998—),硕士生,从事计算机视觉研究.
  • 基金资助:
    国家自然科学基金资助项目(62276120);云南省基础研究专项(202301AV070004)

Unsupervised Domain Adaptation for Cross-Regional Scenes Person Re-Identification

MAO Yanmei1,2, LI Huafeng1,2(), ZHANG Yafei1,2   

  1. 1 Faculty of Information Engineering and Automation
    2 Key Laboratory of Artificial Intelligence in Yunnan Province, Kunming University of Science and Technology, Kunming 650500, China
  • Received:2023-12-19 Revised:2024-01-31 Accepted:2024-03-25 Online:2025-12-28 Published:2025-12-30
  • Contact: LI Huafeng E-mail:hfchina99@163.com

摘要:

在大规模监控系统中,由于跨区域场景间的距离较远,从不同区域的相机中获取行人正样本变得极为困难,这限制了行人重识别模型在跨区域场景中的有效应用.为解决跨区域场景中跨相机缺乏正样本的问题,提出一种多粒度特征挖掘和域不变特征学习的无监督域自适应行人重识别方法.该方法主要包含多粒度特征学习模块和域分布对齐模块.在多粒度特征学习模块中,通过全局特征学习提取行人的全局判别性特征.为进一步提升所提取行人特征的判别性,提出了局部一致性特征学习模块来加强行人局部特征之间的交互.通过全局和局部特征的学习,促进网络提取行人多粒度的判别性特征来提升行人重识别模型的性能.此外,设计了域分布对齐模块,通过风格迁移为目标域数据样本构建跨相机不同风格的正样本,解决了跨区域场景中跨相机缺乏正样本的问题,同时提升了模型的域自适应能力.在Market-1501、DukeMTMC、CUHK03和MSMT17数据集上的实验表明,所提方法相较于当前先进的域自适应行人重识别方法具有明显优势.

关键词: 行人重识别, 域自适应, 多粒度特征挖掘, 域分布对齐

Abstract:

In large-scale surveillance systems, the lack of positive cross-camera pedestrian samples in cross-regional scenes limits the performance of person re-identification (Re-ID) models. To tackle this challenge, an unsupervised domain adaptive person re-identification method incorporating multi-granularity feature mining and domain-invariant feature learning is proposed. The method consists of a multi-granularity feature learning module and a domain distribution alignment module. Within the multi-granularity feature learning module, global discriminant features of pedestrians are extracted through global features learning. To further enhance the discriminative capability of pedestrian features, a local consistency feature learning module is proposed to strengthen the interactions among local features. By jointly learning global and local features, the network is encouraged to extract multi-granularity discriminative features, thereby improving the performance of the person re-identification model. Additionally, a domain distribution alignment module is incorporated, leveraging style transfer to generate positive samples with diverse styles across cameras for target domain. This not only addresses the issue of the lack of positive samples across cameras in cross-regional scenes but also enhances the domain adaptation capabilities of the model. Extensive experiments conducted on the Market-1501, DukeMTMC, CUHK03, and MSMT17 datasets demonstrate the effectiveness of the proposed method compared to state-of-the-art person re-identification methods.

Key words: person re-identification (Re-ID), domain adaptation, multi-granularity feature mining, domain distribution alignment

中图分类号: