J Shanghai Jiaotong Univ Sci ›› 2026, Vol. 31 ›› Issue (2): 305-318.doi: 10.1007/s12204-024-2700-9
收稿日期:2023-08-17
接受日期:2023-09-07
出版日期:2026-04-01
发布日期:2024-01-16
陆聚首1,陈浩1,柏玉川2,胡川3,张希3
Received:2023-08-17
Accepted:2023-09-07
Online:2026-04-01
Published:2024-01-16
摘要: 针对过街场景下智能车与行人冲突多发的情况,提出了一套针对行人检测、跟踪和意图识别的集成方法。首先提出基于C2f_CA模块改进YOLOv8模型完成对行人的准确检测、跟踪和姿态估计;然后提出多种意图识别特征在空间和时域关系下表征行人的位置与姿态;最后以特征数据为输入,基于以SVM、 KNN和随机森林三者为基模型的Stacking异质集成方法完成行人的意图识别建模。对上述模型进行实验验证,结果表明,改进后的YOLOv8模型相较于原模型检测精度提高了5.4%,基于Stacking异质集成模型的行为意图识别在JAAD数据集上可以达到94.0%的准确率,相比于现有的意图识别模型提升了3.4%以上;在行人不同部位被遮挡的情况下,模型的准确率依旧达到65.8%~73.3%,验证了该方法的鲁棒性。该方法为自动驾驶汽车的决策规划提供了可靠的输入,有利于提高自动驾驶的安全性。
中图分类号:
. 基于骨架特征的行人过街意图识别[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 305-318.
Lu Jushou, Chen Hao, Bai Yuchuan, Hu Chuan, Zhang Xi. Recognition of Pedestrians’ Street-Crossing Intentions Based on Skeleton Features[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 305-318.
| [1] HU Y C, LI M K. Challenges and responses of self-driving vehicles to road traffic safety law [J]. Journal of Shanghai Jiao Tong University (Philosophy and Social Sciences), 2019, 27(1): 44-53 (in Chinese). [2] ZHOU M C. Criminal liability of traffic accident caused by self-driving vehicles [J]. Journal of Shanghai Jiao Tong University (Philosophy and Social Sciences), 2019, 27(1): 36-43 (in Chinese). [3] YANG B, FAN F C, YANG J C, et al. Recognition of pedestrians’ street-crossing intentions based on action prediction and environment context [J]. Automotive Engineering, 2021, 43(7): 1066-1076 (in Chinese). [4] WANG R P, CUI Y, SONG X A, et al. Multi-information-based convolutional neural network with attention mechanism for pedestrian trajectory prediction [J]. Image and Vision Computing, 2021, 107: 104110. [5] ABUGHALIEH K M, ALAWNEH S G. Predicting pedestrian intention to cross the road [J]. IEEE Access, 2020, 8: 72558-72569. [6] FANG H S, LI J F, TANG H Y, et al. AlphaPose: Whole-body regional multi-person pose estimation and tracking in real-time [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(6): 7157-7173. [7] OSOKIN D. Real-time 2D multi-person pose estimation on CPU: Lightweight OpenPose [DB/OL]. (2018-11-29). https://arxiv.org/abs/1811.12004 [8] GHORI O, MACKOWIAK R, BAUTISTA M, et al. Learning to forecast pedestrian intention from pose dynamics [C]//2018 IEEE Intelligent Vehicles Symposium. Changshu: IEEE, 2018: 1277-1284. [9] HU Y Z, JIANG T, LIU X, et al. Pedestrian-crossing intention-recognition based on dual-stream adaptive graph-convolutional neural-network [J]. Journal of Automotive Safety and Energy, 2022, 13(2): 325-332 (in Chinese). [10] LÜ C, CUI G G, MENG X H, et al. Graph representation method for pedestrian intention recognition of intelligent vehicle [J]. Transactions of Beijing Institute of Technology, 2022, 42(7): 688-695 (in Chinese). [11] ZHANG Y F, SUN P Z, JIANG Y, et al. ByteTrack: multi-object tracking by associating every detection box [M]// Computer vision – ECCV 2022. Cham: Springer, 2022: 1-21. [12] HOU Q B, ZHOU D Q, FENG J S. Coordinate attention for efficient mobile network design [C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville: IEEE, 2021: 13708-13717. [13] HU J, SHEN L, SUN G. Squeeze-and-excitation networks [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018: 7132-7141. [14] WOO S, PARK J, LEE J Y, et al. CBAM: convolutional block attention module [M]// Computer vision – ECCV 2018. Cham: Springer, 2018: 3-19. [15] LI C X, LU S B, ZHANG B H, et al. Human-vehicle steering collision avoidance path planning based on pedestrian location prediction [J]. Automotive Engineering, 2021, 43(6): 877-884 (in Chinese). [16] NAVEED H, KHAN G, KHAN A U, et al. Human activity recognition using mixture of heterogeneous features and sequential minimal optimization [J]. International Journal of Machine Learning and Cybernetics, 2019, 10(9): 2329-2340. [17] WANG J, LIU Z C, WU Y, et al. Mining actionlet ensemble for action recognition with depth cameras [C]//2012 IEEE Conference on Computer Vision and Pattern Recognition. Providence: IEEE, 2012: 1290-1297. [18] SUN J H, GE H Y, ZHANG Z H. AS-YOLO: An improved YOLOv4 based on attention mechanism and SqueezeNet for person detection [C]//2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference. Chongqing: IEEE, 2021: 1451-1456. [19] LI D P, REN X M, YAN N N. Real-time detection of insulator drop string based on UAV aerial photography [J]. Journal of Shanghai Jiao Tong University, 2022, 56(8): 994-1003 (in Chinese). [20] GIRSHICK R. Fast R-CNN [C]//2015 IEEE International Conference on Computer Vision. Santiago: IEEE, 2015: 1440-1448. [21] LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detector [M]// Computer vision – ECCV 2016. Cham: Springer, 2016: 21-37. [22] XING Z W, KAN B, LIU Z S, et al. Airport pavement snow and ice state perception based on improved YOLOX-S [J]. Journal of Shanghai Jiao Tong University, 2023, 57(10): 1292-1304 (in Chinese). [23] ZHANG S L, ABDEL-ATY M, WU Y N, et al. Pedestrian crossing intention prediction at red-light using pose estimation [J]. IEEE Transactions on Intelligent Transportation Systems, 2022, 23(3): 2331-2339. [24] YAN S J, XIONG Y J, LIN D H. Spatial temporal graph convolutional networks for skeleton-based action recognition [J]. Proceedings of the AAAI Conference on Artificial Intelligence, 2018, 32(1): 7444-7452. |
| [1] | . YOLO-VSF:一个引入注意机制的改进YOLO模型用于交通场景目标检测[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 334-347. |
| [2] | . 基于改进YOLOv5l的交通信号灯识别[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 319-333. |
| [3] | 夏筱彦, 张宇, 胡锡坤, 钟平. 基于扩散模型的无人机遥感目标检测物理对抗攻击方法研究[J]. 空天防御, 2026, 9(1): 52-62. |
| [4] | . 机场飞行区域空域入侵物视觉检测[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 176-186. |
| [5] | . 图卷积网络融合脑电与肌电用于上肢运动意图识别[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 12-23. |
| [6] | . MAGPNet: 基于多域注意力引导的红外弱小目标检测网络[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 935-951. |
| [7] | . 用于内窥镜图像息肉检测的实时轻量级卷积神经网络[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 521-534. |
| [8] | 薛昂, 姜恩宇, 张文涛, 林顺富, 米阳. 基于窗口自注意力网络与YOLOv5融合的输电线路通道异物检测[J]. 上海交通大学学报, 2025, 59(3): 413-423. |
| [9] | 詹泽辉, 钟铭恩, 袁彬淦, 谭佳威, 范康. 随机平视摄像条件下的路边车辆违停检测[J]. 上海交通大学学报, 2025, 59(10): 1568-1580. |
| [10] | 王一力, 李强, 沈俊逸, 杨翊东, 王琦. 多模态舰船图像生成及其关键部位检测[J]. 空天防御, 2025, 8(1): 77-85. |
| [11] | 李楚晨, 唐善军, 赵冰青. 一种基于无人机探测图像区块信息的弱小目标检测算法[J]. 空天防御, 2025, 8(1): 41-47. |
| [12] | 刘婧, 郭晓雷, 张欣海, 毛靖军, 吕瑞恒. 空面导弹轻量化空中斜框目标检测算法[J]. 空天防御, 2024, 7(4): 106-113. |
| [13] | 张彦军1,4,5,6,7, 王碧云2,3 , 蔡云泽1,4,5,6,7. 基于注意力的多通道网络红外弱小目标检测[J]. J Shanghai Jiaotong Univ Sci, 2024, 29(3): 414-427. |
| [14] | 牛国臣, 孙翔宇, 苑峥岩. 基于双流特征提取的车路协同感知方法[J]. 上海交通大学学报, 2024, 58(11): 1826-1834. |
| [15] | 章涛, 张雪瑞, 陈勇, 钟科林, 罗其俊. 面向民机可视导航的场面多尺度目标检测[J]. 上海交通大学学报, 2024, 58(11): 1816-1825. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||