J Shanghai Jiaotong Univ Sci ›› 2026, Vol. 31 ›› Issue (2): 305-318.doi: 10.1007/s12204-024-2700-9

• • 上一篇    下一篇

基于骨架特征的行人过街意图识别

  

  1. 1. 上海理工大学 机械工程学院,上海 200093;2. 山东交通技师学院,山东临沂 276002;3. 上海交通大学 机械与动力工程学院,上海 200030
  • 收稿日期:2023-08-17 接受日期:2023-09-07 出版日期:2026-04-01 发布日期:2024-01-16

Recognition of Pedestrians’ Street-Crossing Intentions Based on Skeleton Features

陆聚首1,陈浩1,柏玉川2,胡川3,张希3   

  1. 1. School of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China; 2. Shandong Technician College of Transportation, Linyi 276002, Shandong, China; 3. School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
  • Received:2023-08-17 Accepted:2023-09-07 Online:2026-04-01 Published:2024-01-16

摘要: 针对过街场景下智能车与行人冲突多发的情况,提出了一套针对行人检测、跟踪和意图识别的集成方法。首先提出基于C2f_CA模块改进YOLOv8模型完成对行人的准确检测、跟踪和姿态估计;然后提出多种意图识别特征在空间和时域关系下表征行人的位置与姿态;最后以特征数据为输入,基于以SVM、 KNN和随机森林三者为基模型的Stacking异质集成方法完成行人的意图识别建模。对上述模型进行实验验证,结果表明,改进后的YOLOv8模型相较于原模型检测精度提高了5.4%,基于Stacking异质集成模型的行为意图识别在JAAD数据集上可以达到94.0%的准确率,相比于现有的意图识别模型提升了3.4%以上;在行人不同部位被遮挡的情况下,模型的准确率依旧达到65.8%~73.3%,验证了该方法的鲁棒性。该方法为自动驾驶汽车的决策规划提供了可靠的输入,有利于提高自动驾驶的安全性。

关键词: 目标检测, 意图识别, Stacking模型, 骨架特征

Abstract: An integrated method is proposed to solve the problem of frequent conflicts between autonomous vehicles and pedestrians in the street crossing scene. The method involves pedestrian detection, tracking, and intention recognition. First, an enhanced YOLOv8 is introduced by combining the C2f CA module to achieve accurate pedestrian detection, tracking and pose estimation. Second, a variety of intention recognition features are proposed to characterize the position and pose of pedestrians in spatial and time domains. Finally, by taking the feature data as input for the base learners, the intention classification model is proposed based on the Stacking model with SVM, KNN, and random forest as the base learners and XGBoost as the meta learner. The experimental results show that the enhanced YOLOv8 improves the detection accuracy by 5.4% compared with the original model, and the intention recognition based on the Stacking model can achieve 94.0% accuracy on the JAAD dataset, which is improved by more than 3.4% compared with the existing intention recognition models. Furthermore, when different parts of a pedestrian are occluded, the accuracy of the Stacking model still reaches 65.8%—73.3%, which verifies the robustness of the proposed model. The proposed model provides reliable inputs for decision planning of autonomous vehicles, which is conducive to improving the safety of self-driving.

中图分类号: