J Shanghai Jiaotong Univ Sci, 2026, Vol. 31, Issue (1): 154-166. doi: 10.1007/s12204-025-2851-3
• Intelligent Robots •
Xia Jie1, Wu Xiaodong1, Xu Min2
Received:2024-12-16
Revised:2025-02-25
Accepted:2025-03-21
Online:2026-02-28
Published:2025-10-14
Xia Jie, Wu Xiaodong, Xu Min. BEV-Fused Imitation and Reinforcement Learning for Autonomous Driving Planning[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 154-166.