J Shanghai Jiaotong Univ Sci ›› 2026, Vol. 31 ›› Issue (1): 187-194. doi: 10.1007/s12204-025-2816-6
• Intelligent Robots •
Qu Xingru, Li Chu, Jiang Yuze, Long Feifei, Zhang Rubo
Received: 2024-11-13
Accepted: 2024-12-02
Online: 2026-02-28
Published: 2026-02-12
Qu Xingru, Li Chu, Jiang Yuze, Long Feifei, Zhang Rubo. Cooperative Pursuit of Unmanned Surface Vehicles Using Multi-Agent Reinforcement Learning[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 187-194.
| [1] | Wang Longsheng, Yuan Wei, Zhuang Hanyang, Wang Chunxiang, Yang Ming. Acceleration Optimization-Based Speed Planning Method for High-Precision Longitudinal Control of Wheeled Robots [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 48-58. |
| [2] | Zhang Dong, Liu Sheng, Shi Mengyao, Cai Yu, Wang Dazhong. Misaligned Parallel-Chamber Soft Pneumatic Network Actuator for Multi-Mode Gripping [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 59-70. |
| [3] | Ceng Yuxuan, Zhao Wentao, Chen Yongtao, Xiao Peng, Wang Jingchuan, Guo Rui. SDA-Loc: A Semantic-Driven Alignment Algorithm for Cross-Modal Localization in Point Cloud Maps [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 117-129. |
| [4] | Peng Chengyu, Chen Baifan, Li Siyu, Jin Yuxuan, Wan Jiadong, Fu Yuesi. Hybrid Topological Map Fusion Based on Memory Sphere [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 130-142. |
| [5] | Xia Jie, Wu Xiaodong, Xu Min. BEV-Fused Imitation and Reinforcement Learning for Autonomous Driving Planning [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 154-166. |
| [6] | Li Mingwang, Li Xinde, Zhang Zhentong, Wang Zeyu, Zhao Haoming. Haptic-Aided Navigation Vehicle: Enhancing Obstacle Detection in Blind Spots and Transparent Object Scenarios [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 167-175. |
| [7] | Zhang Han, Zhang Guoliang, Feng Shengjie, Li Qingyun, Qu Jieming, Xie Le. Development of Surgical Robot for CT-Guided Lung Biopsy [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 1-11. |
| [8] | Li Mengwen, Lv Penghao, Liu Qiao, Dai Yu, Zhang Jianxun. Leader-Follower Control Algorithm for Minimally Invasive Surgical Robot [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 24-35. |
| [9] | YU Xinyi, XU Siyu, FAN Yuehai, OU Linlin. Self-Adaptive LSAC-PID Approach Based on Lyapunov Reward Shaping for Mobile Robots [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1085-1102. |
| [10] | LI Chunyang, ZHU Xiaoqing, RUAN Xiaogang, LIU Xinyuan, ZHANG Siyuan. Gait Learning Reproduction for Quadruped Robots Based on Experience Evolution Proximal Policy Optimization [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1125-1133. |
| [11] | ZHAO Xiangtang, ZHAO Zhigang, WEI Qizhe, SU Cheng. Dynamic Analysis and Trajectory Solution of Multi-Robot Coordinated Towing System [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1134-1143. |
| [12] | SUN Bowen, YANG Jianhua, LI Baofeng, LI Shangyuan, WANG Liang, XU Zhongqi. Wire Rope Inspection Robots: A Review [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1144-1161. |
| [13] | SU Cheng, ZHAO Xiangtang, YAN Zengzhen, ZHAO Zhigang, MENG Jiadong. Load Stability Analysis of a Floating Multi-Robot Coordinated Towing System [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1162-1170. |
| [14] | BANKOLE Adesola Temitope, IGBONOBA Ezekiel Endurance Chukwuemeke. Simulation-Based Novel Hybrid Proportional Derivative/H-Infinity Controller Design for Improved Trajectory Tracking of a Two-Link Robot Arm [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1179-1187. |
| [15] | Dong Kaijie, Li Ziqi, Gao Mingxing, Zhang Jianhua, Li Duanling. Review: Development of Micro-Scale Planetary Surface Exploration Robots [J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 221-240. |