|
|
Reinforcement Learning-Based Target Assignment Method for Many-to-Many Interceptions |
GUO Jianguo1, HU Guanjie1, XU Xinpeng1,2, LIU Yue2, CAO Jin2 |
1. Institute of Precision Guidance and Control, School of Astronautics, Northwestern Polytechnical University, Xi’an 710072, Shaanxi, China;
2. Shanghai Electro-Mechanical Engineering Institute, Shanghai 201109, China |
|
|
Abstract Aiming at the issue of weapon target assignment for a many-to-many interception in the air confrontation environment, this study has proposed a multi-target intelligent assignment method based on reinforcement learning. Under the many-to-many interception engagement scenario, a mathematical model of target assignment was established based on the engagement posture evaluation. By introducing the concepts of target threat degree and interception effectiveness degree, the interception urgency of each target and the interception capability characterization of each interceptor were fully reflected, allowing a comprehensive evaluation of the engagement posture of the attacking and defending sides. Based on the target assignment model, the target assignment issue was built up using a Markov decision process and was trained to be solved by a reinforcement learning algorithm using deep Q-network. Relying on the self-learning and reward mechanism under environment interaction, the dynamic generation of optimal assignment schemes was effectively realized. A many-to-many interception scenario was created and its effectiveness was verified through mathematical simulation, and the result shows that the trained target assignment method satisfies the requirements of continuous and dynamic task assignment in many-to-many interception.
|
Received: 20 September 2023
Published: 04 March 2024
|
|
|
|
|
[1] |
MA Chi, ZHANG Guoqun, SUN Junge, LYU Guangzhe, ZHANG Tao. Deep Reinforcement Learning-Based Reconfiguration Method for Integrated Electronic Systems[J]. Air & Space Defense, 2024, 7(1): 63-70. |
[2] |
WANG Xu, CAI Yuanli, ZHANG Xuecheng, ZHANG Rongliang, HAN Chenglong. Intercept Guidance Law with a Low Acceleration Ratio Based on Hierarchical Reinforcement Learning[J]. Air & Space Defense, 2024, 7(1): 40-47. |
[3] |
LI Mengxuan, GUO Jianguo, XU Xinpeng, SHEN Yuheng. Guidance Law Based on Proximal Policy Optimization[J]. Air & Space Defense, 2023, 6(4): 51-57. |
[4] |
SHANG Xi, YANG Gewen, DAI Shaohuai, JIANG Yilin. Research on Resource Allocation Strategy of One-to-Many Radar Jamming Based on Reinforcement Learning[J]. Air & Space Defense, 2022, 5(1): 94-101. |
[5] |
HE Linkun, ZHANG Ran, GONG Qinghai. Landing Guidance of Reusable Launch Vehicle Based on Reinforcement Learning[J]. Air & Space Defense, 2021, 4(3): 33-40. |
[6] |
LI Zhaoying, OU Yiming, SHI Ruoling. Improved RRT Path Planning Algorithm Based on Deep Q-network[J]. Air & Space Defense, 2021, 4(3): 17-23. |
[7] |
CHEN Tao, ZHANG Ying, HUANG Xiangsong. Adaptive Interference Waveform Design Based on Reinforcement Learning[J]. Air & Space Defense, 2021, 4(2): 59-. |
[8] |
Zhou Lai, Jin Xiaowei, Zheng Yikai. Researchon Operational Decision Support Based on Deep Reinforcement Learning[J]. Air & Space Defense, 2018, 1(1): 31-35. |
|
|
|
|