Journal of Shanghai Jiao Tong University(Science) ›› 2020, Vol. 25 ›› Issue (5): 561-568.doi: 10.1007/s12204-020-2226-8

• • 上一篇    下一篇

Missile-Target Situation Assessment Model Based on Reinforcement Learning

ZHANG Yun (张贇), Lü Runyan (吕润妍), CAI Yunze (蔡云泽)   

  1. (Department of Automation; Key Laboratory of System Control and Information Processing of Ministry of Education;
    Key Laboratory of Marine Intelligent Equipment and System of Ministry of Education,
    Shanghai Jiao Tong University, Shanghai 200240, China)
  • 出版日期:2020-10-28 发布日期:2020-09-11
  • 通讯作者: CAI Yunze (蔡云泽) E-mail:yzcai@sjtu.edu.cn

Missile-Target Situation Assessment Model Based on Reinforcement Learning

ZHANG Yun (张贇), Lü Runyan (吕润妍), CAI Yunze (蔡云泽)   

  1. (Department of Automation; Key Laboratory of System Control and Information Processing of Ministry of Education;
    Key Laboratory of Marine Intelligent Equipment and System of Ministry of Education,
    Shanghai Jiao Tong University, Shanghai 200240, China)
  • Online:2020-10-28 Published:2020-09-11
  • Contact: CAI Yunze (蔡云泽) E-mail:yzcai@sjtu.edu.cn

摘要: In situation assessment (SA) of missile versus target fighter, the traditional SA models generally
have the characteristics of strong subjectivity and poor dynamic adaptability. This paper considers SA as an
expectation of future returns and establishes a missile-target simulation battle model. The actor-critic (AC)
algorithm in reinforcement learning (RL) is used to train the evaluation network, and a missile-target SA model
is established in simulation battle training. Simulation and comparative experiments show that the model can
effectively estimate the expected effect of missile attack under the current situation, and it provides an effective
basis for missile attack decision.

关键词: situation assessment (SA), battle model, reinforcement learning (RL), actor-critic (AC) algorithm

Abstract: In situation assessment (SA) of missile versus target fighter, the traditional SA models generally
have the characteristics of strong subjectivity and poor dynamic adaptability. This paper considers SA as an
expectation of future returns and establishes a missile-target simulation battle model. The actor-critic (AC)
algorithm in reinforcement learning (RL) is used to train the evaluation network, and a missile-target SA model
is established in simulation battle training. Simulation and comparative experiments show that the model can
effectively estimate the expected effect of missile attack under the current situation, and it provides an effective
basis for missile attack decision.

Key words: situation assessment (SA), battle model, reinforcement learning (RL), actor-critic (AC) algorithm

中图分类号: