风扰下无人机栖落机动的强化学习控制设计 |
张威振, 何真, 汤张帆 |
Reinforcement Learning Control Design for Perching Maneuver of Unmanned Aerial Vehicles with Wind Disturbances |
ZHANG Weizhen, HE Zhen, TANG Zhangfan |
图8 不同策略学习阶段奖励值变化 |
Fig.8 Reward variation of different policy learning stages |
![]() |