Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

doi:10.1007/s12204-021-2357-6

J Shanghai Jiaotong Univ Sci ›› 2021, Vol. 26 ›› Issue (5): 680-685.doi: 10.1007/s12204-021-2357-6

收稿日期:2020-11-28 出版日期:2021-10-26 发布日期:2021-10-28
通讯作者: LUO Wenguang? (罗文广), ?E-mail: wgluo@gxust.edu.cn

Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

JI Xiukun¹ (冀秀坤), HAI Jintao¹ (海金涛), LUO Wenguang¹ (罗文广), LIN Cuixia¹ (林翠霞), XIONG Yu² (熊禹), OU Zengkai² (殴增开), WEN Jiayan¹ (文家燕)

(1. Guangxi Key Laboratory of Auto Parts and Vehicle Technology; School of Electrical and Information Engineering, Guangxi University of Science and Technology, Liuzhou 545006, Guangxi, China; 2. Technology Center of Dongfeng Liuzhou Automobile Co., Ltd., Liuzhou 545000, Guangxi, China)

Received:2020-11-28 Online:2021-10-26 Published:2021-10-28

摘要/Abstract

Abstract: To solve the problems of di?cult control law design, poor portability, and poor stability of traditional multi-agent formation obstacle avoidance algorithms, a multi-agent formation obstacle avoidance method based on deep reinforcement learning (DRL) is proposed. This method combines the perception ability of convolutional neural networks (CNNs) with the decision-making ability of reinforcement learning in a general form and realizes direct output control from the visual perception input of the environment to the action through an end-to-end learning method. The multi-agent system (MAS) model of the follow-leader formation method was designed with the wheelbarrow as the control object. An improved deep Q netwrok (DQN) algorithm (we improved its discount factor and learning e?ciency and designed a reward value function that considers the distance relationship between the agent and the obstacle and the coordination factor between the multi-agents) was designed to achieve obstacle avoidance and collision avoidance in the process of multi-agent formation into the desired formation. The simulation results show that the proposed method achieves the expected goal of multi-agent formation obstacle avoidance and has stronger portability compared with the traditional algorithm.

Key words: wheelbarrow, multi-agent, deep reinforcement learning (DRL), formation, obstacle avoidance

中图分类号:

O 231.5

. [J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 680-685.

JI Xiukun (冀秀坤), HAI Jintao (海金涛), LUO Wenguang (罗文广), LIN Cuixia (林翠霞), XIONG Yu(熊禹), OU Zengkai (殴增开), WEN Jiayan(文家燕). Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning[J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 680-685.

参考文献 12

[1]	XIE G, ZHANG Y. Survey of consensus problem in cooperative control of multi-agent systems [J]. Appli-cation Research of Computers, 2011, 28(6): 2035-2039 (in Chinese).
[2]	CHEN Z, LIN L, YAN G. An approach to scienti?c cooperative robotics: Through MAS (multi-agent sys-tem) [J]. Robot, 2001, 23(4): 368-373 (in Chinese). [3] DUAN Y, YANG H, CUI B, et al. Application of re-inforcement learning to basic action learning of soccer robot [J]. Robot, 2008, 30(5): 453-459 (in Chinese).
[4]	LITTMAN M L. Reinforcement learning improves be-haviour from evaluative feedback [J]. Nature, 2015, 521(7553): 445-451.
[5]	ZHU Y, ZHAO D. Probably approximately correct re-inforcement learning solving continuous-state control problem [J]. Control Theory and Applications, 2016, 33(12): 1603-1613 (in Chinese).
[6]	ZHOU W. The application of deep learning algo-rithms in intelligent collaborative robots [J]. China New Telecommunications, 2017, 19(21): 129-130 (in Chinese).
[7]	POLYDOROS A S, NALPANTIDIS L. Survey of model-based reinforcement learning: Applications on robotics [J]. Journal of Intelligent & Robotic Systems, 2017, 86(2): 153-173.
[8]	LIMA H, KUROE Y. Swarm reinforcement learning methods improving certaintyof learningfor amulti-robot formation problem [C]//2015 IEEE Congress on Evolutionary Computation (CEC). Sendai: IEEE, 2015: 3026-3033.
[9]	LIU Q, ZHAI J, ZHANG Z, et al. A survey on deep reinforcement learning [J]. Chinese Journal of Com-puters, 2018, 41(1): 1-27 (in Chinese).
[10]	RIEDMILLER M. Neural ?tted Q iteration: First ex-periences with a data e?cient neural reinforcement learning method [M]//Machine learning: ECML2005. Berlin, Heidelberg: Springer, 2005: 317-328.
[11]	LANGE S, RIEDMILLER M. Deep auto-encoder neu-ral networks in reinforcement learning [C]//The 2010 International Joint Conference on Neural Networks (IJCNN). Barcelona: IEEE, 2010: 1-8.
[12]	ABTAHI F, FASEL I. Deep belief nets as func-tion approximators for reinforcement learning [C]//Workshops at the Twenty-Fifth AAAI Confer-ence on Arti?cial Intelligence. Frankfurt: AAAI, 2011:
	2- 7.

编辑推荐 0

Metrics

阅读次数

全文

HTML			PDF

最新录用	在线预览	正式出版	最新录用	在线预览	正式出版
0	0	0	0	0	89

来源	本网站	其他网站

次数	78	11
比例	88%	12%

摘要

431

最新录用	在线预览	正式出版

0	0	431

来源	本网站	其他网站

次数	299	132
比例	69%	31%

Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献 12

相关文章 0

编辑推荐 0

Metrics

本文评价