基于深度强化学习的电网拓扑优化及潮流控制
周毅, 周良才, 丁佳立, 高佳宁

Power Network Topology Optimization and Power Flow Control Based on Deep Reinforcement Learning
ZHOU Yi, ZHOU Liangcai, DING Jiali, GAO Jianing
图7 使用ε-贪婪策略训练智能体的过程
Fig.7 Training process of AI agent by using ε-greedy strategy