基于深度强化学习的电网拓扑优化及潮流控制 |
周毅, 周良才, 丁佳立, 高佳宁 |
Power Network Topology Optimization and Power Flow Control Based on Deep Reinforcement Learning |
ZHOU Yi, ZHOU Liangcai, DING Jiali, GAO Jianing |
图7 使用ε-贪婪策略训练智能体的过程 |
Fig.7 Training process of AI agent by using ε-greedy strategy |