上海交通大学学报 ›› 2025, Vol. 59 ›› Issue (10): 1464-1475.doi: 10.16183/j.cnki.jsjtu.2023.529

• 新型电力系统与综合能源 • 上一篇    下一篇

基于负荷准线和深度强化学习的含电动汽车集群系统新能源消纳策略

刘雁行1, 乔如妤1, 梁楠1, 陈宇1, 于凯1, 吴汉霄2()   

  1. 1 内蒙古电力(集团)有限责任公司 电力营销服务与运营管理分公司, 呼和浩特 010011
    2 上海交通大学 电力传输与功率变换控制教育部重点实验室, 上海 200240
  • 修回日期:2023-11-20 接受日期:2023-11-30 出版日期:2025-10-28 发布日期:2025-10-24
  • 通讯作者: 吴汉霄,硕士生;E-mail:1362257580@qq.com.
  • 作者简介:刘雁行(1986—),副高级工程师,从事电力营销服务研究.

Renewable Energy Consumption Strategies of Power System Integrated with Electric Vehicle Clusters Based on Load Alignment and Deep Reinforcement Learning

LIU Yanhang1, QIAO Ruyu1, LIANG Nan1, CHEN Yu1, YU Kai1, WU Hanxiao2()   

  1. 1 Power Marketing Service and Operation Management Branch, Inner Mongolia Power (Group) Co., Ltd., Hohhot 010011, China
    2 Key Laboratory of Control of Power Transmission and Conversion of the Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
  • Revised:2023-11-20 Accepted:2023-11-30 Online:2025-10-28 Published:2025-10-24

摘要:

当前,我国正加快构建以新能源为主体的新型电力系统,然而大规模新能源的接入也使得弃风、弃光问题日益突出.为提升电力系统对新能源的消纳能力,提出一种基于负荷准线与深度强化学习的新能源消纳新方法.首先,建立基于线性化潮流计算的节点负荷准线生成模型,该模型能够引导可调负荷调整用电时段,从而促进新能源消纳.与直流潮流模型相比,该模型在考虑电压约束等电力系统的相关约束的基础上,实现了对全部非线性化约束的线性化处理,显著降低了计算复杂度.其次,构建基于负荷准线机制的市场框架,并以电动汽车集群作为可调负荷对象,研究负荷准线激励价格的求解方法.负荷准线机制框架包括独立系统运营商、区域电网售电商和电动汽车可调负荷聚合商3类主体,负荷准线激励价格的求解涉及三者之间的主从博弈问题.由于该模型数学解析求解难度较大,故采用深度强化学习算法求解:以各节点边际电价为状态空间,以负荷准线激励价格作为动作空间,以区域电网售电商的成本作为反馈,通过持续训练使智能体找到最大化区域电网售电商利益的负荷准线激励价格.最后,算例分析表明:所提出的负荷准线机制不仅能够有效提升新能源消纳水平,还可同时增加独立系统运营商、区域电网售电商和电动汽车聚合商的收益;同时,深度强化学习算法在实现区域电网售电商的利益最大化方面表现出良好效果.

关键词: 负荷准线, 新能源消纳, 深度强化学习, 需求侧响应

Abstract:

As China accelerates the construction of power systems with renewable energy as the mainstay, the large-scale integration of renewables has led to prominent issues such as wind and light curtailment. To improve the utilization of new energy consumption in power systems, this paper proposes a novel renewable energy consumption method based on load alignment and deep reinforcement learning. First, it proposes a node load line formation model based on linearized power flow calculations, which can guide adjustable loads to shift the electricity consumption period, thereby promoting the improvement of new energy consumption. Unlike the direct current (DC) power flow model, the proposed alternating current (AC) model accounts for voltage constraints and other related constraints of the power system. Compared with other AC power flow models, this model linearizes all nonlinear constraints and has lower computational costs. Then, this paper constructs a market framework for load alignment mechanism. The framework involves three main entities: independent system operators, regional power grid sellers, and electric vehicle adjustable load aggregators. It also explores the solution for load alignment incentive prices using electric vehicle clusters as adjustable loads. As the solution of the load benchmark incentive price involves a master-slave game between three entities, conventional mathematical analysis methods face high complexity. Therefore, it employs deep reinforcement learning algorithm to solve the problem. The deep reinforcement learning algorithm takes the marginal electricity price of each node as state space, the load benchmark incentive price as action space, and the cost of regional power grid sellers as feedback. The agent can find the load line incentive price that maximizes the benefits of regional power grid sellers after continuous training. Finally, the example analysis shows that the load alignment mechanism not only effectively promotes the improvement of new energy consumption level, but also enhances the interests of independent system operators, regional power grid sellers, and electric vehicle aggregators. The results further confirm that the deep reinforcement learning algorithm maximizes the benefits of regional power grid sellers.

Key words: load alignment, renewable energy consumption, deep reinforcement learning, demand response

中图分类号: