基于深度强化学习的电网薄弱线路时序故障辨识策略
CSTR:
作者:
作者单位:

(1.三峡大学电气与新能源学院 ,湖北 宜昌 443002;2.新能源微电网湖北省协同创新中心 ,湖北 宜昌 443002;3.国网湖北省电力有限公司武 汉市供电公司 ,湖北 武汉 430010)

作者简介:

通讯作者:

苏攀(1985—),男,硕士研究生,主要从事输电线路工程方面的研究;E-mail:2512400720@qq.com

中图分类号:

TM712

基金项目:

国家自然科学基金(52407118);梯级水电站运行与控制湖北省重点实验室(三峡大学)开放基金课题(2023KJX06);电力系统智能运行与安全防御宜昌市重点实验室(三峡大学)开放基金课题(2020DLXY06)


Timing fault identification strategy for weak lines in power grid based on deep reinforcement learning
Author:
Affiliation:

(1. College of Electrical Engineering and New Energy , China Three Gorges University , Yichang 443002, China; 2. Hubei Collaborative Innovation Center for New Energy Microgrid , Yichang 443002, China; 3. Wuhan Power Supply Company , State Grid Hubei Electric Power Co ., Ltd., Wuhan 430010, China)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为有效预防线路时序故障导致的电网连 锁故障,提出一种融合深度强化学习 (deep reinforcement learning,DRL )与暂态稳定约束的辨识方法。该方法核心在于将辨识任务形式化为一个马尔可夫决策过程 (Markov decision process,MDP )问题,使DRL智能体能够通过与暂态仿真环境的交互学习,高效筛选出导致系统失稳的关键故障路径。其先引入一种结合 Q值与时序累积效应的薄弱度指标,实现对薄弱线路的精准定位;再结合电网时域暂态仿真计算,筛选易导致电网失稳的关键故障;然后,结合时序故障累积效应与 Q学习,提出线路薄弱度指标,计算得到时序故障累积效应情况下考虑暂态稳定约束的电网薄弱线路;最后,使用 IEEE 5节点、IEEE 39节点和 IEEE 300节点系统作为测试案例,进行仿真,其仿真结果均验证了所提方法在学习效率和识别电网薄弱环节方面的适用性。研究结果表明,该DRL方法在 Q学习的基础上结合乐观的初始猜测和贪心算法,可实现对关键故障的选择,并能评估在不同线路故障情况下的学习效率,其稳定性和训练速度均较好。

    Abstract:

    To effectively prevent cascadin g faults of the power grid caused by line timing faults,an identification method that integrates deep reinforcement learning (DRL) and transient stability constraints is proposed.The core of this method lies in formalizing the identification task as a Markov decision process (MDP) problem,enabling the DRL agent to efficiently screen out the key fault paths that cause system instability through interactive learning with the transient simulation environment.Firstly,a vulnerability index combining Q value and timing cumulative effect is introduced,achieving precise positioning of weak lines.By combining the time-domain transient simulation calculation of the power grid,the key faults that are prone to cause power grid instability are screened out.Then,the line weakness index is proposed through Q learning combined with the cumulative effect of timing faults,and the weak lines in the power grid considering the transient stability constraint under the cumulative effect of timing faults are calculated and obtained.Finally,IEEE 5 node,IEEE 39 node,and IEEE 300 node systems are used to simulate as test cases.The simulation results all verify the applicability of the proposed method in terms of learning efficiency and weak link identification in the power grid.Research results show that the proposed DRL method,based on Q learning,combines optimistic initial guessing and greedy algorithm to achieve selection for critical faults and evaluate the learning efficiency under different line fault conditions,with favorable stability and fast training speed.

    参考文献
    相似文献
    引证文献
引用本文

刘颂凯,艾宇坤,苏攀,等.基于深度强化学习的电网薄弱线路时序故障辨识策略[J].电力科学与技术学报,2025,40(6):54-66.
LIU Songkai, AI Yukun, SU Pa n, et al. Timing fault identification strategy for weak lines in power grid based on deep reinforcement learning[J]. Journal of Electric Power Science and Technology,2025,40(6):54-66.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-12-26
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-02-03
  • 出版日期:
文章二维码