Intelligent frequency control strategy based on multi‑objective reinforcement learning of cooperative reward function
Author:
Affiliation:

(1.State Grid Henan Electric Power Company, Zhengzhou 450052,China;2.China Electric Power Research Institute, Beijing 100192, China;3.Songyang Power Supply Company, State Grid Zhejiang Electric Power Co., Ltd., Songyang 323400, China; 4.School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China)

Clc Number:

TM933

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    In the intelligent frequency control strategy with large?scale wind power grid?connected system, only considering the CPS control criterion can easily cause the frequency off?limit in a short time, which seriously affects the control effect of the intelligent AGC control strategy. This paper proposes a multi?objective collaborative reward function reinforcement learning algorithm (TOPQ?MORL) intelligent frequency control strategy, which constructs a collaborative reward function that takes into account the multi?dimensional frequency control performance evaluation criteria, and realizes the coordinating evaluation of multi?dimensional frequency control performance standards on the time scale .The TOPQ learning strategy is used to optimize the action space of the agent globally, which effectively solves the problem of poor calculation efficiency of the Q function linear weighted multi?objective reinforcement learning algorithm under the traditional greedy strategy. The simulation results of the AGC control model of the standard two?region interconnected power grid shows that the intelligent AGC control strategy proposed in this paper can effectively improve the frequency control performance and improve the frequency quality of the system on the full?time scale obviously.

    Reference
    Related
    Cited by
Get Citation

韩保军,高 强,代 飞,杨 宵,吕 颖,许忠义,付希越.基于协同奖励函数多目标强化学习的智能频率控制策略研究[J].电力科学与技术学报英文版,2023,38(2):18-29. HAN Baojun, GAO Qiang, DAI fei, YANG Xiao, Lü Ying, XU Zhongyi, , FU Xiyue. Intelligent frequency control strategy based on multi‑objective reinforcement learning of cooperative reward function[J]. Journal of Electric Power Science and Technology,2023,38(2):18-29.

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: June 29,2023
  • Published: