Deep Reinforcement Learning for Distribution System Operations: A Tutorial and Survey

Research output: Contribution to journalArticlepeer-review

1 Scopus Citations

Abstract

The rapid evolution of modern electric power distribution systems into complex networks of interconnected active devices, distributed generation (DG), and storage poses increasing difficulties for system operators. The large-scale integration of distributed energy resources (DERs) and the rapid exchange of measurement data via communication networks present major opportunities for advancing grid operations but also introduce greater uncertainty, higher data dimensionality, more complex network and device models, and challenging control and optimization problems. Deep reinforcement learning (DRL) algorithms are promising in addressing these challenges. However, they have not been effectively adapted for power systems applications, requiring extensive customization for implementation and evaluation. This has resulted in reproducibility challenges and a steep learning curve for researchers new to applying DRL algorithms to the power systems domain. To bridge these gaps, this tutorial aims to serve as a valuable resource for researchers interested in exploring learning-based algorithms to operate active power distribution networks. Specifically, this work presents a generalized process for translating sequential decision-making problems in power distribution systems into Markov decision process (MDP) formulations, illustrated through concrete grid service examples. Additionally, we introduce a simple environment design strategy to develop and evaluate example DRL algorithms for distribution system applications, complete with an included code repository to guide users through environment construction.
Original languageAmerican English
Number of pages29
JournalProceedings of the IEEE
DOIs
StatePublished - 2025

NLR Publication Number

  • NREL/JA-6A40-97039

Keywords

  • deep reinforcement learning (DRL)
  • grid services
  • Markov decision process (MDP)
  • optimization
  • power distribution systems

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning for Distribution System Operations: A Tutorial and Survey'. Together they form a unique fingerprint.

Cite this