Safe Deep Reinforcement Learning for Active Distribution System Model Predictive Control with EVs and DERs

Research output: Contribution to journalArticlepeer-review

Abstract

The temporal and spatial mismatch between PV generation and electric vehicle (EV) charging and discharging may cause voltage violations in active distribution networks. Despite the widespread use of deep reinforcement learning (DRL) in power system optimization and control, it lacks guarantees on constraint satisfaction during both training and deployment. This paper proposes a Lagrangian-based safe DRL approach for model predictive control (MPC) of active distribution systems with large-scale integration of PVs, EVs, and energy storage systems (ESSs). A Transformer-LSTM time-series model is proposed to forecast EV charging demand, which is then formulated as a constraint to ensure charging requirements are met. Using this prediction, a Lagrangian-based safe soft actor-critic (SAC) framework is developed for real-time control in a three-phase unbalanced distribution system, enforcing voltage safety constraints while optimizing the cumulative net reward. By integrating the forecasting model with multi-period constraints, the proposed framework jointly coordinates PV systems, EV charging and discharging, and ESS scheduling within the MPC horizon. Numerical experiments on a modified IEEE 123-bus system with real-world data show that, under a high PV penetration scenario, the proposed method increases the net reward by 30.74% and reduces average voltage violations from 0.0011 p.u. to 0.0002 p.u. compared with standard SAC. Compared with the optimal power flow (OPF) approach, it achieves similar voltage security while yielding lower line losses. It also maintains real-time control capability, reducing operation latency to 53.21 ms per 15-minute control interval. The proposed method remains effective under varying PV/EV penetrations and load conditions.
Original languageAmerican English
Number of pages14
JournalIEEE Transactions on Industry Applications
DOIs
StatePublished - 2026

NLR Publication Number

  • NLR/JA-5D00-99182

Keywords

  • deep reinforcement learning
  • distribution systems
  • electric vehicle (EV)
  • energy storage system (ESS)
  • Lagrangian relaxation
  • model predictive control (MPC)
  • PV

Fingerprint

Dive into the research topics of 'Safe Deep Reinforcement Learning for Active Distribution System Model Predictive Control with EVs and DERs'. Together they form a unique fingerprint.

Cite this