TY - JOUR
T1 - Data-Driven Switching Control Technique Based on Deep Reinforcement Learning for Packed E-Cell as Smart EV Charger
AU - Gheisarnejad, Meysam
AU - Fathollahi, Arman
AU - Sharifzadeh, Mohammad
AU - Laurendeau, Eric
AU - Al-Haddad, Kamal
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2025/2
Y1 - 2025/2
N2 - Among hybrid multilevel rectifiers (HMRs), packed E-cell appeared as an interesting topology due to the generation of nine-level voltage with minimum active/passive devices, but appropriate control design of PEC rectifier is vital demand to keep capacitors voltages well-regulated even under unbalanced/variable dc loads. Therefore, the backstepping control (BSC) strategy is developed to control a nine-level packed E-cell (PEC9) rectifier to be used as a smart EV charger. Proximal policy optimization (PPO) with actor and critic deep neural networks (ADNNs and CDNNs) is trained to adjust the BSC controller, where the PEC9 rectifier can intelligently deal with asymmetrical/symmetrical dc loads. By maximizing a reward function, the PPO agent tries to find the optimal policy to design the control coefficients of BSC with the aim of regulating the PEC9 capacitor's voltages. The developed BSC based on the PPO tuner is validated using hardware-in-the-loop (HiL) and experimental implementation of the PEC9 rectifier to assess the performance of the proposed control scheme.
AB - Among hybrid multilevel rectifiers (HMRs), packed E-cell appeared as an interesting topology due to the generation of nine-level voltage with minimum active/passive devices, but appropriate control design of PEC rectifier is vital demand to keep capacitors voltages well-regulated even under unbalanced/variable dc loads. Therefore, the backstepping control (BSC) strategy is developed to control a nine-level packed E-cell (PEC9) rectifier to be used as a smart EV charger. Proximal policy optimization (PPO) with actor and critic deep neural networks (ADNNs and CDNNs) is trained to adjust the BSC controller, where the PEC9 rectifier can intelligently deal with asymmetrical/symmetrical dc loads. By maximizing a reward function, the PPO agent tries to find the optimal policy to design the control coefficients of BSC with the aim of regulating the PEC9 capacitor's voltages. The developed BSC based on the PPO tuner is validated using hardware-in-the-loop (HiL) and experimental implementation of the PEC9 rectifier to assess the performance of the proposed control scheme.
KW - Backstepping control (BSC)
KW - deep neural network (DNN)
KW - hybrid multilevel rectifiers (HMRs)
KW - nine-level packed E-cell (PEC9)
UR - https://www.scopus.com/pages/publications/85200208328
U2 - 10.1109/TTE.2024.3435763
DO - 10.1109/TTE.2024.3435763
M3 - Journal article
AN - SCOPUS:85200208328
SN - 2332-7782
VL - 11
SP - 3194
EP - 3203
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
IS - 1
ER -