Author Identifier
Seyyed Morteza Ghamari: https://orcid.org/0000-0001-5082-820X
Daryoush Habibi: https://orcid.org/0000-0002-7662-6830
Asma Aziz: https://orcid.org/0000-0003-3538-0536
Document Type
Journal Article
Publication Title
Energies
Volume
18
Issue
12
Publisher
MDPI
School
School of Engineering
RAS ID
82150
Funders
Edith Cowan University
Abstract
The Dual Active Bridge converter (DABC), known for its bidirectional power transfer capability and high efficiency, plays a crucial role in various applications, particularly in electric vehicles (EVs), where it facilitates energy storage, battery charging, and grid integration. The Dual Active Bridge Converter (DABC), when paired with a high-performance CLLC filter, is well-regarded for its ability to transfer power bidirectionally with high efficiency, making it valuable across a range of energy applications. While these features make the DABC highly efficient, they also complicate controller design due to nonlinear behavior, fast switching, and sensitivity to component variations. We have used a Fractional-order PID (FOPID) controller to benefit from the simple structure of classical PID controllers with lower complexity and improved flexibility because of additional filtering gains adopted in this method. However, for a FOPID controller to operate effectively under real-time conditions, its parameters must adapt continuously to changes in the system. To achieve this adaptability, a Multi-Agent Reinforcement Learning (MARL) approach is adopted, where each gain of the controller is tuned individually using the Deep Deterministic Policy Gradient (DDPG) algorithm. This structure enhances the controller’s ability to respond to external disturbances with greater robustness and adaptability. Meanwhile, finding the best initial gains in the RL structure can decrease the overall efficiency and tracking performance of the controller. To overcome this issue, Grey Wolf Optimization (GWO) algorithm is proposed to identify the most suitable initial gains for each agent, providing faster adaptation and consistent performance during the training process. The complete approach is tested using a Hardware-in-the-Loop (HIL) platform, where results confirm accurate voltage control and resilient dynamic behavior under practical conditions. In addition, the controller’s performance was validated under a battery management scenario where the DAB converter interacts with a nonlinear lithium-ion battery. The controller successfully regulated the State of Charge (SOC) through automated charging and discharging transitions, demonstrating its real-time adaptability for BMS-integrated EV systems. Consequently, the proposed MARL-FOPID controller reported better disturbance-rejection performance in different working cases compared to other conventional methods.
DOI
10.3390/en18123046
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Comments
Ghamari, S. M., Habibi, D., & Aziz, A. (2025). Robust adaptive fractional-order PID controller design for high-power DC-DC dual active bridge converter enhanced using multi-agent deep deterministic policy gradient algorithm for electric vehicles. Energies, 18(12), 3046. https://doi.org/10.3390/en18123046