Sensors & Transducers
Vol. 270, Issue 3, November 2025, pp. 69-78
_______________
Intelligent Intersection Management through Multi-agent Reinforcement Learning, Self-adaptive Phase Adjustment and Visible Light Communication
1, 2
Manuel A. VIEIRA,
1
Tomás ANTUNES,
1
Gonçalo GALVÃO,
1, 2, 3 Manuela VIEIRA, 1, 5 Mário VÉSTIAS and 1, 2 Paula LOURO
1
Electronics Telecommunication and Computer Dept. ISEL,
R. Conselheiro Emídio Navarro, 1959-007 Lisboa, Portugal
2
UNINOVA –CTS and LASI, Quinta da Torre, Monte da Caparica, 2829-516, Caparica, Portugal
3
NOVA School of Science and Technology, Quinta da Torre,
Monte da Caparica, 2829-516, Caparica, Portugal
4
Instituto de Telecomunicações, Instituto Superior Técnico, 1049-001,
​Lisboa, Portugal
5
INESC INOV-Lab, Instituto Superior Técnico, Universidade de Lisboa, 1000-029, Lisboa, Portugal
E-mail: manuela.vieira@isel.pt
Received: 31 May 2025 / Revised: 7 Nov. 2025 / Accepted: 12 Nov. 2025 /
​Published: 28 Nov. 2025
​​
Abstract:
Urban traffic management faces growing challenges due to increasing volumes of vehicles and pedestrians, resulting
in congestion, delays, and safety concerns. This study introduces an innovative traffic signal control framework that integrates
Multi-Agent Reinforcement Learning (MARL) with Visible Light Communication (VLC) and a Self-Adaptive Phase
Adjustment (SAPA) module to enhance coordination across urban intersections. In the proposed architecture, each intersection
is governed by an independent Deep Reinforcement Learning (DRL) agent capable of making real-time, decentralized
decisions for both vehicular and pedestrian flows. Cooperative behavior emerges through the MARL framework, allowing
agents to account for the dynamic states of neighboring intersections and improve network-wide efficiency. VLC provides
high-resolution, low-latency data exchange between vehicles, pedestrians, and infrastructure, enabling precise sensing of
position, speed, queue length, and stop duration. The introduction of the SAPA module further enhances adaptability by
dynamically adjusting phase durations based on real-time queue/request/response patterns, resolving conflicts and prioritizing
urgent demands. Extensive simulations and field tests demonstrate that the MARL–VLC–SAPA system significantly
outperforms centralized and conventional agent-based approaches, reducing waiting and travel times while improving overall
safety and responsiveness in complex urban environments.
Keywords: Multi-Agent Reinforcement Learning (MARL), Visible Light Communication (VLC), Intelligent traffic signal control, Real-time decision making, Road safety.
_________________________________________________________________________________________