TY - GEN
T1 - Multi-Agent Deep Reinforcement Learning for Spectrum Management in V2X with Social Roles
AU - Chen, Po Yen
AU - Zheng, Yu Heng
AU - Althamary, Ibrahim
AU - Chern, Jann Long
AU - Huang, Chih Wei
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In a vehicle-to-everything (V2X) communication system involving multiple vehicle types, there is a more challenging and practical problem compared to a single-type scenario. Each vehicle type acts autonomously with distinct communication policies. While prior knowledge can establish behavior for each agent type, it may reduce the adaptability and versatility of the system. This paper proposes a role-oriented actor-critic (ROAC) approach, where vehicles of similar types share similar policies in a satellite-assisted V2X network for more precise and effective spectrum management. The vehicles are trained to optimize system utility by selecting transmission modes, power levels, and sub-channels. The social role properties enable each agent to make better decisions based on the environment and its type. The ROAC model provides 8-10% higher normalized system utility over other advanced methods, even with vehicle-role extension, in situations with heavier traffic.
AB - In a vehicle-to-everything (V2X) communication system involving multiple vehicle types, there is a more challenging and practical problem compared to a single-type scenario. Each vehicle type acts autonomously with distinct communication policies. While prior knowledge can establish behavior for each agent type, it may reduce the adaptability and versatility of the system. This paper proposes a role-oriented actor-critic (ROAC) approach, where vehicles of similar types share similar policies in a satellite-assisted V2X network for more precise and effective spectrum management. The vehicles are trained to optimize system utility by selecting transmission modes, power levels, and sub-channels. The social role properties enable each agent to make better decisions based on the environment and its type. The ROAC model provides 8-10% higher normalized system utility over other advanced methods, even with vehicle-role extension, in situations with heavier traffic.
KW - multi-agent reinforcement learning
KW - resource allocation
KW - social roles
KW - V2x
UR - http://www.scopus.com/inward/record.url?scp=85187366726&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85187366726&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM54140.2023.10437067
DO - 10.1109/GLOBECOM54140.2023.10437067
M3 - Conference contribution
AN - SCOPUS:85187366726
T3 - Proceedings - IEEE Global Communications Conference, GLOBECOM
SP - 2293
EP - 2298
BT - GLOBECOM 2023 - 2023 IEEE Global Communications Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE Global Communications Conference, GLOBECOM 2023
Y2 - 4 December 2023 through 8 December 2023
ER -