Full title—Multi-agent Reinforcement Learning Resources Allocation Method using Dueling Double Deep Q-Network in Vehicular Networks
For the Internet of Vehicles (IoV), it is a fundamental challenge to achieve low latency and high reliability communication for real-time data interaction over short distances in a complex wireless propagation environment, as well as to attenuate and avoid inter-vehicle interference through a reasonable spectrum allocation.
To solve the above problems, this paper proposes a resource allocation (RA) method using dueling double deep-Q network reinforcement learning (RL) with low-latitude fingerprints and soft-update architecture (D3QN-LS) while constructing a multi-agent model based on a Manhattan grid layout urban virtual environment, with communication links between V2V links acting as agents to reuse vehicle-to-infrastructure (V2I) spectrum resources.
Full Article: IEEE Transactions on Vehicular Technology, Early Access 2023 |