Meta-reinforcement learning based resource allocation for dynamic V2X communications

Yuan, Yi; Zheng, Gan; Wong, Kai-Kit; Letaief, Khaled B.

journal contribution

posted on 2021-11-15, 11:47 authored by Yi Yuan, Gan Zheng, Kai-Kit Wong, Khaled B. Letaief

This paper studies the allocation of shared resources between vehicle-to-infrastructure (V2I) and vehicle-to-vehicle (V2V) links in vehicle-to-everything (V2X) communications. In existing algorithms, dynamic vehicular environments and the quantization of continuous power are the bottlenecks that prevent an effective and timely resource allocation policy. In this paper, we develop two algorithms to deal with these difficulties. First, we propose a deep reinforcement learning (DRL)-based resource allocation algorithm to improve the performance of both V2I and V2V links. Specifically, the algorithm uses a deep Q-network (DQN) to solve the discrete sub-band assignment problem and deep deterministic policy gradient (DDPG) to solve the continuous power allocation problem. Second, we propose a meta-based DRL algorithm that enables the resource allocation policy to adapt quickly in a dynamic environment. Numerical results demonstrate that the proposed DRL-based algorithm significantly outperforms the DQN-based algorithm that quantizes continuous power. In addition, the proposed meta-based DRL algorithm achieves the required fast adaptation in a new environment with limited experience.
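As a rough illustration of the hybrid action described in the abstract, the sketch below pairs a discrete sub-band choice (the DQN side) with a continuous power level (the DDPG side). It is not the authors' implementation: the function names, the epsilon-greedy rule, the sigmoid squashing, and the 23.0 power limit are all assumptions made for illustration, and the neural networks are replaced by plain numbers standing in for their outputs.

```python
import math
import random


def select_subband(q_values, epsilon, rng):
    """Epsilon-greedy pick over discrete sub-bands (hypothetical DQN-style step)."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))  # explore: random sub-band
    # exploit: index of the largest Q-value
    return max(range(len(q_values)), key=lambda i: q_values[i])


def select_power(actor_output, p_max, noise_std, rng):
    """Squash an unbounded actor output into a continuous power in [0, p_max].

    The sigmoid mapping is an assumption; the abstract only states that power
    is treated as continuous rather than quantized.
    """
    noisy = actor_output + rng.gauss(0.0, noise_std)  # exploration noise
    return p_max / (1.0 + math.exp(-noisy))


def hybrid_action(q_values, actor_output, p_max, epsilon=0.0, noise_std=0.0, seed=0):
    """Return (sub-band index, transmit power) for one link."""
    rng = random.Random(seed)
    band = select_subband(q_values, epsilon, rng)
    power = select_power(actor_output, p_max, noise_std, rng)
    return band, power


# Greedy, noise-free example: sub-band 1 has the largest Q-value.
band, power = hybrid_action([0.1, 0.7, 0.3], actor_output=0.0, p_max=23.0)
```

In this sketch the power can take any value up to `p_max`, whereas a DQN-only baseline of the kind the abstract compares against would have to choose from a fixed set of quantized levels.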

Funding

Unlocking Potentials of MIMO Full-duplex Radios for Heterogeneous Networks (UPFRONT)

Engineering and Physical Sciences Research Council

6G Mitola Radio: Cognitive Brain That Has Collective Intelligence

Engineering and Physical Sciences Research Council

Leverhulme Trust Research Project Grant under grant RPG-2017-129

Hong Kong Research Grant Council under Grant No. 16220719

History

School

Mechanical, Electrical and Manufacturing Engineering

Published in

IEEE Transactions on Vehicular Technology

Volume

70

Issue

9

Pages

8964 - 8977

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Version

AM (Accepted Manuscript)

Rights holder

Publisher statement

Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Acceptance date

2021-07-16

Publication date

2021-07-26

Copyright date

2021

DOI

https://doi.org/10.1109/TVT.2021.3098854

ISSN

0018-9545

eISSN

1939-9359

Publisher version

https://doi.org/10.1109/TVT.2021.3098854

Language

en

Depositor

Prof Gan Zheng. Deposit date: 12 November 2021

