TY - JOUR
T1 - Multi-Agent Deep Reinforcement Learning Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing
AU - Wang, Liang
AU - Wang, Kezhi
AU - Pan, Cunhua
AU - Xu, Wei
AU - Aslam, Nauman
AU - Hanzo, Lajos
N1 - Research funded by National Natural Science Foundation of China (6202202661871109) | Engineering and Physical Sciences Research Council (EP/P003990/1 (COALESCE)EP/N004558/1EP/P034284/1) | Royal Society
PY - 2021/3
Y1 - 2021/3
N2 - An unmanned aerial vehicle (UAV)-aided mobile edge computing (MEC) framework is proposed, where several UAVs having different trajectories fly over the target area and support the user equipments (UEs) on the ground. We aim to jointly optimize the geographical fairness among all the UEs, the fairness of each UAV’ UE-load and the overall energy consumption of UEs. The above optimization problem includes both integer and continues variables and it is challenging to solve. To address the above problem, a multi-agent deep reinforcement learning based trajectory control algorithm is proposed for managing the trajectory of each UAV independently, where the popular Multi-Agent Deep Deterministic Policy Gradient (MADDPG) method is applied. Given the UAVs’ trajectories, a low-complexity approach is introduced for optimizing the offloading decisions of UEs. We show that our proposed solution has considerable performance over other traditional algorithms, both in terms of the fairness for serving UEs, fairness of UE-load at each UAV and energy consumption for all the UEs.
AB - An unmanned aerial vehicle (UAV)-aided mobile edge computing (MEC) framework is proposed, where several UAVs having different trajectories fly over the target area and support the user equipments (UEs) on the ground. We aim to jointly optimize the geographical fairness among all the UEs, the fairness of each UAV’ UE-load and the overall energy consumption of UEs. The above optimization problem includes both integer and continues variables and it is challenging to solve. To address the above problem, a multi-agent deep reinforcement learning based trajectory control algorithm is proposed for managing the trajectory of each UAV independently, where the popular Multi-Agent Deep Deterministic Policy Gradient (MADDPG) method is applied. Given the UAVs’ trajectories, a low-complexity approach is introduced for optimizing the offloading decisions of UEs. We show that our proposed solution has considerable performance over other traditional algorithms, both in terms of the fairness for serving UEs, fairness of UE-load at each UAV and energy consumption for all the UEs.
KW - Multi-Agent Deep Reinforcement Learning
KW - MADDPG
KW - Mobile Edge Computing
KW - UAV
KW - Trajectory Control
UR - http://www.scopus.com/inward/record.url?scp=85091953434&partnerID=8YFLogxK
U2 - 10.1109/TCCN.2020.3027695
DO - 10.1109/TCCN.2020.3027695
M3 - Article
SN - 2332-7731
VL - 7
SP - 73
EP - 84
JO - IEEE Transactions on Cognitive Communications and Networking
JF - IEEE Transactions on Cognitive Communications and Networking
IS - 1
ER -