TY - JOUR
T1 - 3D-Trajectory and Phase-Shift Design for RIS-Assisted UAV Systems using Deep Reinforcement Learning
AU - Mei, Haibo
AU - Yang, Kun
AU - Liu, Qiang
AU - Wang, Kezhi
N1 - Funding information: This work is partially founded by Natural Science Foundation of China (Grant Nos. 61620106011, U1705263 and 61871076), UESTC Yangtze Delta Region Research Institute - Quzhou (Grant No.: 2020D002) and EU H2020 Project COSAFE (GA-824019).
PY - 2022/3/1
Y1 - 2022/3/1
N2 - Unmanned aerial vehicle (UAV) can effectively work as temporary base station or access point in the air to transfer/receive data to/from ground terminals (GTs). However, UAV-GT links might be blocked by ground obstacles, like buildings in urban area, leading to a poor performance on data transferring rate. To address this problem, reconfigurable intelligent surface (RIS), as a promising technique, can intelligently reflect the received signals between UAV and GT to significantly enhance the communication quality. Under this deployment of RIS-assisted UAV, we intend to jointly optimize the 3D-space of the UAV and the phase-shift of the RIS to maximize the data transferring rate of the UAV, while minimizing the UAV propulsion energy. The joint problem is non-convex in its original form and difficult to be timely solved by using traditional method, like successive convex approximation (SCA). Therefore, to facilitate the online decision making to this joint problem, we leverage deep reinforcement learning (DRL) to learn the near-optimal solution, and the well known Double Deep Q-Network (DDQN) and Deep Deterministic Policy Gradient (DDPG) algorithms are ultilized. Numerical results show that DRL can effectively improve the energy-efficiency performance of the RIS-Assisted UAV system, compared with benchmark solutions.
AB - Unmanned aerial vehicle (UAV) can effectively work as temporary base station or access point in the air to transfer/receive data to/from ground terminals (GTs). However, UAV-GT links might be blocked by ground obstacles, like buildings in urban area, leading to a poor performance on data transferring rate. To address this problem, reconfigurable intelligent surface (RIS), as a promising technique, can intelligently reflect the received signals between UAV and GT to significantly enhance the communication quality. Under this deployment of RIS-assisted UAV, we intend to jointly optimize the 3D-space of the UAV and the phase-shift of the RIS to maximize the data transferring rate of the UAV, while minimizing the UAV propulsion energy. The joint problem is non-convex in its original form and difficult to be timely solved by using traditional method, like successive convex approximation (SCA). Therefore, to facilitate the online decision making to this joint problem, we leverage deep reinforcement learning (DRL) to learn the near-optimal solution, and the well known Double Deep Q-Network (DDQN) and Deep Deterministic Policy Gradient (DDPG) algorithms are ultilized. Numerical results show that DRL can effectively improve the energy-efficiency performance of the RIS-Assisted UAV system, compared with benchmark solutions.
KW - 3D-trajectory
KW - Reconfigurable intelligent surface (RIS)
KW - deep reinforcement learning (DRL)
KW - intelligent reflecting surface (IRS)
KW - unmanned aerial vehicle (UAV)
UR - http://www.scopus.com/inward/record.url?scp=85123288921&partnerID=8YFLogxK
U2 - 10.1109/TVT.2022.3143839
DO - 10.1109/TVT.2022.3143839
M3 - Article
AN - SCOPUS:85123288921
SN - 0018-9545
VL - 71
SP - 3020
EP - 3029
JO - IEEE Transactions on Vehicular Technology
JF - IEEE Transactions on Vehicular Technology
IS - 3
ER -