TY - JOUR
T1 - Deep reinforcement learning-based long-range autonomous valet parking for smart cities
AU - Khalid, Muhammad
AU - Wang, Liang
AU - Wang, Kezhi
AU - Aslam, Nauman
AU - Pan, Cunhua
AU - Cao, Yue
N1 - Funding Information:
Funding: Wuhan Key Research and Development Program (2022012202015016)
PY - 2023/2/1
Y1 - 2023/2/1
N2 - In this paper, to reduce the congestion rate at the city center and increase the traveling quality of experience (QoE) of each user, the framework of long-range autonomous valet parking is presented. Here, an Autonomous Vehicle (AV) is deployed to pick up, and drop off users at their required spots, and then drive to the car park around well-organized places of city autonomously. In this framework, we aim to minimize the overall distance of AV, while guarantee all users are served with great QoE, i.e., picking up, and dropping off users at their required spots through optimizing the path planning of the AV and number of serving time slots. To this end, we first present a learning-based algorithm, which is named as Double-Layer Ant Colony Optimization (DLACO) algorithm to solve the above problem in an iterative way. Then, to make the fast decision, while considers the dynamic environment (i.e., the AV may pick up and drop off users from different locations), we further present a deep reinforcement learning-based algorithm, i.e., Deep Q-learning Network (DQN) to solve this problem. Experimental results show that the DL-ACO and DQN-based algorithms both achieve the considerable performance.
AB - In this paper, to reduce the congestion rate at the city center and increase the traveling quality of experience (QoE) of each user, the framework of long-range autonomous valet parking is presented. Here, an Autonomous Vehicle (AV) is deployed to pick up, and drop off users at their required spots, and then drive to the car park around well-organized places of city autonomously. In this framework, we aim to minimize the overall distance of AV, while guarantee all users are served with great QoE, i.e., picking up, and dropping off users at their required spots through optimizing the path planning of the AV and number of serving time slots. To this end, we first present a learning-based algorithm, which is named as Double-Layer Ant Colony Optimization (DLACO) algorithm to solve the above problem in an iterative way. Then, to make the fast decision, while considers the dynamic environment (i.e., the AV may pick up and drop off users from different locations), we further present a deep reinforcement learning-based algorithm, i.e., Deep Q-learning Network (DQN) to solve this problem. Experimental results show that the DL-ACO and DQN-based algorithms both achieve the considerable performance.
KW - Ant colony optimization (ACO)
KW - Autonomous vehicle
KW - Deep reinforcement learning
KW - Long-range autonomous valet parking (LAVP)
KW - Sustainable cities and communities
UR - http://www.scopus.com/inward/record.url?scp=85145260285&partnerID=8YFLogxK
U2 - 10.1016/j.scs.2022.104311
DO - 10.1016/j.scs.2022.104311
M3 - Article
AN - SCOPUS:85145260285
VL - 89
JO - Sustainable Cities and Society
JF - Sustainable Cities and Society
SN - 2210-6707
M1 - 104311
ER -