TY - JOUR
T1 - Robustness and performance of Deep Reinforcement Learning
AU - Al-Nima, Raid Rafi Omar
AU - Han, Tingting
AU - Al-Sumaidaee, Saadoon Awad Mohammed
AU - Chen, Taolue
AU - Woo, Wai Lok
N1 - Funding information: This work is supported by EPSRC grants (EP/P015387/1, EP/P00430X/1), Birkbeck BEI School Project (ARTEFACT), Guangdong Science and Technology Department grant (No.2018B010107004), China.
PY - 2021/7/1
Y1 - 2021/7/1
N2 - Deep Reinforcement Learning (DRL) has recently obtained considerable attentions. It empowers Reinforcement Learning (RL) with Deep Learning (DL) techniques to address various difficult tasks. In this paper, a novel approach called the Genetic Algorithm of Neuron Coverage (GANC) is proposed. It is motivated for improving the robustness and performance of a DRL network. The GANC uses Genetic Algorithm (GA) to maximise the Neuron Coverage (NC) of a DRL network by producing augmented inputs. We apply this method in the self-driving car applications, where it is crucial to accurately provide a correct decision for different road tracking views. We evaluate our method on the SYNTHIA-SEQS-05 databases in four different driving environments. Our outcomes are very promising – the best driving accuracy reached 97.75% – and are superior to the state-of-the-art results.
AB - Deep Reinforcement Learning (DRL) has recently obtained considerable attentions. It empowers Reinforcement Learning (RL) with Deep Learning (DL) techniques to address various difficult tasks. In this paper, a novel approach called the Genetic Algorithm of Neuron Coverage (GANC) is proposed. It is motivated for improving the robustness and performance of a DRL network. The GANC uses Genetic Algorithm (GA) to maximise the Neuron Coverage (NC) of a DRL network by producing augmented inputs. We apply this method in the self-driving car applications, where it is crucial to accurately provide a correct decision for different road tracking views. We evaluate our method on the SYNTHIA-SEQS-05 databases in four different driving environments. Our outcomes are very promising – the best driving accuracy reached 97.75% – and are superior to the state-of-the-art results.
KW - Deep Reinforcement Learning
KW - Genetic Algorithm
KW - Neuron Coverage
KW - Road tracking
UR - http://www.scopus.com/inward/record.url?scp=85102966100&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2021.107295
DO - 10.1016/j.asoc.2021.107295
M3 - Article
AN - SCOPUS:85102966100
SN - 1568-4946
VL - 105
JO - Applied Soft Computing
JF - Applied Soft Computing
M1 - 107295
ER -