TY - CHAP
T1 - An Enhanced Intelligent Agent with Image Description Generation
AU - Fielding, Ben
AU - Kinghorn, Philip
AU - Mistry, Kamlesh
AU - Zhang, Li
PY - 2016/11/23
Y1 - 2016/11/23
N2 - In this paper, we present an Embodied Conversational Agent (ECA) enriched with automatic image understanding, using vision data derived from state-of-the-art machine learning techniques to advance autonomous interaction with the elderly or infirm. The agent is developed to monitor the health and emotional well-being of the elderly. It can not only answer questions through speech-based interaction, but also provide analysis of the user’s surroundings, company, emotional states, hazards and fall actions from visual data using deep learning techniques. The agent is accessible from a web browser and is operated by voice, with a webcam required for the visual analysis functionality. The system has been evaluated with diverse real-life images to demonstrate its efficiency.
AB - In this paper, we present an Embodied Conversational Agent (ECA) enriched with automatic image understanding, using vision data derived from state-of-the-art machine learning techniques to advance autonomous interaction with the elderly or infirm. The agent is developed to monitor the health and emotional well-being of the elderly. It can not only answer questions through speech-based interaction, but also provide analysis of the user’s surroundings, company, emotional states, hazards and fall actions from visual data using deep learning techniques. The agent is accessible from a web browser and is operated by voice, with a webcam required for the visual analysis functionality. The system has been evaluated with diverse real-life images to demonstrate its efficiency.
KW - Intelligent conversational agent
KW - Image description generation
KW - Human agent interaction
UR - https://www.scopus.com/pages/publications/84994493968
U2 - 10.1007/978-3-319-47665-0_10
DO - 10.1007/978-3-319-47665-0_10
M3 - Chapter
SN - 978-3-319-47664-3
VL - 10011
T3 - Lecture Notes in Computer Science
SP - 110
EP - 119
BT - Intelligent Virtual Agents
PB - Springer
CY - London
ER -