A vision enriched intelligent agent with image description generation (demonstration)

Li Zhang, Ben Fielding, Philip Kinghorn, Kamlesh Mistry

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

In this paper, we present an intelligent conversational agent enriched with automatic image understanding and facial expression recognition using state-of-the-art machine learning techniques for the advancement of autonomous interaction with the elderly or infirm. The agent is developed to conduct health and emotion well-being monitoring for the elderly. It is not only capable of conducting question-answering via speech-based interaction, but also able to provide analysis of the user's surroundings, emotional states, hazards and fall actions via visual data. The agent is accessible from a web browser and can be communicated with via voice or text means, with a webcam required for the visual analysis functionality. The system has been evaluated with diverse real-life images to prove its efficiency.

Original languageEnglish
Title of host publicationAAMAS 2016 - Proceedings of the 2016 International Conference on Autonomous Agents and Multiagent Systems
PublisherInternational Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Pages1488-1489
Number of pages2
ISBN (Electronic)9781450342391
Publication statusPublished - 30 May 2016
Event15th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2016 - Singapore, Singapore
Duration: 9 May 201613 May 2016

Conference

Conference15th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2016
Country/TerritorySingapore
CitySingapore
Period9/05/1613/05/16

Keywords

  • Agent architectures
  • Human-agent interaction
  • Image understanding

Fingerprint

Dive into the research topics of 'A vision enriched intelligent agent with image description generation (demonstration)'. Together they form a unique fingerprint.

Cite this