Abstract
While there is a renewed interest in voice user interfaces (VUI) in HCI, little attention has been paid to the design of VUI voice output beyond intelligibility and naturalness. We draw on the field of sociophonetics - the study of the social factors that influence the production and perception of speech - to highlight how current VUIs are based on a limited and homogenised set of voice outputs. We argue that current systems do not adequately consider the diversity of peoples’ speech, how that diversity represents sociocultural identities, and how voices have the potential to shape user perceptions and experiences. Ultimately, as other technological developments have influenced the ideologies of language, the voice outputs of VUIs will influence the ideologies of speech. Based on our argument, we pose three design strategies for VUI voice output design - individualisation, context awareness, and diversification - to motivate new ways of conceptualising and designing these technologies.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19) |
Subtitle of host publication | May 4–9, 2019, Glasgow, Scotland UK |
Place of Publication | New York, NY, USA |
Publisher | ACM |
Number of pages | 14 |
ISBN (Electronic) | 9781450359702 |
DOIs | |
Publication status | Published - 2 May 2019 |
Event | ACM CHI Conference on Human Factors in Computing Systems 2019: CHI’19 Workshop: HCI in China: Research Agenda, Education Curriculum, Industry Partnership, and Communities Building - Scottish Event Campus, Glasgow, United Kingdom Duration: 4 May 2019 → 9 May 2019 https://chi2019.acm.org/ http://chi2019.acm.org |
Conference
Conference | ACM CHI Conference on Human Factors in Computing Systems 2019 |
---|---|
Abbreviated title | CHI 2019 |
Country/Territory | United Kingdom |
City | Glasgow |
Period | 4/05/19 → 9/05/19 |
Internet address |
Keywords
- Voice user interface
- sociophonetics
- design material
- experience-centered design
- computer synthesised speech