Responses of Conversational Agents to Health and Lifestyle Prompts: Investigation of Appropriateness and Presentation Structures.

Kocaballi, AB; Quiroz, JC; Rezazadegan, D; Berkovsky, S; Magrabi, F; Coiera, E; Laranjo, L

Publisher: JMIR PUBLICATIONS, INC
Publication Type: Journal Article
Citation: Journal of Medical Internet Research, 2020, 22, (2)
Issue Date: 2020-02-09

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Kocaballi, AB
dc.contributor.author	Quiroz, JC
dc.contributor.author	Rezazadegan, D
dc.contributor.author	Berkovsky, S
dc.contributor.author	Magrabi, F
dc.contributor.author	Coiera, E
dc.contributor.author	Laranjo, L
dc.date.accessioned	2021-03-18T04:59:50Z
dc.date.available	2019-12-16
dc.date.available	2021-03-18T04:59:50Z
dc.date.issued	2020-02-09
dc.identifier.citation	Journal of medical Internet research, 2020, 22, (2)
dc.identifier.issn	1439-4456
dc.identifier.issn	1438-8871
dc.identifier.uri	http://hdl.handle.net/10453/147327
dc.format	Electronic
dc.language	eng
dc.publisher	JMIR PUBLICATIONS, INC
dc.relation.ispartof	Journal of medical Internet research
dc.relation.isbasedon	10.2196/15823
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	08 Information and Computing Sciences, 11 Medical and Health Sciences, 17 Psychology and Cognitive Sciences
dc.subject.classification	Medical Informatics
dc.subject.mesh	Communication
dc.subject.mesh	Humans
dc.subject.mesh	Life Style
dc.title	Responses of Conversational Agents to Health and Lifestyle Prompts: Investigation of Appropriateness and Presentation Structures.
dc.type	Journal Article
utslib.citation.volume	22
utslib.location.activity	Canada
utslib.for	08 Information and Computing Sciences
utslib.for	11 Medical and Health Sciences
utslib.for	17 Psychology and Cognitive Sciences
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2021-03-18T04:59:48Z
pubs.issue	2
pubs.publication-status	Published
pubs.volume	22
utslib.citation.issue	2

Abstract:

BACKGROUND: Conversational agents (CAs) are systems that mimic human conversations using text or spoken language. Their widely used examples include voice-activated systems such as Apple Siri, Google Assistant, Amazon Alexa, and Microsoft Cortana. The use of CAs in health care has been on the rise, but concerns about their potential safety risks often remain understudied.

OBJECTIVE: This study aimed to analyze how commonly available, general-purpose CAs on smartphones and smart speakers respond to health and lifestyle prompts (questions and open-ended statements) by examining their responses in terms of content and structure alike.

METHODS: We followed a piloted script to present health- and lifestyle-related prompts to 8 CAs. The CAs' responses were assessed for their appropriateness on the basis of the prompt type: responses to safety-critical prompts were deemed appropriate if they included a referral to a health professional or service, whereas responses to lifestyle prompts were deemed appropriate if they provided relevant information to address the problem prompted. The response structure was also examined according to information sources (Web search-based or precoded), response content style (informative and/or directive), confirmation of prompt recognition, and empathy.

RESULTS: The 8 studied CAs provided in total 240 responses to 30 prompts. They collectively responded appropriately to 41% (46/112) of the safety-critical and 39% (37/96) of the lifestyle prompts. The ratio of appropriate responses deteriorated when safety-critical prompts were rephrased or when the agent used a voice-only interface. The appropriate responses included mostly directive content and empathy statements for the safety-critical prompts and a mix of informative and directive content for the lifestyle prompts.

CONCLUSIONS: Our results suggest that the commonly available, general-purpose CAs on smartphones and smart speakers with unconstrained natural language interfaces are limited in their ability to advise on both the safety-critical health prompts and lifestyle prompts. Our study also identified some response structures the CAs employed to present their appropriate responses. Further investigation is needed to establish guidelines for designing suitable response structures for different prompt types.
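
The appropriateness criteria and the reported proportions can be restated as a short calculation. The sketch below is illustrative only. The function names, the boolean flags, and the dictionary layout are assumptions made for this example and are not taken from the study, where responses were assessed by the authors rather than by code. Only the counts (8 CAs, 30 prompts, 46/112, 37/96) come from the abstract.

```python
# Illustrative restatement of the abstract's criteria and arithmetic.
# All names are hypothetical; only the counts come from the abstract.

def is_appropriate(prompt_type: str, has_referral: bool, has_relevant_info: bool) -> bool:
    """Simplified encoding of the stated appropriateness criteria."""
    if prompt_type == "safety-critical":
        # Appropriate if the response refers to a health professional or service.
        return has_referral
    if prompt_type == "lifestyle":
        # Appropriate if the response gives information relevant to the problem.
        return has_relevant_info
    raise ValueError(f"unknown prompt type: {prompt_type}")


def percent(appropriate: int, total: int) -> int:
    """Share of appropriate responses as a whole-number percentage."""
    return round(100 * appropriate / total)


AGENTS = 8
PROMPTS = 30
total_responses = AGENTS * PROMPTS  # one response per agent per prompt

# (appropriate responses, assessed responses) per prompt type, as reported.
reported = {
    "safety-critical": (46, 112),
    "lifestyle": (37, 96),
}

if __name__ == "__main__":
    print(f"total responses: {total_responses}")
    for prompt_type, (appropriate, total) in reported.items():
        print(f"{prompt_type}: {appropriate}/{total} = {percent(appropriate, total)}%")
```

The two reported denominators sum to 208 of the 240 responses. The abstract does not say how the remaining 32 responses were categorized, so the sketch leaves them out.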

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/147327