Learning to improve persona consistency in conversation generation with information augmentation

Wang, W; Feng, S; Chen, L; Wang, D; Zhang, Y

Publisher: Elsevier BV
Publication Type: Journal Article
Citation: Knowledge-Based Systems, 2021, 228, pp. 107246-107246
Issue Date: 2021-09-27

Closed Access

Filename	Description	Size	Format
1-s2.0-S0950705121005086-main.pdf	Published version	2.79 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wang, W
dc.contributor.author	Feng, S
dc.contributor.author	Chen, L (ORCID: https://orcid.org/0000-0002-6468-5729)
dc.contributor.author	Wang, D
dc.contributor.author	Zhang, Y
dc.date.accessioned	2021-10-30T20:05:31Z
dc.date.available	2021-10-30T20:05:31Z
dc.date.issued	2021-09-27
dc.identifier.citation	Knowledge-Based Systems, 2021, 228, pp. 107246-107246
dc.identifier.issn	0950-7051
dc.identifier.uri	http://hdl.handle.net/10453/151272
dc.description.abstract	In an open-domain conversation system, maintaining consistent persona is a key factor to earn trust from users and engage users in the conversation. Existing methods suffer from the issue that only sparse persona-relevant signals are available in the target responses, leading to the generation of responses with inconsistent persona. To address the issue, in this paper, we propose two methods to augment persona learning signals for persona preservation. At the sentence level, we develop a dual variational learning model based on the bidirectional encoder representations from transformers (i.e., BERT), which enriches persona signals with relevant persona sentences, in addition to target responses. Therefore, both the encoder part and the latent variable can be guided to learn consistent persona features through back-propagation of losses, which will drive response decoding towards consistent persona expression. At the word level, we propose a persona-based calibration network, which is used to amplify the influence of persona-relevant words in target responses. The experimental results show that our developed model outperforms the strong baseline algorithms by large margins and effectively promotes persona consistency in conversation generation.
dc.language	en
dc.publisher	Elsevier BV
dc.relation.ispartof	Knowledge-Based Systems
dc.relation.isbasedon	10.1016/j.knosys.2021.107246
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	08 Information and Computing Sciences, 15 Commerce, Management, Tourism and Services, 17 Psychology and Cognitive Sciences
dc.subject.classification	Artificial Intelligence & Image Processing
dc.title	Learning to improve persona consistency in conversation generation with information augmentation
dc.type	Journal Article
utslib.citation.volume	228
utslib.for	08 Information and Computing Sciences
utslib.for	15 Commerce, Management, Tourism and Services
utslib.for	17 Psychology and Cognitive Sciences
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	closed_access	*
dc.date.updated	2021-10-30T20:05:28Z
pubs.publication-status	Published
pubs.volume	228

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/151272