CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models

Zhao, J; Fang, M; Shi, Z; Li, Y; Chen, L; Pechenizkiy, M

CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models

Zhao, J Fang, M Shi, Z Li, Y Chen, L

Pechenizkiy, M

Permalink

Publisher:: Association for Computational Linguistics (ACL)
Publication Type:: Conference Proceeding
Citation:: Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023, 1, pp. 13538-13556
Issue Date:: 2023-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published VersionAdobe PDF (832.42 kB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Zhao, J
dc.contributor.author	Fang, M
dc.contributor.author	Shi, Z
dc.contributor.author	Li, Y
dc.contributor.author	Chen, L https://orcid.org/0000-0002-6468-5729
dc.contributor.author	Pechenizkiy, M
dc.date	2023-07
dc.date.accessioned	2024-03-02T07:04:19Z
dc.date.available	2024-03-02T07:04:19Z
dc.date.issued	2023-01-01
dc.identifier.citation	Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023, 1, pp. 13538-13556
dc.identifier.isbn	9781959429722
dc.identifier.issn	0736-587X
dc.identifier.uri	http://hdl.handle.net/10453/176018
dc.description.abstract	Pretrained conversational agents have been exposed to safety issues, exhibiting a range of stereotypical human biases such as gender bias. However, there are still limited bias categories in current research, and most of them only focus on English. In this paper, we introduce a new Chinese dataset, CHBias, for bias evaluation and mitigation of Chinese conversational language models. Apart from those previous well-explored bias categories, CHBias includes under-explored bias categories, such as ageism and appearance biases, which received less attention. We evaluate two popular pretrained Chinese conversational models, CDial-GPT and EVA2.0, using CHBias. Furthermore, to mitigate different biases, we apply several debiasing methods to the Chinese pretrained models. Experimental results show that these Chinese pretrained models are potentially risky for generating texts that contain social biases, and debiasing methods using the proposed dataset can make response generation less biased while preserving the models' conversational capabilities.
dc.description.uri	http://dx.doi.org/10.18653/v1/2023.acl-long.757
dc.language	en
dc.publisher	Association for Computational Linguistics (ACL)
dc.relation.ispartof	Proceedings of the Annual Meeting of the Association for Computational Linguistics
dc.relation.ispartof	Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
dc.relation.isbasedon	10.18653/v1/2023.acl-long.757
dc.rights	info:eu-repo/semantics/openAccess
dc.title	CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models
dc.type	Conference Proceeding
utslib.citation.volume	1
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	open_access	*
dc.date.updated	2024-03-02T07:04:16Z
dc.identifier.doi	10.18653/v1/2023.acl-long.757
pubs.finish-date	2023-07
pubs.publication-status	Published
pubs.start-date	2023-07
pubs.volume	1

Abstract:

Pretrained conversational agents have been exposed to safety issues, exhibiting a range of stereotypical human biases such as gender bias. However, there are still limited bias categories in current research, and most of them only focus on English. In this paper, we introduce a new Chinese dataset, CHBias, for bias evaluation and mitigation of Chinese conversational language models. Apart from those previous well-explored bias categories, CHBias includes under-explored bias categories, such as ageism and appearance biases, which received less attention. We evaluate two popular pretrained Chinese conversational models, CDial-GPT and EVA2.0, using CHBias. Furthermore, to mitigate different biases, we apply several debiasing methods to the Chinese pretrained models. Experimental results show that these Chinese pretrained models are potentially risky for generating texts that contain social biases, and debiasing methods using the proposed dataset can make response generation less biased while preserving the models' conversational capabilities.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/176018