BESA: BERT-based Simulated Annealing for Adversarial Text Attacks

Yang, X; Liu, W; Tao, D; Liu, W


Publisher: International Joint Conferences on Artificial Intelligence
Publication Type: Conference Proceeding
Citation: Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021, pp. 3293-3299
Issue Date: 2021-08-19


This item is open access.


Full metadata record

Field	Value	Language
dc.contributor.author	Yang, X
dc.contributor.author	Liu, W https://orcid.org/0000-0002-3003-1313
dc.contributor.author	Tao, D
dc.contributor.author	Liu, W https://orcid.org/0000-0002-3003-1313
dc.date	2021-08-19
dc.date.accessioned	2023-03-30T12:43:14Z
dc.date.available	2023-03-30T12:43:14Z
dc.date.issued	2021-08-19
dc.identifier.citation	Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021), 2021, pp. 3293-3299
dc.identifier.uri	http://hdl.handle.net/10453/168857
dc.description.abstract	Modern Natural Language Processing (NLP) models are known to be highly brittle to textual adversarial examples. Recent attack algorithms usually adopt word-level substitution strategies that follow a pre-computed word replacement mechanism. However, the resulting adversarial examples still fall short in grammatical correctness and semantic similarity, largely because of unsuitable candidate word selection and static optimization methods. In this research, we propose BESA, a BERT-based Simulated Annealing algorithm, to address these two problems. First, we leverage the BERT Masked Language Model (MLM) to generate context-aware candidate words, producing fluent adversarial text and avoiding grammatical errors. Second, we employ Simulated Annealing (SA) to adaptively determine the word substitution order. SA provides sufficient word replacement options via internal simulations, with the objective of obtaining both a high attack success rate and a low word substitution rate. In addition, our algorithm can jump out of local optima with a controlled probability, bringing it closer to the best possible attack (i.e., the global optimum). Experiments on five popular datasets demonstrate the superiority of BESA over existing methods, including TextFooler, BAE, BERT-Attack, PWWS, and PSO.
dc.language	en
dc.publisher	International Joint Conferences on Artificial Intelligence
dc.relation.ispartof	Proceedings of the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)
dc.relation.ispartof	Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
dc.relation.isbasedon	10.24963/ijcai.2021/453
dc.rights	info:eu-repo/semantics/openAccess
dc.title	BESA: BERT-based Simulated Annealing for Adversarial Text Attacks
dc.type	Conference Proceeding
utslib.location.activity	Montreal, Canada
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	open_access	*
pubs.consider-herdc	false
dc.date.updated	2023-03-30T12:43:13Z
pubs.finish-date	2021-08-26
pubs.publication-status	Published
pubs.start-date	2021-08-19

Abstract:

Modern Natural Language Processing (NLP) models are known to be highly brittle to textual adversarial examples. Recent attack algorithms usually adopt word-level substitution strategies that follow a pre-computed word replacement mechanism. However, the resulting adversarial examples still fall short in grammatical correctness and semantic similarity, largely because of unsuitable candidate word selection and static optimization methods. In this research, we propose BESA, a BERT-based Simulated Annealing algorithm, to address these two problems. First, we leverage the BERT Masked Language Model (MLM) to generate context-aware candidate words, producing fluent adversarial text and avoiding grammatical errors. Second, we employ Simulated Annealing (SA) to adaptively determine the word substitution order. SA provides sufficient word replacement options via internal simulations, with the objective of obtaining both a high attack success rate and a low word substitution rate. In addition, our algorithm can jump out of local optima with a controlled probability, bringing it closer to the best possible attack (i.e., the global optimum). Experiments on five popular datasets demonstrate the superiority of BESA over existing methods, including TextFooler, BAE, BERT-Attack, PWWS, and PSO.
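
Illustrative sketch (not the authors' implementation): the abstract describes a simulated annealing search over word substitutions that trades attack success against substitution rate and can accept a worse state with a controlled probability. The Python below shows only that generic acceptance rule on a toy problem. Everything in it is an assumption made for illustration: the `CANDIDATES` table stands in for BERT MLM proposals, `victim_confidence` stands in for a real victim classifier, and the energy function, penalty weight, and cooling schedule are hypothetical choices rather than values from the paper.

```python
import math
import random

# Hypothetical candidate table standing in for BERT MLM proposals.
CANDIDATES = {
    "loved": ["watched", "saw"],
    "great": ["fine", "decent"],
    "brilliant": ["long", "busy"],
}
POSITIVE = {"loved", "great", "brilliant"}


def victim_confidence(words):
    """Toy victim model: confidence in the original 'positive' label."""
    hits = sum(w in POSITIVE for w in words)
    return 1.0 / (1.0 + math.exp(-(2.0 * hits - 3.0)))


def energy(words, original, penalty=2.0):
    """Lower is better: victim confidence plus a substitution-rate penalty.

    With these toy numbers a single substitution raises the energy while
    two substitutions lower it, so the original text is a local optimum.
    """
    changed = sum(w != o for w, o in zip(words, original))
    return victim_confidence(words) + penalty * changed / len(original)


def anneal(original, steps=200, t0=1.0, cooling=0.95, seed=0):
    """Generic simulated annealing over word substitutions (assumed schedule)."""
    rng = random.Random(seed)
    positions = [i for i, w in enumerate(original) if w in CANDIDATES]
    current = list(original)
    best = list(current)
    temperature = t0
    for _ in range(steps):
        # Propose changing one position to a candidate or back to the original.
        i = rng.choice(positions)
        proposal = list(current)
        proposal[i] = rng.choice(CANDIDATES[original[i]] + [original[i]])
        delta = energy(proposal, original) - energy(current, original)
        # Always accept improvements; accept worse states with
        # probability exp(-delta / T), which shrinks as T cools.
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current = proposal
            if energy(current, original) < energy(best, original):
                best = list(current)
        temperature = max(temperature * cooling, 1e-3)
    return best


if __name__ == "__main__":
    sentence = "i loved this great and brilliant film".split()
    adversarial = anneal(sentence)
    print(" ".join(adversarial), round(victim_confidence(adversarial), 3))
```

A purely greedy search would stall at the original sentence in this toy setup, because every single substitution increases the energy; the probabilistic acceptance step is what lets the search cross that barrier while the temperature is still high.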

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/168857