Improving the Robustness of Summarization Systems with Dual Augmentation

Chen, X; Long, G; Tao, C; Li, M; Gao, X; Zhang, C; Zhang, X

Improving the Robustness of Summarization Systems with Dual Augmentation

Chen, X Long, G

Tao, C Li, M Gao, X Zhang, C

Zhang, X

Permalink

Publication Type:: Conference Proceeding
Citation:: Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023, 1, pp. 6846-6857
Issue Date:: 2023-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Accepted versionAdobe PDF (639.57 kB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Chen, X
dc.contributor.author	Long, G https://orcid.org/0000-0003-3740-9515
dc.contributor.author	Tao, C
dc.contributor.author	Li, M
dc.contributor.author	Gao, X
dc.contributor.author	Zhang, C https://orcid.org/0000-0001-5715-7154
dc.contributor.author	Zhang, X
dc.date.accessioned	2024-03-13T01:07:41Z
dc.date.available	2024-03-13T01:07:41Z
dc.date.issued	2023-01-01
dc.identifier.citation	Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023, 1, pp. 6846-6857
dc.identifier.isbn	9781959429722
dc.identifier.issn	0736-587X
dc.identifier.uri	http://hdl.handle.net/10453/176605
dc.description.abstract	A robust summarization system should be able to capture the gist of the document, regardless of the specific word choices or noise in the input. In this work, we first explore the summarization models' robustness against perturbations including word-level synonym substitution and noise. To create semantic-consistent substitutes, we propose a SummAttacker, which is an efficient approach to generating adversarial samples based on language models. Experimental results show that state-of-the-art summarization models have a significant decrease in performance on adversarial and noisy test sets. Next, we analyze the vulnerability of the summarization systems and explore improving the robustness by data augmentation. Specifically, the first brittleness factor we found is the poor understanding of infrequent words in the input. Correspondingly, we feed the encoder with more diverse cases created by SummAttacker in the input space. The other factor is in the latent space, where the attacked inputs bring more variations to the hidden states. Hence, we construct adversarial decoder input and devise manifold softmixing operation in hidden space to introduce more diversity. Experimental results on Gigaword and CNN/DM datasets demonstrate that our approach achieves significant improvements over strong baselines and exhibits higher robustness on noisy, attacked, and clean datasets.
dc.language	en
dc.relation.ispartof	Proceedings of the Annual Meeting of the Association for Computational Linguistics
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Improving the Robustness of Summarization Systems with Dual Augmentation
dc.type	Conference Proceeding
utslib.citation.volume	1
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/DVC (International)
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/Strength - ACRI - Australia China Relations Institute
pubs.organisational-group	University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	open_access	*
dc.date.updated	2024-03-13T01:07:40Z
pubs.publication-status	Published
pubs.volume	1

Abstract:

A robust summarization system should be able to capture the gist of the document, regardless of the specific word choices or noise in the input. In this work, we first explore the summarization models' robustness against perturbations including word-level synonym substitution and noise. To create semantic-consistent substitutes, we propose a SummAttacker, which is an efficient approach to generating adversarial samples based on language models. Experimental results show that state-of-the-art summarization models have a significant decrease in performance on adversarial and noisy test sets. Next, we analyze the vulnerability of the summarization systems and explore improving the robustness by data augmentation. Specifically, the first brittleness factor we found is the poor understanding of infrequent words in the input. Correspondingly, we feed the encoder with more diverse cases created by SummAttacker in the input space. The other factor is in the latent space, where the attacked inputs bring more variations to the hidden states. Hence, we construct adversarial decoder input and devise manifold softmixing operation in hidden space to introduce more diversity. Experimental results on Gigaword and CNN/DM datasets demonstrate that our approach achieves significant improvements over strong baselines and exhibits higher robustness on noisy, attacked, and clean datasets.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/176605