T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification

Unanue, IJ; Haffari, G; Piccardi, M

T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification

Unanue, IJ Haffari, G Piccardi, M

Permalink

Publisher:: MIT PRESS
Publication Type:: Journal Article
Citation:: Transactions of the Association for Computational Linguistics, 2023, 11, pp. 1147-1161
Issue Date:: 2023-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (521.73 kB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Unanue, IJ
dc.contributor.author	Haffari, G
dc.contributor.author	Piccardi, M https://orcid.org/0000-0001-9250-6604
dc.date.accessioned	2024-04-12T05:28:40Z
dc.date.available	2024-04-12T05:28:40Z
dc.date.issued	2023-01-01
dc.identifier.citation	Transactions of the Association for Computational Linguistics, 2023, 11, pp. 1147-1161
dc.identifier.issn	2307-387X
dc.identifier.issn	2307-387X
dc.identifier.uri	http://hdl.handle.net/10453/177846
dc.description.abstract	Cross-lingual text classification leverages text classifiers trained in a high-resource language to perform text classification in other languages with no or minimal fine-tuning (zero/ few-shots cross-lingual transfer). Nowadays, cross-lingual text classifiers are typically built on large-scale, multilingual language models (LMs) pretrained on a variety of languages of interest. However, the performance of these models varies significantly across languages and classification tasks, suggesting that the superposition of the language modelling and classification tasks is not always effective. For this reason, in this paper we propose revis-iting the classic ‘‘translate-and-test’’ pipeline to neatly separate the translation and classification stages. The proposed approach couples 1) a neural machine translator translating from the targeted language to a high-resource lan-guage, with 2) a text classifier trained in the high-resource language, but the neural machine translator generates ‘‘soft’’ translations to permit end-to-end backpropagation during fine-tuning of the pipeline. Extensive experi-ments have been carried out over three cross-lingual text classification datasets (XNLI, MLDoc, and MultiEURLEX), with the results showing that the proposed approach has significantly improved performance over a com-petitive baseline.
dc.language	English
dc.publisher	MIT PRESS
dc.relation.ispartof	Transactions of the Association for Computational Linguistics
dc.relation.isbasedon	10.1162/tacl_a_00593
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	0801 Artificial Intelligence and Image Processing, 1702 Cognitive Sciences, 2004 Linguistics
dc.subject.classification	4602 Artificial intelligence
dc.subject.classification	4704 Linguistics
dc.title	T3L: Translate-and-Test Transfer Learning for Cross-Lingual Text Classification
dc.type	Journal Article
utslib.citation.volume	11
utslib.for	0801 Artificial Intelligence and Image Processing
utslib.for	1702 Cognitive Sciences
utslib.for	2004 Linguistics
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	open_access	*
dc.date.updated	2024-04-12T05:28:38Z
pubs.publication-status	Published
pubs.volume	11

Abstract:

Cross-lingual text classification leverages text classifiers trained in a high-resource language to perform text classification in other languages with no or minimal fine-tuning (zero/ few-shots cross-lingual transfer). Nowadays, cross-lingual text classifiers are typically built on large-scale, multilingual language models (LMs) pretrained on a variety of languages of interest. However, the performance of these models varies significantly across languages and classification tasks, suggesting that the superposition of the language modelling and classification tasks is not always effective. For this reason, in this paper we propose revis-iting the classic ‘‘translate-and-test’’ pipeline to neatly separate the translation and classification stages. The proposed approach couples 1) a neural machine translator translating from the targeted language to a high-resource lan-guage, with 2) a text classifier trained in the high-resource language, but the neural machine translator generates ‘‘soft’’ translations to permit end-to-end backpropagation during fine-tuning of the pipeline. Extensive experi-ments have been carried out over three cross-lingual text classification datasets (XNLI, MLDoc, and MultiEURLEX), with the results showing that the proposed approach has significantly improved performance over a com-petitive baseline.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/177846