ReWE: Regressing word embeddings for regularization of neural machine translation systems

Unanue, IJ; Borzeshi, EZ; Esmaili, N; Piccardi, M

ReWE: Regressing word embeddings for regularization of neural machine translation systems

Unanue, IJ Borzeshi, EZ Esmaili, N Piccardi, M

Permalink

Publication Type:: Journal Article
Citation:: NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 2019, 1, pp. 430-436
Issue Date:: 2019-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (537.68 kB)

Adobe PDF

Download Accepted versionAdobe PDF (501.76 kB)

Adobe PDF

Download Supporting informationAdobe PDF (45.03 kB)

Adobe PDF

Download full textAdobe PDF (3.09 MB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Unanue, IJ
dc.contributor.author	Borzeshi, EZ
dc.contributor.author	Esmaili, N
dc.contributor.author	Piccardi, M https://orcid.org/0000-0001-9250-6604
dc.date.accessioned	2021-03-29T06:51:55Z
dc.date.available	2021-03-29T06:51:55Z
dc.date.issued	2019-01-01
dc.identifier.citation	NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 2019, 1, pp. 430-436
dc.identifier.isbn	9781950737130
dc.identifier.uri	http://hdl.handle.net/10453/147615
dc.description.abstract	Regularization of neural machine translation is still a significant problem, especially in low-resource settings. To mollify this problem, we propose regressing word embeddings (ReWE) as a new regularization technique in a system that is jointly trained to predict the next word in the translation (categorical value) and its word embedding (continuous value). Such a joint training allows the proposed system to learn the distributional properties represented by the word embeddings, empirically improving the generalization to unseen sentences. Experiments over three translation datasets have showed a consistent improvement over a strong baseline, ranging between 0.91 and 2.54 BLEU points, and also a marked improvement over a state-of-the-art system.
dc.language	en
dc.relation.ispartof	NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference
dc.rights	info:eu-repo/semantics/openAccess
dc.title	ReWE: Regressing word embeddings for regularization of neural machine translation systems
dc.type	Journal Article
utslib.citation.volume	1
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	open_access	*
dc.date.updated	2021-03-29T06:51:52Z
pubs.publication-status	Published
pubs.volume	1

Abstract:

Regularization of neural machine translation is still a significant problem, especially in low-resource settings. To mollify this problem, we propose regressing word embeddings (ReWE) as a new regularization technique in a system that is jointly trained to predict the next word in the translation (categorical value) and its word embedding (continuous value). Such a joint training allows the proposed system to learn the distributional properties represented by the word embeddings, empirically improving the generalization to unseen sentences. Experiments over three translation datasets have showed a consistent improvement over a strong baseline, ranging between 0.91 and 2.54 BLEU points, and also a marked improvement over a state-of-the-art system.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/147615