A Shared Attention Mechanism for Interpretation of Neural Automatic Post-Editing Systems

Jauregi Unanue, I; Zare Borzeshi, E; Piccardi, M

A Shared Attention Mechanism for Interpretation of Neural Automatic Post-Editing Systems

Jauregi Unanue, I

Zare Borzeshi, E Piccardi, M

Permalink

Publication Type:: Conference Proceeding
Citation:: 2018, pp. 11 - 17
Issue Date:: 2018-07-20

Closed Access

	Filename	Description	Size
	mypaper_v1.pdf	Accepted Manuscript version	320.07 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Jauregi Unanue, I https://orcid.org/0000-0001-6223-9584	en_US
dc.contributor.author	Zare Borzeshi, E	en_US
dc.contributor.author	Piccardi, M https://orcid.org/0000-0001-9250-6604	en_US
dc.date	2018-07-20	en_US
dc.date.issued	2018-07-20	en_US
dc.identifier.citation	2018, pp. 11 - 17	en_US
dc.identifier.isbn	978-1-948087-40-7	en_US
dc.identifier.uri	http://hdl.handle.net/10453/128997
dc.description.abstract	Automatic post-editing (APE) systems aim to correct the systematic errors made by machine translators. In this paper, we propose a neural APE system that encodes the source (src) and machine translated (mt) sentences with two separate encoders, but leverages a shared attention mechanism to better understand how the two inputs contribute to the generation of the post-edited (pe) sentences. Our empirical observations have showed that when the mt is incorrect, the attention shifts weight toward tokens in the src sentence to properly edit the incorrect translation. The model has been trained and evaluated on the official data from the WMT16 and WMT17 APE IT domain English-German shared tasks. Additionally, we have used the extra 500K artificial data provided by the shared task. Our system has been able to reproduce the accuracies of systems trained with the same data, while at the same time providing better interpretability.	en_US
dc.relation.ispartof	Workshop on Neural Machine Translation and Generation	en_US
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	A Shared Attention Mechanism for Interpretation of Neural Automatic Post-Editing Systems	en_US
dc.type	Conference Proceeding
utslib.location.activity	Melbourne, Australia	en_US
utslib.for	0803 Computer Software	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	closed_access	*
pubs.consider-herdc	true	en_US
pubs.finish-date	2018-07-20	en_US
pubs.publication-status	Published	en_US
pubs.start-date	2018-07-20	en_US

Abstract:

Automatic post-editing (APE) systems aim to correct the systematic errors made by machine translators. In this paper, we propose a neural APE system that encodes the source (src) and machine translated (mt) sentences with two separate encoders, but leverages a shared attention mechanism to better understand how the two inputs contribute to the generation of the post-edited (pe) sentences. Our empirical observations have showed that when the mt is incorrect, the attention shifts weight toward tokens in the src sentence to properly edit the incorrect translation. The model has been trained and evaluated on the official data from the WMT16 and WMT17 APE IT domain English-German shared tasks. Additionally, we have used the extra 500K artificial data provided by the shared task. Our system has been able to reproduce the accuracies of systems trained with the same data, while at the same time providing better interpretability.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/128997