English-Basque statistical and neural machine translation

Publication Type:
Conference Proceeding
Citation:
LREC 2018 - 11th International Conference on Language Resources and Evaluation, 2019, pp. 880 - 885
Issue Date:
2019-01-01
Filename Description Size
101.pdfPublished version148.93 kB
Adobe PDF
Full metadata record
© LREC 2018 - 11th International Conference on Language Resources and Evaluation. All rights reserved. Neural Machine Translation (NMT) has attracted increasing attention in the recent years. However, it tends to require very large training corpora which could prove problematic for languages with low resources. For this reason, Statistical Machine Translation (SMT) continues to be a popular approach for low-resource language pairs. In this work, we address English-Basque translation and compare the performance of three contemporary statistical and neural machine translation systems: OpenNMT, Moses SMT and Google Translate. For evaluation, we employ an open-domain and an IT-domain corpora from the WMT16 resources for machine translation. In addition, we release a small dataset (Berriak) of 500 highly-accurate English-Basque translations of complex sentences useful for a thorough testing of the translation systems.
Please use this identifier to cite or link to this item: