Skip to main navigation Skip to search Skip to main content

NMT or SMT: Case Study of a Narrow-domain English-Latvian Post-editing Project

  • Tilde Company

Research output: Chapter in Book/Report/Conference proceedingConference paperResearchpeer-review

10 Citations (Scopus)

Abstract

The recent technological shift in machine translation from statistical machine translation (SMT) to neural machine translation (NMT) raises the question of the strengths and weaknesses of NMT. In this paper, we present an analysis of NMT and SMT systems’ outputs from narrow domain English-Latvian MT systems that were trained on a rather small amount of data. We analyze post-edits produced by professional translators and manually annotated errors in these outputs. Analysis of post-edits allowed us to conclude that both approaches are comparably successful, allowing for an increase in translators’ productivity, with the NMT system showing slightly worse results. Through the analysis of annotated errors, we found that NMT translations are more fluent than SMT translations. However, errors related to accuracy, especially, mistranslation and omission errors, occur more often in NMT outputs. The word form errors, that characterize the morphological richness of Latvian, are frequent for both systems, but slightly fewer in NMT outputs.

Original languageEnglish
Title of host publication8th International Joint Conference on Natural Language Processing - Proceedings of the IJCNLP 2017
PublisherAssociation for Computational Linguistics (ACL)
Pages373-383
Number of pages11
ISBN (Electronic)9781948087001
Publication statusPublished - 2017
Externally publishedYes
Event8th International Joint Conference on Natural Language Processing, IJCNLP 2017 - Taipei, Taiwan, Province of China
Duration: 27 Nov 20171 Dec 2017

Publication series

Name8th International Joint Conference on Natural Language Processing - Proceedings of the IJCNLP 2017, System Demonstrations
Volume1

Conference

Conference8th International Joint Conference on Natural Language Processing, IJCNLP 2017
Country/TerritoryTaiwan, Province of China
CityTaipei
Period27/11/171/12/17

Fingerprint

Dive into the research topics of 'NMT or SMT: Case Study of a Narrow-domain English-Latvian Post-editing Project'. Together they form a unique fingerprint.

Cite this