Pāriet uz galveno navigāciju Pāriet uz meklēšanu Pāriet uz galveno saturu

NMT or SMT: Case Study of a Narrow-domain English-Latvian Post-editing Project

  • Tilde Company

Zinātniskās darbības rezultāts: Nodaļa grāmatā/enciklopēdijā/konferences krājumāKonferences zinātniskais rakstsPētniecībakoleģiāli recenzēts

10 Atsauces (Scopus)

Kopsavilkums

The recent technological shift in machine translation from statistical machine translation (SMT) to neural machine translation (NMT) raises the question of the strengths and weaknesses of NMT. In this paper, we present an analysis of NMT and SMT systems’ outputs from narrow domain English-Latvian MT systems that were trained on a rather small amount of data. We analyze post-edits produced by professional translators and manually annotated errors in these outputs. Analysis of post-edits allowed us to conclude that both approaches are comparably successful, allowing for an increase in translators’ productivity, with the NMT system showing slightly worse results. Through the analysis of annotated errors, we found that NMT translations are more fluent than SMT translations. However, errors related to accuracy, especially, mistranslation and omission errors, occur more often in NMT outputs. The word form errors, that characterize the morphological richness of Latvian, are frequent for both systems, but slightly fewer in NMT outputs.

OriģinālvalodaAngļu
Rīkotāja publikācijas nosaukums8th International Joint Conference on Natural Language Processing - Proceedings of the IJCNLP 2017
IzdevējsAssociation for Computational Linguistics (ACL)
Lapas373-383
Lapu skaits11
ISBN (Elektroniski)9781948087001
Publikācijas statussPublicēts - 2017
Ārēji publicēts
Pasākums8th International Joint Conference on Natural Language Processing, IJCNLP 2017 - Taipei, Taivāna, Ķīnas Province
Ilgums: 27 nov. 20171 dec. 2017

Publikāciju sērijas

Nosaukums8th International Joint Conference on Natural Language Processing - Proceedings of the IJCNLP 2017, System Demonstrations
Sējums1

Konference

Konference8th International Joint Conference on Natural Language Processing, IJCNLP 2017
Valsts/TeritorijaTaivāna, Ķīnas Province
PilsētaTaipei
Periods27/11/171/12/17

Nospiedums

Uzziniet vairāk par pētniecības tēmām “NMT or SMT: Case Study of a Narrow-domain English-Latvian Post-editing Project”. Kopā tie veido unikālu nospiedumu.

Citēt šo