Skip to main navigation Skip to search Skip to main content

English-Latvian SMT: knowledge or data?

  • University of Latvia

Research output: Contribution to conferencePaperpeer-review

4 Citations (Scopus)

Abstract

In cases when phrase-based statistical machine translation (SMT) is applied to languages with rather free word order and rich morphology, translated texts often are not fluent due to misused inflectional forms and wrong word order between phrases or even inside the phrase. One of possible solutions how to improve translation quality is to apply factored models. The paper presents work on English-Latvian phrase-based and factored SMT systems and, using evaluation results, demonstrates that although factored models seem more appropriate for highly inflected languages, they have rather small influence on translation results, while using phrase-model with more data better translation quality could be achieved.

Original languageEnglish
Pages242-245
Number of pages4
Publication statusPublished - 2009
Externally publishedYes
Event17th Nordic Conference of Computational Linguistics, NODALIDA 2009 - Odense, Denmark
Duration: 14 May 200916 May 2009

Conference

Conference17th Nordic Conference of Computational Linguistics, NODALIDA 2009
Country/TerritoryDenmark
CityOdense
Period14/05/0916/05/09

Fingerprint

Dive into the research topics of 'English-Latvian SMT: knowledge or data?'. Together they form a unique fingerprint.

Cite this