Abstract
When phrase-based statistical machine translation (SMT) is applied to languages with relatively free word order and rich morphology, the translated text is often not fluent because of misused inflectional forms and wrong word order between phrases or even within a phrase. One possible way to improve translation quality is to apply factored models. This paper presents work on English-Latvian phrase-based and factored SMT systems and, using evaluation results, demonstrates that although factored models seem more appropriate for highly inflected languages, they have a rather small influence on translation results, whereas better translation quality can be achieved by using a phrase-based model with more data.
| Original language | English |
|---|---|
| Pages | 242-245 |
| Number of pages | 4 |
| Publication status | Published - 2009 |
| Externally published | Yes |
| Event | 17th Nordic Conference of Computational Linguistics, NODALIDA 2009 - Odense, Denmark. Duration: 14 May 2009 → 16 May 2009 |
Conference
| Conference | 17th Nordic Conference of Computational Linguistics, NODALIDA 2009 |
|---|---|
| Country/Territory | Denmark |
| City | Odense |
| Period | 14/05/09 → 16/05/09 |
Paper title: 'English-Latvian SMT: knowledge or data?'