Pāriet uz galveno navigāciju Pāriet uz meklēšanu Pāriet uz galveno saturu

Tracing Mistakes and Finding Gaps in Automatic Word Alignments for Latvian-English Translation

  • Valdis Girgzdis
  • , Maija Kale*
  • , Martins Vaicekauskis
  • , Ieva Zarina
  • , Inguna Skadiņa
  • *Šī darba korespondējošais autors
  • University of Latvia
  • Tilde Company

Zinātniskās darbības rezultāts: Nodaļa grāmatā/enciklopēdijā/konferences krājumāKonferences zinātniskais rakstsPētniecībakoleģiāli recenzēts

5 Atsauces (Scopus)

Kopsavilkums

This paper aims to contribute to an in-depth understanding of computer based word alignment processes in machine translation (MT). The performance of word alignment, based on IBM models and incorporated in GIZA++, has been widely discussed in machine translation literature. The debate has lead towards a general consensus that GIZA++ does not provide sufficiently good results for word alignments. In this paper, we analyse the performance of GIZA++ and Fast Align for the Latvian-English pair against the manually aligned Gold Standard. Experiments showed that Fast Align proved to be approximately 2-3% more accurate and three times faster than GIZA++ in the alignment task. Where it concerns pre-processing, the removal of articles has a small, but positive, influence on alignment quality and machine translation output. We also present a Word Alignment Visualisation tool for analysis and editing of word alignments.

OriģinālvalodaAngļu
Rīkotāja publikācijas nosaukumsHuman Language Technologies - The Baltic Perspective
Rīkotāja publikācijas apakšnosaukumsProceedings of the 6th International Conference Baltic HLT 2014
RedaktoriAndrius Utka, Gintare Grigonyte, Jurgita Kapociute-Dzikiene, Jurgita Vaicenoniene
IzdevējsIOS Press BV
Lapas87-94
Lapu skaits8
ISBN (Elektroniski)9781614994411
DOIs
Publikācijas statussPublicēts - 2014
Pasākums6th International Conference on Human Language Technologies - The Baltic Perspective, Baltic HLT 2014 - Kaunas, Lietuva
Ilgums: 26 sept. 201427 sept. 2014

Publikāciju sērijas

NosaukumsFrontiers in Artificial Intelligence and Applications
Sējums268
ISSN (Drukātā versija)0922-6389
ISSN (Elektroniskā versija)1879-8314

Konference

Konference6th International Conference on Human Language Technologies - The Baltic Perspective, Baltic HLT 2014
Valsts/TeritorijaLietuva
PilsētaKaunas
Periods26/09/1427/09/14

Nospiedums

Uzziniet vairāk par pētniecības tēmām “Tracing Mistakes and Finding Gaps in Automatic Word Alignments for Latvian-English Translation”. Kopā tie veido unikālu nospiedumu.

Citēt šo