Skip to main navigation Skip to search Skip to main content

Error-Annotated Corpus of Latvian

  • Tilde Company

Research output: Chapter in Book/Report/Conference proceedingConference paperResearchpeer-review

6 Citations (Scopus)

Abstract

This paper reports on the development of the annotated Latvian language error corpus designed for grammar checker development and evaluation. We describe the error classification system introduced for this purpose, the annotation process, and guidelines. Two corpora (the corpus of student papers and the balanced text corpus) consisting of a total of 20,877 sentences have been created and annotated. A general characterisation of the corpora and a summary of the annotation results are presented.

Original languageEnglish
Title of host publicationHuman Language Technologies - The Baltic Perspective
Subtitle of host publicationProceedings of the 6th International Conference Baltic HLT 2014
EditorsAndrius Utka, Gintare Grigonyte, Jurgita Kapociute-Dzikiene, Jurgita Vaicenoniene
PublisherIOS Press BV
Pages163-166
Number of pages4
ISBN (Electronic)9781614994411
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event6th International Conference on Human Language Technologies - The Baltic Perspective, Baltic HLT 2014 - Kaunas, Lithuania
Duration: 26 Sept 201427 Sept 2014

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume268
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference6th International Conference on Human Language Technologies - The Baltic Perspective, Baltic HLT 2014
Country/TerritoryLithuania
CityKaunas
Period26/09/1427/09/14

Keywords

  • corpus annotation
  • error annotated corpus
  • Error classification
  • grammar checking
  • Latvian language

Fingerprint

Dive into the research topics of 'Error-Annotated Corpus of Latvian'. Together they form a unique fingerprint.

Cite this