Skip to main navigation Skip to search Skip to main content

Latvian FrameNet: Cross-lingual issues

  • Gunta Nespore-Be-Rzkalne*
  • , Baiba Saulite
  • , Normunds Gruzitis
  • *Corresponding author for this work
  • University of Latvia

Research output: Chapter in Book/Report/Conference proceedingConference paperResearchpeer-review

3 Citations (Scopus)

Abstract

This paper reports the lessons learned while creating a FrameNetannotated text corpus of Latvian. This is still an ongoing work, a part of a larger project which aims at the creation of a multilayer text corpus, anchored in crosslingual state-of-the-art representations: Universal Dependencies (UD), FrameNet and PropBank, as well as Abstract Meaning Representation (AMR). For the FrameNet layer, we use the latest frame inventory of Berkeley FrameNet (BFN v1.7), while the annotation itself is done on top of the underlying UD layer. We strictly follow a corpus-driven approach, meaning that lexical units (LU) in Latvian FrameNet are created only based on the annotated corpus examples. Since we are aiming at a medium-sized still general-purpose corpus, an important aspect that we take into account is the variety and balance of the corpus in terms of genres, domains and LUs. We have finished the first phase of the FrameNet corpus annotation, and we have collected and discuss cross-lingual issues and their possible solutions. The issues are relevant for other languages as well, particularly if the goal is to maintain cross-lingual compatibility via BFN.

Original languageEnglish
Title of host publicationHuman Language Technologies - The Baltic Perspective - Proceedings of the 8th International Conference, Baltic HLT 2018
EditorsKadri Muischnek, Kaili Muurisep
PublisherIOS Press BV
Pages96-103
Number of pages8
ISBN (Electronic)9781614999119
DOIs
Publication statusPublished - 2018
Externally publishedYes
Event8th International Conference on Human Language Technologies - The Baltic Perspective, Baltic HLT 2018 - Tartu, Estonia
Duration: 27 Sept 201829 Sept 2018

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume307
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

Conference8th International Conference on Human Language Technologies - The Baltic Perspective, Baltic HLT 2018
Country/TerritoryEstonia
CityTartu
Period27/09/1829/09/18

Keywords

  • Corpus
  • Cross-lingual
  • FrameNet
  • Latvian
  • NLU

Fingerprint

Dive into the research topics of 'Latvian FrameNet: Cross-lingual issues'. Together they form a unique fingerprint.

Cite this