
Latvian WordNet

    Research output: Chapter in Book/Report/Conference proceeding › Conference paper › Research › peer-review

    3 Citations (Scopus)

    Abstract

    This paper describes the recently developed Latvian WordNet and the main linguistic principles used in its development. The inventory of words and senses is based on the Tēzaurs.lv online dictionary, with the senses of the most frequently used words restructured on the basis of corpus evidence. The semantic linking methodology adapts Princeton WordNet principles to fit Latvian language usage and the existing linguistic tradition. The semantic links include hyponymy, meronymy, antonymy, similarity, conceptual connection and gradation. We also measure inter-annotator agreement for different types of semantic links. The dataset consists of 7609 words linked in 6515 synsets. Of these words, 1266 are considered fully completed: all their outgoing semantic links are annotated, corpus examples are assigned to each sense, and links to the English Princeton WordNet are formed. The data is available to the public on Tēzaurs.lv as an addition to the general dictionary data, and is also published as a downloadable dataset.

    Original language: English
    Title of host publication: 12th Global Wordnet Conference, GWC 2023
    Editors: German Rigau, Francis Bond, Alexandre Rademaker
    Place of Publication: [Leioa]
    Publisher: Global WordNet Association
    Pages: 187-196
    Number of pages: 10
    ISBN (Electronic): 9781713890881
    ISBN (Print): 978-84-09-53956-7, 9781713890881
    Publication status: Published - 2023

    Publication series

    Name: 12th Global Wordnet Conference, GWC 2023
