Description
The Database of Latvian Morphemes and Derivational Models (DLMDM) is a corpus-based derivational morphology resource developed at the Department of Latvian and Baltic Studies, Faculty of Humanities, University of Latvia. The core of the database consists of lemmas imported from the Balanced Corpus of Modern Latvian (LVK2018), with additional lemmas from other sources added to improve coverage of Latvian derivational morphology. The morphemic segmentation, part-of-speech information, morphological features, and derivational data have been manually validated at the lemma level. DLMDM provides four cross-indexed linguistic registers: the lemma register, the root register, the affix register, and the source register. Each register captures a different layer of derivational morphology information and is distributed as a UTF-8 tab-separated file.
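Because each register is a UTF-8 tab-separated file, it can be read with a standard CSV parser. The following is a minimal sketch only: it assumes each file starts with a header row, and the column names (`lemma`, `pos`, `segmentation`) and the sample row are hypothetical placeholders for illustration, not the actual DLMDM schema or data.

```python
import csv
import io


def read_register(stream):
    """Read a tab-separated register into a list of dicts keyed by column name.

    Assumes the first line is a header row (an assumption, not confirmed
    by the dataset description).
    """
    # QUOTE_NONE treats quote characters as literal text, which is the
    # usual convention for plain tab-separated linguistic data.
    reader = csv.DictReader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    return list(reader)


# Hypothetical sample content; the real column names and values may differ.
SAMPLE = "lemma\tpos\tsegmentation\nlasītājs\tnoun\tlas-ī-tāj-s\n"

rows = read_register(io.StringIO(SAMPLE))
for row in rows:
    print(row["lemma"], row["pos"], row["segmentation"])
```

For a downloaded file, the same function can be given a handle opened with `open(path, encoding="utf-8", newline="")`, where `path` points to whichever register file is being read.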
| Date made available | Mar 2026 |
|---|---|
| Publisher | CLARIN Centre of Latvian language resources and tools |
| Date of data production | 1 Apr 2023 - 31 Mar 2026 |
| Geographical coverage | Latvia |