
UniMorph 3.0: Universal Morphology

    Research output: Chapter in book/encyclopedia/conference proceedings; Conference paper; Research; Peer-reviewed

    77 Citations (Scopus)

    Abstract

    The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological paradigms for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. We have implemented several improvements to the extraction pipeline which creates most of our data, so that it is both more complete and more correct. We have added 66 new languages, as well as new parts of speech for 12 languages. We have also amended the schema in several ways. Finally, we present three new community tools: two to validate data for resource creators, and one to make morphological data available from the command line. UniMorph is based at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University in Baltimore, Maryland. This paper details advances made to the schema, tooling, and dissemination of project resources since the UniMorph 2.0 release described at LREC 2018.
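
As an illustration of the "type-level resource of annotated data" mentioned in the abstract, the following minimal Python sketch groups inflected forms by lemma. It assumes a three-column, tab-separated layout (lemma, inflected form, semicolon-separated feature tags); that layout, the sample rows, and the helper name `parse_unimorph` are assumptions made for this example and are not stated in the abstract.

```python
from collections import defaultdict


def parse_unimorph(lines):
    """Group (form, feature-set) pairs by lemma.

    Assumes each non-empty line is: lemma <TAB> form <TAB> tag;tag;...
    (hypothetical helper, not part of any official UniMorph tool).
    """
    paradigms = defaultdict(list)
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            continue  # skip blank separator lines
        lemma, form, tags = line.split("\t")
        paradigms[lemma].append((form, frozenset(tags.split(";"))))
    return dict(paradigms)


# Illustrative rows only; not taken from the released data.
SAMPLE = [
    "run\trunning\tV;V.PTCP;PRS",
    "run\tran\tV;PST",
    "walk\twalked\tV;PST",
]

paradigms = parse_unimorph(SAMPLE)

# Look up the past-tense form of each lemma by its feature tag.
past_forms = {
    lemma: form
    for lemma, cells in paradigms.items()
    for form, feats in cells
    if "PST" in feats
}
print(past_forms)
```

Storing each feature bundle as a set makes lookups independent of tag order, which suits a schema in which a bundle is an unordered collection of feature values.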

    Original language: English
    Title of host publication: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
    Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
    Place of publication: Paris
    Publisher: European Language Resources Association
    Pages: 3922-3931
    Number of pages: 10
    ISBN (Electronic): 9791095546344
    ISBN (Print): 9791095546344
    Publication status: Published - 2020

    Publication series

    Name: LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings

    OECD field of science

    • 6.2 Languages and literature
