Pāriet uz galveno navigāciju Pāriet uz meklēšanu Pāriet uz galveno saturu

Evaluating Open-Source LLMs in Low-Resource Languages: Insights from Latvian High School Exams

  • Institute of Mathematics and Computer Science
  • University of Latvia

Zinātniskās darbības rezultāts: Nodaļa grāmatā/enciklopēdijā/konferences krājumāKonferences zinātniskais rakstsPētniecībakoleģiāli recenzēts

4 Atsauces (Scopus)

Kopsavilkums

The latest large language models (LLM) have significantly advanced natural language processing (NLP) capabilities across various tasks.However, their performance in low-resource languages, such as Latvian with 1.5 million native speakers, remains substantially underexplored due to both limited training data and the absence of comprehensive evaluation benchmarks.This study addresses this gap by conducting a systematic assessment of prominent open-source LLMs on natural language understanding (NLU) and natural language generation (NLG) tasks in Latvian.We utilize standardized high school centralized graduation exams as a benchmark dataset, offering relatable and diverse evaluation scenarios that encompass multiple-choice questions and complex text analysis tasks.Our experimental setup involves testing models from the leading LLM families, including Llama, Qwen, Gemma, and Mistral, with OpenAI's GPT-4 serving as a performance reference.The results reveal that certain open-source models demonstrate competitive performance in NLU tasks, narrowing the gap with GPT-4.However, all models exhibit notable deficiencies in NLG tasks, specifically in generating coherent and contextually appropriate text analyses, highlighting persistent challenges in NLG for low-resource languages.These findings contribute to efforts to develop robust multilingual benchmarks and to improve LLM performance in diverse linguistic contexts.

OriģinālvalodaAngļu
Rīkotāja publikācijas nosaukumsNlp4dh 2024 4th International Conference on Natural Language Processing for Digital Humanities Proceedings of the Conference
RedaktoriMika Hamalainen, Emily Ohman, So Miyagawa, Khalid Alnajjar, Yuri Bizzoni
Publikācijas vietaStroudsburg
IzdevējsAssociation for Computational Linguistics
Lapas289-293
Lapu skaits5
ISBN (Elektroniski)9798891761810
ISBN (Drukātā versija)979-889176181-0, 9798891761810
DOIs
Publikācijas statussPublicēts - 2024

Publikāciju sērijas

NosaukumsNLP4DH 2024 - 4th International Conference on Natural Language Processing for Digital Humanities, Proceedings of the Conference

OECD Zinātnes nozare

  • 1.2 Datorzinātne un informātika

Nospiedums

Uzziniet vairāk par pētniecības tēmām “Evaluating Open-Source LLMs in Low-Resource Languages: Insights from Latvian High School Exams”. Kopā tie veido unikālu nospiedumu.

Citēt šo