
Toward Federated Learning Through Intent Detection Research

  • Daiga Deksne *
  • Jurgita Kapočiūtė-Dzikienė
  • Raivis Skadiņš
  • *Corresponding author for this work
  • Faculty of Computing
  • Department of Computer Science
  • Tilde Company
  • Tilde IT // Vytautas Magnus University

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › Research › peer-review

Abstract

Modern organizational communication heavily relies on virtual assistants, necessitating robust Natural Language Understanding (NLU) models for effective interaction. This research addresses the challenges of developing NLU models across multiple languages, including Estonian, English, German, Spanish, French, Italian, and Latvian. We explore various intent detection methodologies, including memory-based techniques that encompass both vectorization with Language-agnostic BERT Sentence Embedding (LaBSE), Advanced Data Analysis (ADA), or Sentence-level MultimOdal and LaNguage-Agnostic Representations (SONAR) models, and semantic search using cosine similarity or Levenshtein distance-based approaches. Additionally, we investigate supervised text classification methods such as FastText with the Convolutional Neural Network, LaBSE with Feed-Forward Neural Network, or fine-tuning LaBSE, as well as text generation techniques leveraging OpenAI’s Davinci large language model. Our findings highlight the efficacy of memory-based approaches, particularly for non-English languages. We showcase the effectiveness of multilingual and cross-lingual LaBSE vectorization and the SONAR large language model. Furthermore, we introduce open-source intent detection software tailored for Federated Learning (FL). Through a prototype, we demonstrate the seamless integration of this framework into RASA-based virtual assistants, offering practical guidance for organizations interested in deploying intelligent and privacy-preserving conversational agents. This research advances virtual assistant development and highlights the potential of FL for seamless integration with NLU models. In the future, we plan to test it with more languages and with real client scenarios.
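
To make the memory-based idea in the abstract concrete, the following is a minimal sketch of intent detection by nearest-neighbour lookup with Levenshtein distance. It is a toy illustration under stated assumptions, not the authors' software: the `MEMORY` examples, the intent labels, and the `detect_intent` helper are all hypothetical, and the same lookup pattern would apply if the edit distance were replaced by cosine similarity over sentence embeddings.

```python
from typing import List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insert, delete, substitute) via dynamic programming."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


# Hypothetical "memory": stored utterances paired with their intent labels.
MEMORY: List[Tuple[str, str]] = [
    ("what are your opening hours", "opening_hours"),
    ("when do you open", "opening_hours"),
    ("i want to cancel my order", "cancel_order"),
    ("please cancel the order", "cancel_order"),
]


def detect_intent(utterance: str,
                  memory: List[Tuple[str, str]] = MEMORY) -> Tuple[str, int]:
    """Return the intent of the closest stored utterance and its distance."""
    query = utterance.lower().strip()
    best_text, best_intent = min(
        memory, key=lambda item: levenshtein(query, item[0])
    )
    return best_intent, levenshtein(query, best_text)


if __name__ == "__main__":
    print(detect_intent("When do you open?"))
```

In this sketch no model is trained: adding a new intent only means appending labelled utterances to the memory, which is one reason such approaches can be attractive when training data is scarce.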

Original language: English
Title of host publication: Digital Business and Intelligent Systems - 16th International Baltic Conference, Baltic DB and IS 2024, Proceedings
Editors: Audronė Lupeikienė, Gintautas Dzemyda, Jolita Ralyté
Pages: 79-92
Number of pages: 14
Volume: 2157 CCIS
Publication status: Published - 2024

Publication series

Name: Communications in Computer and Information Science
Volume: 2157 CCIS
ISSN (Print): 1865-0929
ISSN (Electronic): 1865-0937

Keywords

  • Federated learning
  • Intent detection
  • Memory-based, supervised classification and generation approaches
  • Estonian, English, German, Spanish, French, Italian, and Latvian languages

OECD Field of Science

  • 1.2 Computer and Information Sciences
