TY - GEN
T1 - Toward Federated Learning Through Intent Detection Research
AU - Deksne , Daiga
AU - Kapo?i?t?-Dzikien?, Jurgita
AU - Skadiņš, Raivis
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - Modern organizational communication heavily relies on virtual assistants, necessitating robust Natural Language Understanding (NLU) models for effective interaction. This research addresses the challenges of developing NLU models across multiple languages, including Estonian, English, German, Spanish, French, Italian, and Latvian. We explore various intent detection methodologies, including memory-based techniques that encompass both vectorization with Language-agnostic BERT Sentence Embedding (LaBSE), Advanced Data Analysis (ADA), or Sentence-level MultimOdal and LaNguage-Agnostic Representations (SONAR) models, and semantic search using cosine similarity or Levenshtein distance-based approaches. Additionally, we investigate supervised text classification methods such as FastText with the Convolutional Neural Network, LaBSE with Feed-Forward Neural Network, or fine-tuning LaBSE, as well as text generation techniques leveraging OpenAI’s Davinci large language model. Our findings highlight the efficacy of memory-based approaches, particularly for non-English languages. We showcase the effectiveness of multilingual and cross-lingual LaBSE vectorization and the SONAR large language model. Furthermore, we introduce open-source intent detection software tailored for Federated Learning (FL). Through a prototype, we demonstrate the seamless integration of this framework into RASA-based virtual assistants, offering practical guidance for organizations interested in deploying intelligent and privacy-preserving conversational agents. This research advances virtual assistant development and highlights the potential of FL for seamless integration with NLU models. In the future, we plan to test it with more languages and with real client scenarios.
AB - Modern organizational communication heavily relies on virtual assistants, necessitating robust Natural Language Understanding (NLU) models for effective interaction. This research addresses the challenges of developing NLU models across multiple languages, including Estonian, English, German, Spanish, French, Italian, and Latvian. We explore various intent detection methodologies, including memory-based techniques that encompass both vectorization with Language-agnostic BERT Sentence Embedding (LaBSE), Advanced Data Analysis (ADA), or Sentence-level MultimOdal and LaNguage-Agnostic Representations (SONAR) models, and semantic search using cosine similarity or Levenshtein distance-based approaches. Additionally, we investigate supervised text classification methods such as FastText with the Convolutional Neural Network, LaBSE with Feed-Forward Neural Network, or fine-tuning LaBSE, as well as text generation techniques leveraging OpenAI’s Davinci large language model. Our findings highlight the efficacy of memory-based approaches, particularly for non-English languages. We showcase the effectiveness of multilingual and cross-lingual LaBSE vectorization and the SONAR large language model. Furthermore, we introduce open-source intent detection software tailored for Federated Learning (FL). Through a prototype, we demonstrate the seamless integration of this framework into RASA-based virtual assistants, offering practical guidance for organizations interested in deploying intelligent and privacy-preserving conversational agents. This research advances virtual assistant development and highlights the potential of FL for seamless integration with NLU models. In the future, we plan to test it with more languages and with real client scenarios.
KW - English
KW - and Latvian languages
KW - Italian
KW - Federated learning
KW - French
KW - Memory-based
KW - German
KW - Spanish
KW - supervised classification and generation approaches
KW - Intent detection
KW - Estonian
UR - https://link.springer.com/chapter/10.1007/978-3-031-63543-4_6
UR - https://www.scopus.com/pages/publications/85199147868
U2 - 10.1007/978-3-031-63543-4_6
DO - 10.1007/978-3-031-63543-4_6
M3 - Conference paper
SN - 9783031635427
VL - 2157 CCIS
T3 - Communications in Computer and Information Science
SP - 79
EP - 92
BT - Digital Business and Intelligent Systems - 16th International Baltic Conference, Baltic DB and IS 2024, Proceedings
A2 - Lupeikienė, Audronė
A2 - Dzemyda, Gintautas
A2 - Ralyté, Jolita
ER -