Language Technology Initiative

Year 2023 Jan–2026 Jun
Funding EU Recovery and Resilience Facility
Project No. 2.3.1.1.i.0/1/22/I/CFLA/002
Partners University of Latvia, Institute of Literature, Folklore and Art UL, Riga Technical University, Riga Stradins University, Tilde
Abstract The goal of the project is to develop and refine large language models (LLMs), grammars, and lexicons; to create technologies for monolingual and multilingual audiovisual data processing; and to build resources and tools that support both developers and users in learning language technologies. The project also plans to set up a high-performance computing centre for LLM training.

Publications

R. Dargis, G. Barzdins, I. Skadina, N. Gruzitis, B. Saulite
Evaluating Open-Source LLMs in Low-Resource Languages: Insights from Latvian High School Exams
Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, Association for Computational Linguistics, 2024
PDF, BibTeX
I. Skadina, J. Kuzmina, S. Kruks, M. Platonova, T. Smirnova, I. Auzina
Language Technology Initiative - Bridging the Gap between Research and Education
CLARIN Annual Conference Proceedings, 2024
PDF, BibTeX
P. Paikens, L. Pretkalnina, L. Rituma
A Computational Model of Latvian Morphology
Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024
PDF, BibTeX
R. Dargis, A. Znotins, I. Auzina, B. Saulite, S. Reinsone, R. Dejus, A. Klavinska, N. Gruzitis
BalsuTalka.lv – Boosting the Common Voice Corpus for Low-Resource Languages
Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), 2024
PDF, BibTeX
R. Dargis and B. Saulite
Korpuss.lv – a Versatile Platform for Digital Humanities
Baltic Journal of Modern Computing, 12(4), 636-645, 2024
PDF, DOI, BibTeX
E. Mukans and G. Barzdins
RIGA at SemEval-2023 Task 2: NER Enhanced with GPT-3
17th International Workshop on Semantic Evaluation (SemEval), ACL, 2023
PDF, BibTeX
A. Branco, M. Eskevich, F. Frontini, J. Hajic, E. Hinrichs, F. de Jong, P. Kamocki, A. Konig, K. Linden, C. Navarretta et al.
The CLARIN Infrastructure as an Interoperable Language Technology Platform for SSH and Beyond
Language Resources and Evaluation, 2023
PDF, DOI, BibTeX