A. Masciolini, A. Caines, O. D. Clercq, J. Kruijsbergen, M. Kurfalı, R. M. Sanchez, E. Volodina, R. Ostling, K. Allkivi, S. A. Holdt, I. Auzina, R. Dargis, E. Drakonaki, J. Frey, I. Glisic, P. Kikilintza, L. Nicolas, M. Romanyshyn, A. Rosen, A. Rozovskaya, K. Suluste, O. Syvokon, A. Tantos, D. Touriki, K. Tsiotskas, E. Tsourilla, V. Varsamopoulos, K. Wisniewski, A. Zagar, and T. Zesch. Towards better language representation in Natural Language Processing. International Journal of Learner Corpus Research, John Benjamins Publishing Company, April 2025.

I. Skadina, B. Bakanovs, and R. Dargis. First Steps in Benchmarking Latvian in Large Language Models. Proceedings of the Third Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2025), University of Tartu Library, pages 86-95, March 2025.

A. Znotins, N. Gruzitis, and R. Dargis. From Conversational Speech to Readable Text: Post-Processing Noisy Transcripts in a Low-Resource Setting. Proceedings of the Tenth Workshop on Noisy and User-generated Text, ACL, pages 143-148, 2025.

R. Dargis, G. Barzdins, I. Skadina, N. Gruzitis, and B. Saulite. Evaluating Open-Source LLMs in Low-Resource Languages: Insights from Latvian High School Exams. Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, Association for Computational Linguistics, pages 289-293, November 2024.

R. Dargis, A. Znotins, I. Auzina, B. Saulite, S. Reinsone, R. Dejus, A. Klavinska, and N. Gruzitis. BalsuTalka.lv – Boosting the Common Voice Corpus for Low-Resource Languages. Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING), pages 2080, May 2024.

T. Erjavec, M. Kopp, N. Ljubesic, T. Kuzman, P. Rayson, P. Osenova, M. Ogrodniczuk, C. Coltekin, D. Korzinek, K. Meden, J. Skubic, P. Rupnik, T. Agnoloni, J. Aires, S. Barkarson, R. Bartolini, N. Bel, M. C. Perez, R. Dargis, et al. ParlaMint II: advancing comparable parliamentary corpora across Europe. Language Resources and Evaluation, 2024.

I. Auzina, N. Gruzitis, R. Dargis, G. Rabante-Busa, D. Gosko, J. Vempers, R. Kivkucans, and A. Znotins. Recent Latvian Speech Corpora for Linguistic Research and Technology Development. Baltic Journal of Modern Computing, University of Latvia, volume 12, issue 4, pages 646-658, 2024.

R. Dargis and B. Saulite. Korpuss.lv – a Versatile Platform for Digital Humanities. Baltic Journal of Modern Computing, University of Latvia, volume 12, issue 4, pages 636-645, 2024.

T. Erjavec, M. Ogrodniczuk, P. Osenova, N. Ljubesic, K. Simov, A. Pancur, M. Rudolf, M. Kopp, S. Barkarson, S. Steingrimsson, C. Coltekin, J. de Does, K. Depuydt, T. Agnoloni, G. Venturi, M. C. Perez, L. de Macedo, C. Navarretta, G. Luxardo, M. Coole, P. Rayson, V. Morkevicius, T. Krilavicius, R. Dargis, O. Ring, R. van Heusden, M. Marx, and D. Fiser. The ParlaMint corpora of parliamentary proceedings. Language Resources and Evaluation, Springer, volume 57, pages 415-448, 2023.

B. Saulite, I. Auzina, and R. Dargis. Nacionālā korpusu kolekcija Korpuss.lv. Linguistica Lettica, LU Latviešu valodas institūts, volume 31, issue 1, pages 202-223, 2023.

A. Znotins, R. Dargis, N. Gruzitis, G. Barzdins, and D. Gosko. RUTA:MED – Dual Workflow Medical Speech Transcription Pipeline and Editor. Natural Language Processing and Information Systems, Springer, volume 13286, pages 209-214, June 2022.

L. Skestere and R. Dargis. Agenda-Setting Dynamics during COVID-19: Who Leads and Who Follows? Social Sciences, volume 11, issue 12, pages 556, 2022.

I. Skadina, I. Auzina, R. Dargis, E. Lasmanis, and A. Voitkans. CLARIN-LV: Many Steps till Operation. CLARIN Annual Conference, pages 9-13, 2022.

R. Dargis, I. Auzina, I. Kaija, K. Levane-Petrova, and K. Pokratniece. Corpus Based Self-Assessment Platform for Latvian Language Learners. Baltic Journal of Modern Computing, volume 10, issue 3, pages 392-401, 2022.

I. Auzina, R. Dargis, B. Saulite, N. Gruzitis, M. Grasmanis, A. Spektors, and K. Stepanovs. Specializēta latviešu valodas runas korpusa un izrunas vārdnīcas izveide vizuālās diagnostikas izmeklējumu lingvistiskai analīzei un sistemātiskai transkribēšanai. Letonica, volume 47, pages 244-262, 2022.

I. Auzina, R. Dargis, I. Kaija, K. Levane-Petrova, and K. Pokratniece. Valodas korpusu izmantošana latviešu valodas uzdevumu automātiskā ģenerēšanā. Letonica, volume 47, pages 264-282, 2022.

I. Skadina, I. Auzina, R. Dargis, and A. Voitkans. CLARIN valodas resursu un rīku pētniecības infrastruktūra humanitārajām un sociālajām zinātnēm. Letonica, volume 47, pages 312-327, 2022.

N. Gruzitis, R. Dargis, V. Lasmanis, G. Garkaje, and D. Gosko. Adapting Automatic Speech Recognition to the Radiology Domain for a Less-Resourced Language: The Case of Latvian. Intelligent Sustainable Systems, Springer, volume 333, pages 267-276, 2022.

B. Saulite, R. Dargis, N. Gruzitis, I. Auzina, K. Levane-Petrova, L. Pretkalnina, L. Rituma, P. Paikens, A. Znotins, L. Strankale, K. Pokratniece, I. Poikans, G. Barzdins, I. Skadina, A. Baklane, V. Saulespurens, and J. Ziedins. Latvian National Corpora Collection – Korpuss.lv. 13th Language Resources and Evaluation Conference (LREC), pages 5123-5129, 2022.

R. Dargis, I. Auzina, I. Kaija, K. Levane-Petrova, and K. Pokratniece. LaVA – Latvian Language Learner corpus. 13th Language Resources and Evaluation Conference (LREC), pages 727-731, 2022.

I. Auzina, I. Kaija, K. Levane-Petrova, K. Pokratniece, and R. Dargis. Latviešu valodas apguvēju korpusa (LaVA) izmantošana pētniecībā un mācību uzdevumu izstrādē. Latviešu valodas apguve. XIII Starptautiskais baltistu kongress, LiePA, pages 142-161, 2021.

R. Dargis, I. Auzina, K. Levane-Petrova, and I. Kaija. Detailed Error Annotation for Morphologically Rich Languages: Latvian Use Case. Human Language Technologies - The Baltic Perspective, IOS Press, volume 328, pages 241-244, 2020.

R. Dargis, N. Gruzitis, I. Auzina, and K. Stepanovs. Creation of Language Resources for the Development of a Medical Speech Recognition System for Latvian. Human Language Technologies - The Baltic Perspective, IOS Press, volume 328, pages 135-141, 2020.

N. Gruzitis, R. Dargis, L. Rituma, G. Nespore-Berzkalne, and B. Saulite. Deriving a PropBank Corpus from Parallel FrameNet and UD Corpora. Proceedings of the International FrameNet Workshop 2020: Towards a Global, Multilingual FrameNet, pages 63-69, 2020.

R. Dargis, K. Levane-Petrova, and I. Poikans. Lessons Learned from Creating a Balanced Corpus from Online Data. Human Language Technologies - The Baltic Perspective, IOS Press, volume 328, pages 127-134, 2020.

R. Dargis, I. Auzina, K. Levane-Petrova, and I. Kaija. Quality Focused Approach to a Learner Corpus Development. Proceedings of The 12th Language Resources and Evaluation Conference (LREC), pages 392-396, 2020.

R. Dargis, P. Paikens, N. Gruzitis, I. Auzina, and A. Akmane. Development and Evaluation of Speech Synthesis Corpora for Latvian. Proceedings of The 12th Language Resources and Evaluation Conference (LREC), pages 6633-6637, 2020.

U. Bojars, R. Dargis, U. Lavrinovics, and P. Paikens. LinkedSaeima: a Linked Open Dataset of Latvia's Parliamentary Debates. Proceedings of the 15th SEMANTiCS Conference, Springer, volume 11702, pages 50-56, 2019.

O. Urek, A. Vulane, R. Dargis, A. Taurina, T. Zirina, and H. G. Simonsen. Latvian CDI: methodology, developmental trends and cross-linguistic comparison. Journal of Baltic Studies, Routledge, volume 50, issue 3, pages 285-305, 2019.

I. Auzina, R. Dargis, and K. Levane-Petrova. Latviešu valodas apguvēju kļūdu analīze: pareizrakstības kļūdas. Vārds un tā pētīšanas aspekti, LiePA, issue 23, pages 220-227, 2019.

R. Dargis and I. Auzina. Towards a Modern Text-to-Speech System for Latvian. Human Language Technologies - The Baltic Perspective, IOS Press, volume 307, pages 26-29, 2018.

R. Dargis, I. Auzina, U. Bojars, P. Paikens, and A. Znotins. Annotation of the Corpus of the Saeima with Multilingual Standards. Proceedings of the 2018 ParlaCLARIN Workshop, 2018.

R. Dargis, I. Auzina, and K. Levane-Petrova. The Use of Text Alignment in Semi-Automatic Error Analysis: Use Case in the Development of the Corpus of the Latvian Language Learners. Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC), pages 4111-4115, 2018.

A. Spektors, I. Auzina, R. Dargis, N. Gruzitis, P. Paikens, L. Pretkalnina, L. Rituma, and B. Saulite. Tezaurs.lv: the largest open lexical database for Latvian. Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC), 2016.

I. Auzina, K. Levane-Petrova, G. Rabante-Busa, R. Dargis, and A. Fabregas. Designing an annotated longitudinal Latvian children's speech corpus. Human Language Technologies - The Baltic Perspective, IOS Press, volume 289, 2016.

R. Dargis, G. Rabante-Busa, I. Auzina, and S. Kruks. ParliSearch - A system for large text corpus discourse analysis. Human Language Technologies - The Baltic Perspective, IOS Press, volume 289, 2016.

A. Znotins, K. Polis, and R. Dargis. Media monitoring system for Latvian radio and TV broadcasts. Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2015.

R. Dargis and A. Znotins. Baseline for keyword spotting in Latvian broadcast speech. Human Language Technologies - The Baltic Perspective, IOS Press, volume 268, 2014.

G. Garkaje, E. Zilgalve, and R. Dargis. Normalization and automatized sentiment analysis of contemporary online Latvian Language. Human Language Technologies - The Baltic Perspective, IOS Press, volume 268, 2014.

I. Auzina, M. Pinnis, and R. Dargis. Comparison of rule-based and statistical methods for grapheme to phoneme modelling. Human Language Technologies - The Baltic Perspective, IOS Press, volume 268, 2014.