AiLab (co-)authored language resources: data and tools
Digital language resource repository CLARIN-LV:
repository.clarin.lv
Dictionaries
Explanatory and synonym dictionary Tēzaurs:
tezaurs.lv
Latvian Wordnet:
wordnet.ailab.lv
Dictionary of Contemporary Latvian:
mlvv.tezaurs.lv
Dictionary of Standard Latvian:
llvv.tezaurs.lv
K. Mīlenbahs, J. Endzelīns, Dictionary of the Latvian Language:
mev.tezaurs.lv
Dictionary of Historical Latvian:
lvvv.tezaurs.lv
Text and speech corpora
National Corpus Collection:
korpuss.lv
Latvian Treebank:
sintakse.korpuss.lv
Latvian FrameNet corpus:
framenet.korpuss.lv
Corpus of News Portal Comments:
barometrs.korpuss.lv
Corpus of the Saeima (the Parliament of Latvia):
saeima.korpuss.lv
Latvian Language Learner Corpus:
lava.korpuss.lv
Corpus of Early Written Latvian Texts:
senie.korpuss.lv
Latvian Grammatical Error Correction and Fluency Corpus "Norma":
norma.korpuss.lv
Tools and language models
Audio recognition, transcription and subtitling platform:
late.ailab.lv
Latvian text analysis toolchain LV-PIPE:
nlp.ailab.lv
LV-PIPE components in the Docker Hub repository:
hub.docker.com/u/nlppipe
Tēzaurs' API:
api.tezaurs.lv
Most of the open source tools developed by AiLab are available in the GitHub repository:
github.com/LUMII-AILab
Latvian morphological analyzer:
github.com/PeterisP/morphology
Latvian POS and morphological tagger:
github.com/PeterisP/LVTagger
Domain name splitter:
github.com/lauma/LVSegmenter
Some open source tools and models are also available in the GitLab repository:
gitlab.com/ailab
Models and datasets in the Hugging Face repository:
huggingface.co/AiLab-IMCS-UL
SELMA DockerSpaces platform for scalable, multilingual NLP:
selma-project.github.io
SELMA Open-Source Platform:
selma.ailab.lv
Other resources
Historical language resource site:
valoda.ailab.lv
Historical homepage:
ailab.mii.lu.lv