Knowledge Centres

Full name: CLARIN Knowledge Centre for Belarusian Text and Speech Processing
Short name: K-BLP
URL: https://clarin-belarus.corpus.by/
Hosted by: (1) The United Institute of Informatics Problems of the National Academy of Sciences of Belarus
City of main hub: Minsk
Country of main hub: BY
Date of certification: 2020-02-10
Area of competence:
- Knowledge about text and speech processing of Belarusian and other languages
- Knowledge about Belarusian language learning
- Tools and resources for text and speech processing for Belarusian and other languages
Audiences:
- Computational linguists
- Computer scientists
- Linguists
- Historians
- Language teachers
- Library staff
- Sociolinguists
- Sociologists
- Programmers
- Archivists
- Citizen scientists
Types of services:
- Knowledge about tokenization, morphological analysis, voiced electronic grammatical dictionaries, part-of-speech tagging, frequency counting, spell checking, text classification, and other approaches used in speech and text processing.
- Special courses in language processing, data analysis, and research data collection, helping humanities scholars and others get started quickly with digital processing of Belarusian data.
- Wide-ranging user support, guidelines, and instructions for each service and material.
Language portal for:
- Belarusian
Other languages covered:
- Russian
- English
- Morphologically rich languages
Modalities covered:
- Text and speech processing
- Audio-visual data generation
- Multi-modality
- Speech
- Written text
Linguistic topics:
- Applied linguistics
- Dialect studies
- Field linguistics
- Language learning
- Lexical studies
- Morphology
- Phonetics
- Phonology
- Syntax
- Pragmatics
- Semantics
- Terminology
- Translation studies
Language processing topics:
- Speech synthesis
- Speech recognition (word, intonation, emotion, pathology)
- Text generation
- Character and word counting
- Information extraction
- Language generation
- Language understanding
- Machine translation
- Processing of morphologically rich languages
- Summarization
- Text mining
- Word sense disambiguation
- Basic language processing
- Dictionary Processing
- Lemmatization
- Part-of-Speech Tagging
- Spell-checking
- Transliteration (conversion from Cyrillic to Latin letters)
- Homograph Identification
Data types:
- Dictionaries
- Language models
- Term banks (UDC Codes)
- Typological databases
- Dialectological Maps
Resource families:
- Dictionaries
- Part-of-speech tagging and lemmatization
Generic topics:
- Language use in specific domains (legal document translation, UDC classification for libraries)
- Working with maps
Other keywords:
- Online platform
- Natural language processing
Tour de CLARIN introduction 
Tour de CLARIN interview
Last update: 2021-01-22 11:03:19