List of CLARIN K-centres with expertise in specific generic, language-independent topics and other keywords


Click the full name of a K-centre to go to its landing page; click the acronym to see its full organisation details.

ACE

CLARIN Knowledge Centre for Atypical Communication

Areas of competence: Atypical communication encompasses language and speech as encountered during (second) language acquisition and development and in language disorders, but also more broadly in bilingual language development and in sign language. ACE specialises in this type of research and in the related infrastructural issues of data acquisition, processing and sharing, which typically involve highly sensitive data. For data storage and access, the centre collaborates with MPI's TLA (The Language Archive), which is a CLARIN B Centre and is also based in Nijmegen.
Is portal for language(s): -
Other languages covered: -
Modalities covered: Text, speech, sign language
Linguistic topics: Language acquisition (L1 and L2), language disorders
Language processing: -
Data types: -
Resource families: -
Generic topics: Critical Data Management; Legal and ethical issues
Other keywords: -
Types of services: Information and guidelines (including FAQ) about consent (forms), hosting corpora and datasets containing atypical communication, and where to find such corpora and datasets; helpdesk/consultancy for questions on these topics; technical assistance for designing, creating, annotating, formatting and adding metadata to these resources; outreach: presentations, workshop contributions, etc.
Tour de CLARIN: -

CKLD

CLARIN Knowledge-Centre for linguistic diversity and language documentation

Areas of competence: The CLARIN Knowledge-Centre for linguistic diversity and language documentation offers expertise on data and data-related methods, technology and background information on language resources and tools to researchers, including students and native speakers. CKLD provides information and assistance relating to fieldwork and data-related methodological aspects, in particular equipment, digital tools, methods, where to find data and information, and whom to contact for specialist information on particular regions or language families.
Is portal for language(s): -
Other languages covered: Under-researched languages and language families (linguistic diversity). Expertise in Athabascan, Austronesian, Austro-Asiatic, Dravidian, Finno-Ugric, Papuan, etc.
Modalities covered: Text, audio-visual recordings of speech
Linguistic topics: language documentation, linguistic typology, linguistic fieldwork
Language processing: -
Data types: AV collections, typological databases
Resource families: AV collections of endangered and under-researched languages
Generic topics: linguistic fieldwork
Other keywords: -
Types of services: Information materials, guidelines, tutorials, consultancy
Tour de CLARIN: -

CLARIN-HUMLAB

CLARIN Knowledge Centre of Lund University Humanities Lab

Areas of competence: Advice on multimodal and sensor-based methods, including EEG, eye-tracking, articulography, virtual reality, motion capture and AV recording
Is portal for language(s): -
Other languages covered: -
Modalities covered: Multimodal data, sensor-based data
Linguistic topics: -
Language processing: text mining, machine learning-related research on textual data, keystroke logging
Data types: -
Resource families: -
Generic topics: multimodal and sensor-based methods, including EEG, eye-tracking, articulography, virtual reality, motion capture and AV recording
Other keywords: -
Types of services: tools, mentoring, consultancy, tutorials
Tour de CLARIN: -

CLARIN-SPEECH

CLARIN Knowledge Centre for Speech Analysis

Areas of competence: Technical advice on speech analysis relating to all aspects of speech technology, including speech science, speech applications, and speech in interaction.
Is portal for language(s): -
Other languages covered: Swedish, English
Modalities covered: Speech, biosignals, audiovisual data, sensor data
Linguistic topics: phonetics, pathology
Language processing: speech analysis, speech modelling, speech processing
Data types: acoustic and language models, dictionaries, vocabularies, pronunciation data, biosignals related to spoken interaction
Resource families: oral history, parliamentary records
Generic topics: deep learning, evaluation, tools, visualization, ASR, legal issues, data management
Other keywords: -
Types of services: awareness, tools, mentoring
Tour de CLARIN: -

CLASSLA

CLARIN Knowledge Centre for South Slavic languages

Areas of competence: Offers expertise on language resources and technologies for South Slavic languages
Is portal for language(s): Slovene, Croatian, Bosnian, Montenegrin, Serbian, Macedonian, Bulgarian
Other languages covered: -
Modalities covered: Text
Linguistic topics: Applied linguistics, Dialect studies, Sociolinguistics (for South Slavic languages)
Language processing: Basic processing of South Slavic languages
Data types: training data, language models (for South Slavic languages)
Resource families: Newspapers, social media, parliamentary records, historical texts, language learner corpora (for South Slavic languages)
Generic topics: deep learning, evaluation of tools (for South Slavic languages)
Other keywords: -
Types of services: tools, data, mentoring, dissemination, awareness, web lectures
Tour de CLARIN: -

IMPACT-CKC

IMPACT centre of competence - CLARIN K-centre in digitisation

Areas of competence: IMPACT-CKC (IMPACT centre of competence - CLARIN K-centre in digitisation), as a knowledge centre, offers expertise and resources to institutions and researchers looking for advice on digitisation and related fields. The IMPACT-CKC resources include a demonstrator platform for online testing tools, a collection of high-quality images with associated ground truth, historical lexica for 10 languages, as well as training materials and registries of tools, initiatives, datasets and competitions.
Is portal for language(s): -
Other languages covered: Spanish, English, Polish, French, Dutch, German, Slovene, Czech, Latin, Bulgarian
Modalities covered: Text, AV data
Linguistic topics: corpus linguistics, diachronic language resources, language learning
Language processing: basic language processing, information extraction
Data types: lexical data, language models, linked open data and ontologies
Resource families: historical texts, lexical resources, literary texts, newspapers
Generic topics: OCR, digitisation, visualisation, evaluation of tools
Other keywords: -
Types of services: tools, data, mentoring, dissemination, awareness, tutorials, web lectures
Tour de CLARIN: Introduction, Interview

NSD-K-centre

CLARIN Knowledge Centre for Data Management at NSD

Areas of competence: Provides expertise in data management, including legal and ethical issues related to privacy and IPR.
Is portal for language(s): -
Other languages covered: -
Modalities covered: -
Linguistic topics: -
Language processing: -
Data types: -
Resource families: -
Generic topics: Data Management, Legal and ethical issues
Other keywords: -
Types of services: -
Tour de CLARIN: -

PORTULAN

CLARIN Knowledge Centre for the Science and Technology of the Portuguese Language

Areas of competence: The Science and Technology of the Portuguese Language is the thematic area of this CLARIN Knowledge Centre. With respect to the Portuguese language, it covers all topics, from Phonetics to Discourse and Dialogue; all language functions, from communicative performance to cultural expression; all disciplines, from Theoretical Linguistics to Language Technology; all language variants, from national standard varieties across the world to dialects of professional groups; and all media of representation, from audio to brain imaging recordings.
Is portal for language(s): Portuguese
Other languages covered: -
Modalities covered: Text and speech
Linguistic topics: -
Language processing: Portuguese language processing
Data types: -
Resource families: -
Generic topics: Brain image recording
Other keywords: -
Types of services: -
Tour de CLARIN: -

PhA-OeAW

Phonogrammarchiv - Austrian Academy of Sciences

Areas of competence: As an audio and audiovisual archive with numerous collections of unique research recordings from all across the world, the Phonogrammarchiv offers various services. Besides providing access to its data and metadata resources (remote and onsite), it advises scholars on field research methodology and technologies of audio and audiovisual documentation, and supports them with the necessary recording equipment. In addition, it widely shares its broad expertise on topics such as restoration, digitisation, format obsolescence, cataloguing, metadata, long-term preservation and storage.
Is portal for language(s): -
Other languages covered: Audio and audiovisual recordings plus accompanying documentation on a wide variety of languages and dialects from all across the world, covering a timespan of 120 years.
Modalities covered: Audio and audiovisual recordings.
Linguistic topics: Field linguistics, interview techniques (social/cultural anthropology, ethnomusicology), language documentation, oral history.
Language processing: -
Data types: Audio and audiovisual recordings.
Resource families: -
Generic topics: Archiving: physical restoration, digitisation, format migration, cataloguing, metadata, long-term preservation and storage. Research: methods and technologies of audiovisual fieldwork and documentation.
Other keywords: -
Types of services: Individual advice, group training, workshops and higher education teaching, internships, practical assistance and institutional cooperation. Access to audiovisual data and metadata (remote and onsite).
Tour de CLARIN: Introduction

SAFMORIL

Systems and Frameworks for Morphologically Rich Languages

Areas of competence: SAFMORIL brings together researchers and developers in the area of computational morphology and its NLP applications. The focus of SAFMORIL is on actual, working systems and frameworks that are based on linguistic principles and provide linguistically motivated analyses and generation outputs. Such systems are particularly relevant for languages with rich morphologies. SAFMORIL offers online courses on developing morphologies, tokenizers and spell-checkers, and a repository for storing morphologies.
Is portal for language(s): -
Other languages covered: Primarily Nordic and Baltic languages (such as Finnish, Swedish, Norwegian, Latvian, Lithuanian as well as the Sámi languages), but also more generally Finno-Ugric languages, Inuit languages, Canadian First Nation languages and Babylonian languages
Modalities covered: Text
Linguistic topics: Morphology and Morphosyntax
Language processing: Processing of morphologically rich languages
Data types: Lexical resources containing inflectional, derivational and compounding information as well as morphosyntactic grammars and language models
Resource families: Morphological Lexicons, Grammars and Language Models
Generic topics: Primarily Finite-State Applications, but to some degree also Statistical Methods and Neural Networks
Other keywords: -
Types of services: data, tools, web demos, web lectures and tutorials
Tour de CLARIN: -

SWELANG

CLARIN Knowledge Centre for The Languages of Sweden

Areas of competence: Information service offering advice on the use of digital language resources and tools for the Swedish language, minority languages in Sweden, Swedish Sign Language, Swedish dialects and other parts of the intangible cultural heritage of Sweden in text and speech, as well as on language policy and planning.
Is portal for language(s): Swedish
Other languages covered: Finnish, Meänkieli, Romani, Yiddish, Swedish Sign Language and other languages in Sweden
Modalities covered: Text, sign language, spoken Swedish dialects
Linguistic topics: language policy and planning, language infrastructure, language technology, dialect studies, sociolinguistics, folkloristics, plain language and language comprehensibility, terminology, lexicography
Language processing: topic modelling
Data types: speech recordings, mono- and multilingual lexica and word/term collections
Resource families: -
Generic topics: language policy and planning
Other keywords: -
Types of services: online lexica, Q&A database, map interfaces for folk tales and dialects, open data, language consulting (by telephone, email and social media)
Tour de CLARIN: Introduction, Interview

Spanish-K-Centre

Spanish CLARIN K-Centre

Areas of competence: The Spanish CLARIN K-Centre aims to provide knowledge, services, consultancy and specialized web services to the Humanities and Social Science research communities. Our web services and consultancy cover how to use, and do research with, basic tools that can handle and exploit textual data in at least the four (co)official languages (Spanish, Catalan, Galician, Basque) and English, which is one of the most important sources of information for many HSS disciplines.
Is portal for language(s): Spanish, Basque, Catalan, Galician
Other languages covered: -
Modalities covered: Text
Linguistic topics: 1. General linguistics (phonology, morphology, syntax, semantics, pragmatics); 2. Computational linguistics; 3. Corpus linguistics; 4. Applied linguistics; 5. Stylistics
Language processing: Spanish, Catalan, Galician and Basque language processing: morphology, syntax, semantics, discourse
Data types: 1. Lexical databases: general, sentiment, NERC...; 2. Syntactic treebanks; 3. Discourse treebanks: coreference, relational; 4. Spoken databases; 5. Semantic annotation: semantic roles, word sense; 6. Error annotation; 7. Image bank (Wikimedia); 8. Conversational QA
Resource families: -
Generic topics: 1. Grammars; 2. Finite-State Applications; 3. Statistical Methods; 4. Neural Networks
Other keywords: -
Types of services: 1. Tools; 2. Data; 3. Mentoring; 4. Dissemination; 5. Tutorials
Tour de CLARIN: -