List of CLARIN K-centres with expertise in specific data types

Fields per centre: Key; Short name; Full name; Areas of competence; Data types covered; Types of services

Key: 101
Short name: Spanish K-Centre
Full name: Spanish CLARIN K-Centre
Areas of competence: The Spanish CLARIN K-Centre aims to provide knowledge, services, consultancy and specialized web services to the Humanities and Social Science research communities. Our web services and consultancy cover how to use, and do research with, basic tools that can handle and exploit textual data, one of the most important sources of information for many Humanities and Social Science disciplines, in at least the four (co-)official languages (Spanish, Catalan, Galician, Basque) and English.
Data types covered:
  1. Lexical databases: general, sentiment, NERC...
  2. Syntax treebanks
  3. Discourse treebanks: coreference, relational
  4. Spoken databases
  5. Semantic annotation: semantic roles, word sense
  6. Error annotation
  7. Image bank (Wikimedia)
  8. Conversational QA
Types of services:
  1. Tools
  2. Data
  3. Mentoring
  4. Dissemination
  5. Tutorials

Key: 102
Short name: PhA-OeAW
Full name: Phonogrammarchiv - Austrian Academy of Sciences
Areas of competence: As an audio and audiovisual archive with numerous collections of unique research recordings from all across the world, the Phonogrammarchiv offers various services: besides providing access to its data and metadata resources (remote and onsite), it advises scholars on field research methodology and technologies of audio and audiovisual documentation, supporting them with necessary recording equipment. In addition, it widely shares its broad expertise on topics such as restoration, digitisation, format obsolescence, cataloguing, metadata, long-term preservation and storage.
Data types covered: Audio and audiovisual recordings.
Types of services: Individual advice, group trainings, workshops and higher education teaching, internships, practical assistance and institutional cooperations. Access to audiovisual data and metadata (remote and onsite).
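
Key: 103
Short name: Treebanking
Full name: CLARIN Knowledge Centre for Treebanking
Areas of competence: Treebanks: construction, integration, search, processing, formats
Data types covered: Treebanks
Types of services: (none listed)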

Key: 104
Short name: CLARIN-SPEECH
Full name: CLARIN Knowledge Centre for Speech Analysis
Areas of competence: Technical advice on speech analysis relating to all aspects of speech technology, including speech science, speech applications, and speech in interaction.
Data types covered: acoustic and language models, dictionaries, vocabularies, pronunciation data, biosignals related to spoken interaction
Types of services: awareness, tools, mentoring
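
Key: 105
Short name: DANSK
Full name: CLARIN K-Centre DANSK - DANish helpdeSK
Areas of competence: Danish language and Danish sign language, language resources and language technology tools and services for Danish.
Data types covered: corpora, wordnet, multimodal annotations
Types of services: (none listed)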

Key: 106
Short name: CLARIN-Learn
Full name: CLARIN Knowledge Centre for Language Learning Analysis
Areas of competence: Our centre is happy to provide advice on tools, corpora, and methods for the study of first and second language learning, conversational interactions, and a variety of language and developmental disabilities including aphasia, stuttering, TBI, dementia, and ASD.
Data types covered: Corpora, both text and multimodal
Types of services: corpora, tools, web screencasts, manuals, Google Groups mailing lists, email support, workshops

Key: 107
Short name: SWELANG
Full name: CLARIN Knowledge Centre for The Languages of Sweden
Areas of competence: Information service offering advice on the use of digital language resources and tools for the Swedish language, minority languages in Sweden, the Swedish sign language, Swedish dialects, as well as other parts of the intangible cultural heritage of Sweden in text and speech, as well as language policy and planning.
Data types covered: speech recordings, mono- and multilingual lexica and word/term collections
Types of services: on-line lexica, Q&A database, map interfaces for folk tales and dialects, open data, language consulting (by telephone, email and social media)

Key: 110
Short name: CKLD
Full name: CLARIN Knowledge-Centre for linguistic diversity and language documentation
Areas of competence: The CLARIN Knowledge-Centre for linguistic diversity and language documentation offers expertise on data and data-related methods, technology and background information on language resources and tools to researchers - including students and native speakers. CKLD provides information and assistance relating to fieldwork and data-related methodological aspects and in particular relating to equipment, digital tools, methods, where to find data and information, whom to contact for specialist information on particular regions or language families.
Data types covered: AV collections, typological databases
Types of services: Information materials, guidelines, tutorials, consultancy

Key: 112
Short name: IMPACT-CKC
Full name: IMPACT centre of competence - CLARIN K-centre in digitisation
Areas of competence: IMPACT-CKC (IMPACT centre of competence - CLARIN K-centre in digitisation), as a knowledge centre, offers expertise and resources to institutions and researchers looking for advice in digitisation and related fields. The IMPACT-CKC resources include a demonstrator platform for testing tools online, a collection of high-quality images with associated ground truth, historical lexica for 10 languages, as well as training materials and registries on tools, initiatives, datasets and competitions.
Data types covered: lexical data, language models, linked open data and ontologies
Types of services: tools, data, mentoring, dissemination, awareness, tutorials, web lectures.

Key: 113
Short name: CorpLingCz
Full name: Czech CLARIN Knowledge Centre for Corpus Linguistics
Areas of competence: Provides information, consulting and technical assistance on all topics related to corpus linguistics. This includes data formats, annotation, metadata encoding, corpus querying, corpus linguistics methodology, statistical methods etc. Another specialization of the centre is empirical research on the Czech language.
Data types covered: Speech corpora, parallel corpora
Types of services: We are ready to provide data, tools and technical assistance, share expertise and hold workshops on demand on the topics covered by the K-centre. There is an on-line helpdesk to handle user requests.

Key: 114
Short name: CLARIN-SMS
Full name: Swedish in a Multilingual Setting
Areas of competence: Offers special expertise in the areas of processing of parallel and comparable corpora, including alignment and machine translation, cross-linguistically consistent annotation within the framework of Universal Dependencies, computation and evaluation of measures of text complexity and language technology for Swedish Sign Language.
Data types covered: parallel corpora
Types of services: (none listed)

Key: 116
Short name: TRTC
Full name: Terminology Resources and Translation Corpora
Areas of competence: The K-Centre provides information and training to users on the preparation and documentation of translation-related resources, in particular terminology resources and translation corpora. This includes inquiries submitted to the Helpdesk related to tools, methods, data, and guidance in seeking further expert support. The service does not focus on language resources in any particular language, but is language independent.
Data types covered: Translation corpora, Terminology
Types of services: (none listed)

Key: 117
Short name: CLASSLA
Full name: CLARIN Knowledge Centre for South Slavic languages
Areas of competence: Offers expertise on language resources and technologies for South Slavic languages
Data types covered: training data, language models (for South Slavic languages)
Types of services: tools, data, mentoring, dissemination, awareness, web lectures

Key: 119
Short name: SAFMORIL
Full name: Systems and Frameworks for Morphologically Rich Languages
Areas of competence: SAFMORIL brings together researchers and developers in the area of computational morphology and its NLP applications. The focus of SAFMORIL is actual, working systems and frameworks based on linguistic principles providing linguistically motivated analyses and generation outputs. Such systems are relevant in particular for languages with rich morphologies. SAFMORIL offers online courses for developing morphologies, tokenizers and spell-checkers, and a repository for storing morphologies.
Data types covered: Lexical resources containing inflectional, derivational and compounding information as well as morphosyntactic grammars and language models
Types of services: data, tools, web demos, web lectures and tutorials

Steven Krauwer (s.krauwer@uu.nl), Phone +31 30 253 6050
Utrecht Institute of Linguistics UiL OTS, Faculty of Humanities, Utrecht University
Drift 10, 3512 BS Utrecht, Netherlands
[Page generated: 21-01-2020]