Cologne Language Archive Services (CLASS)
CLASS is the collective name for the projects and activities carried out at the University of Cologne between 2012 and 2014 as part of curation projects 3.1 and 3.2 of CLARIN-D Working Group 3 (F-AG 3). Some of the project results are now maintained by the DCH.
Curation project 3.1: Poio API - a framework for processing and using field research data in linguistic research
The curation project aims to make documentation data easier to search and annotate, and to bridge the data formats of language documentation archives (in particular the ELAN annotation format) and the data formats used in corpus linguistics and NLP (in particular LAF/GrAF). This goal will be achieved in two steps. First, an open and modular software library, Poio API, is established to serve as the basis for general web-based applications and as the core of project-specific software.
Second, a production-ready, server-based sample application, which provides both web services and a graphical user interface, makes the library usable and serves as a reference implementation. Technically, the project focuses on the DoBeS corpus as a central resource in the CLARIN infrastructure for the disciplines represented in F-AG 3.
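To make the idea of bridging ELAN and graph-based formats more concrete, the following is a minimal sketch and not Poio API itself. It uses only the Python standard library to read a small, hand-written ELAN-style document and turn its annotations into nodes and edges, which is the general shape of a GrAF-like annotation graph. The function name `eaf_to_graph`, the sample data and the dictionary layout of the nodes are assumptions made for illustration; a real converter would also write LAF/GrAF XML and handle many more ELAN features.

```python
import xml.etree.ElementTree as ET

# Hand-written, heavily reduced ELAN-style sample (illustrative data only).
SAMPLE_EAF = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1500"/>
  </TIME_ORDER>
  <TIER TIER_ID="utterance">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
                            TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>ni hao</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="translation" PARENT_REF="utterance">
    <ANNOTATION>
      <REF_ANNOTATION ANNOTATION_ID="a2" ANNOTATION_REF="a1">
        <ANNOTATION_VALUE>hello</ANNOTATION_VALUE>
      </REF_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def eaf_to_graph(eaf_xml):
    """Hypothetical helper: return (nodes, edges) for an ELAN-style document.

    nodes maps annotation IDs to dicts with tier, value and optional times;
    edges is a list of (parent_id, child_id) pairs.
    """
    root = ET.fromstring(eaf_xml)

    # Resolve time slot IDs to millisecond values where a value is given.
    times = {
        slot.get("TIME_SLOT_ID"): int(slot.get("TIME_VALUE"))
        for slot in root.iter("TIME_SLOT")
        if slot.get("TIME_VALUE") is not None
    }

    nodes, edges = {}, []
    for tier in root.iter("TIER"):
        tier_id = tier.get("TIER_ID")
        for wrapper in tier.findall("ANNOTATION"):
            ann = wrapper[0]  # ALIGNABLE_ANNOTATION or REF_ANNOTATION
            ann_id = ann.get("ANNOTATION_ID")
            node = {
                "tier": tier_id,
                "value": ann.findtext("ANNOTATION_VALUE", default=""),
            }
            if ann.tag == "ALIGNABLE_ANNOTATION":
                # Time-aligned annotations carry their own start and end.
                node["start"] = times.get(ann.get("TIME_SLOT_REF1"))
                node["end"] = times.get(ann.get("TIME_SLOT_REF2"))
            else:
                # Reference annotations depend on a parent annotation.
                edges.append((ann.get("ANNOTATION_REF"), ann_id))
            nodes[ann_id] = node
    return nodes, edges
```

Calling `eaf_to_graph(SAMPLE_EAF)` yields one time-aligned node for the utterance, one dependent node for the translation, and a single edge linking them. Keeping annotations as nodes and dependencies as edges is what lets tier-based field data be queried with the same tools as graph-based corpus annotations.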
Curation project 3.2: Field Linguistic Tool Repository
The curation project makes helper scripts that already exist in the research community available as web applications and web services. To this end, scripts that support research are collected and maintained, their source code is published, and their functionality is made available via an HTML-based user interface and a REST interface.
The Field Linguistic Tool Repository establishes four scripts as CLARIN resources: ToolboxPy, Toolbox2LaTeX, ToolboxSearch and the CMDI File Generator. The first three scripts support searching and replacing annotations in Toolbox files. Some of them are integrated into the CLARIN resource Poio. The CMDI File Generator enables the bulk generation of CMDI files for files from linguistic field research. The various web applications are integrated into the Cologne Language Archive Services (CLASS) and can also be accessed via a REST interface.
The CMDI File Generator will be implemented as an HTML5 web application. This web application will also work offline, so that researchers can generate metadata files during field research without an internet connection.
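The idea of bulk metadata generation can be sketched as follows. This is not the CMDI File Generator, which is an HTML5 application; it is a simplified Python illustration that writes one CMDI-style XML record per recording. The function names `build_record` and `generate_all`, the `FieldSession` component and its `Name` element are hypothetical, and the output is a reduced skeleton that has not been validated against any registered CMDI profile.

```python
import xml.etree.ElementTree as ET
from datetime import date
from pathlib import Path

# Namespace used by CMDI 1.1 records.
CMD_NS = "http://www.clarin.eu/cmd/"
ET.register_namespace("cmd", CMD_NS)


def _q(tag):
    """Return the namespace-qualified form of a tag name."""
    return f"{{{CMD_NS}}}{tag}"


def build_record(media_file, creator, created):
    """Hypothetical helper: build a reduced CMDI-style record for one file."""
    root = ET.Element(_q("CMD"), {"CMDVersion": "1.1"})

    header = ET.SubElement(root, _q("Header"))
    ET.SubElement(header, _q("MdCreator")).text = creator
    ET.SubElement(header, _q("MdCreationDate")).text = created

    resources = ET.SubElement(root, _q("Resources"))
    proxies = ET.SubElement(resources, _q("ResourceProxyList"))
    proxy = ET.SubElement(proxies, _q("ResourceProxy"), {"id": "r1"})
    ET.SubElement(proxy, _q("ResourceType")).text = "Resource"
    ET.SubElement(proxy, _q("ResourceRef")).text = media_file.name

    # "FieldSession" and "Name" are made-up names for illustration; a real
    # record uses the components defined by its metadata profile.
    components = ET.SubElement(root, _q("Components"))
    session = ET.SubElement(components, _q("FieldSession"))
    ET.SubElement(session, _q("Name")).text = media_file.stem

    return ET.ElementTree(root)


def generate_all(media_files, out_dir, creator):
    """Write one .cmdi file per media file and return the written paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    today = date.today().isoformat()
    written = []
    for media_file in map(Path, media_files):
        target = out_dir / f"{media_file.stem}.cmdi"
        tree = build_record(media_file, creator, today)
        tree.write(str(target), encoding="utf-8", xml_declaration=True)
        written.append(target)
    return written
```

A call such as `generate_all(["session_01.wav", "session_02.wav"], "cmdi_out", "A. Researcher")` would write two files into a `cmdi_out` directory. Because the sketch needs nothing beyond local files, it also illustrates why this kind of generation lends itself to offline use in the field.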
Partner Institutions
Department of General Linguistics (ASW), Institute of Linguistics (IfL), University of Cologne
Cologne Center for eHumanities (CCeH), University of Cologne
Interdisciplinary Centre for Social and Language Documentation (CIDLeS), Minde (Portugal)
Management
Prof. Dr. Nikolaus Himmelmann (ASW, IfL)
Team (Curation project 3.1)
Jonathan Blumtritt (DCH)
António Lopes (CIDLeS)
Team (Curation project 3.2)
Felix Rau (IfL)
Jonathan Blumtritt (DCH)
Sebastian Zimmer (CCeH)
Peter Bouda (CIDLeS)
Project duration
Curation project 3.1: 01.09.2012-30.09.2013
Curation project 3.2: 01.08.2013-31.03.2014
Contact person
Felix Rau (DCH)
Universitätsstr. 22, 1st floor, room 1.10
E-mail: f.rau@uni-koeln.de
Phone: +49 221 470-5832