A corpus of spoken Kurdish collected during fieldwork in the Kurdish-speaking regions of Iran
Description
The dataset consists of video-recorded interviews with adult native speakers of Kurdish from the Kurdish-speaking regions of Iran (Kurdistan and Kermanshah), collected during fieldwork in 2021. The fieldwork was exploratory in nature and did not follow predefined research questions or hypotheses. Its primary aim was to identify linguistically relevant and potentially interesting phenomena and to provide empirical orientation for future research projects. The data collection was conducted in cooperation with the Atlas of Kurdish Dialects. Interviews were guided by a predefined set of questions and carried out by local informants from the host community. All recordings were transcribed using ELAN.
Core team
- Vahid Mortezapour Kouhanabni (https://orcid.org/0000-0002-6832-0057)
- Alessia Cassarà
Project Information
Project title: Multilingualism in Iran - a sociolinguistic approach to grammar (MISOGRA)
Funder: DAAD
Cite as
Mortezapour Kouhanabni, Vahid & Alessia Cassarà 2026. A corpus of spoken Kurdish collected during fieldwork in the Kurdish-speaking regions of Iran. Data Center for the Humanities. https://doi.org/10.18716/dch/a.00000060
Archive Information
DOI: https://doi.org/10.18716/dch/a.00000060
Date of the archival: 2026-02-17
Size of the package: 903 GB
Contact Name: Felix Rau (DCH) - info(at)dch.uni-koeln(dot)de
Data formats
- Video recordings: insv / insp
- Audio recordings: wav
- Transcriptions and annotations: ELAN .eaf files
- English translations: included in ELAN annotation tiers