
A corpus of spoken Kurdish collected during fieldwork in the Kurdish-speaking regions of Iran

Description

The dataset consists of video-recorded interviews with adult native speakers of Kurdish from the Kurdish-speaking regions of Iran (Kurdistan and Kermanshah), collected during fieldwork in 2021. The fieldwork was exploratory in nature and did not follow predefined research questions or hypotheses. Its primary aim was to identify linguistically relevant and potentially interesting phenomena and to provide empirical orientation for future research projects. The data collection was conducted in cooperation with the Atlas of Kurdish Dialects. Interviews were guided by a predefined set of questions and carried out by local informants from the host community. All recordings were transcribed using ELAN.

Project Information

Project title: Multilingualism in Iran - a sociolinguistic approach to grammar (MISOGRA)
Funder: DAAD

Cite as

Mortezapour Kouhanabni, Vahid & Alessia Cassarà. 2026. A corpus of spoken Kurdish collected during fieldwork in the Kurdish-speaking regions of Iran. Data Center for the Humanities. https://doi.org/10.18716/dch/a.00000060

Archive Information

DOI: https://doi.org/10.18716/dch/a.00000060
Archival date: 2026-02-17
Package size: 903 GB
Contact name: Felix Rau (DCH)

Data formats

  • Video recordings: .insv / .insp
  • Audio recordings: .wav
  • Transcriptions and annotations: ELAN .eaf files
  • English translations: included in ELAN annotation tiers
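
Because the transcriptions and the English translations are stored as tiers in ELAN .eaf files, which are plain XML, they can be read without ELAN itself. The following is a minimal sketch in Python using only the standard library. It assumes the standard EAF layout (TIME_ORDER, TIER, ALIGNABLE_ANNOTATION, REF_ANNOTATION). The function name read_eaf_tiers, the sample document, and the tier names "transcription" and "translation_en" are hypothetical and chosen for illustration only; the actual tier names used in this corpus are not stated here and should be checked in the files.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical miniature .eaf document. The tier names and annotation
# values are placeholders, not taken from the corpus.
SAMPLE_EAF = b"""<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="1500"/>
  </TIME_ORDER>
  <TIER TIER_ID="transcription">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1"
                            TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>(Kurdish utterance)</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
  <TIER TIER_ID="translation_en" PARENT_REF="transcription">
    <ANNOTATION>
      <REF_ANNOTATION ANNOTATION_ID="a2" ANNOTATION_REF="a1">
        <ANNOTATION_VALUE>(English translation)</ANNOTATION_VALUE>
      </REF_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""


def read_eaf_tiers(source):
    """Return {tier_id: [(start_ms, end_ms, text), ...]} for one .eaf file.

    `source` may be a file path or a binary file-like object.
    """
    root = ET.parse(source).getroot()

    # Map time-slot ids to millisecond offsets (None if a slot is unaligned).
    slots = {}
    for slot in root.iter("TIME_SLOT"):
        value = slot.get("TIME_VALUE")
        slots[slot.get("TIME_SLOT_ID")] = int(value) if value is not None else None

    # Time spans of time-aligned annotations, keyed by annotation id.
    times = {
        ann.get("ANNOTATION_ID"): (
            slots.get(ann.get("TIME_SLOT_REF1")),
            slots.get(ann.get("TIME_SLOT_REF2")),
        )
        for ann in root.iter("ALIGNABLE_ANNOTATION")
    }

    # Reference annotations (e.g. translations) point at a parent annotation.
    parents = {
        ann.get("ANNOTATION_ID"): ann.get("ANNOTATION_REF")
        for ann in root.iter("REF_ANNOTATION")
    }

    def resolve_span(ann_id):
        # Follow parent references until a time-aligned annotation is found.
        seen = set()
        while ann_id in parents and ann_id not in seen:
            seen.add(ann_id)
            ann_id = parents[ann_id]
        return times.get(ann_id, (None, None))

    tiers = {}
    for tier in root.iter("TIER"):
        rows = []
        for wrapper in tier.findall("ANNOTATION"):
            ann = wrapper[0]  # ALIGNABLE_ANNOTATION or REF_ANNOTATION
            text = ann.findtext("ANNOTATION_VALUE", default="")
            start, end = resolve_span(ann.get("ANNOTATION_ID"))
            rows.append((start, end, text))
        tiers[tier.get("TIER_ID")] = rows
    return tiers


if __name__ == "__main__":
    for tier_id, rows in read_eaf_tiers(io.BytesIO(SAMPLE_EAF)).items():
        print(tier_id, rows)
```

For a real file from the package, pass its path instead of the in-memory sample, for example read_eaf_tiers("interview.eaf"), where the file name is again a placeholder. Listing the keys of the returned dictionary is a quick way to find out which tier holds the English translation.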