skip to content

File Format Whitelist

This list documents the file formats that are approved for submission and archiving at the Language Archvie Cologne. For a more detailed and descriptions of best practices regarding file formats see the LAC Submission Guidelines. For each data type, the acceptable file formats are given in the order of preference – from preferred to acceptable. However, all listed formats are equally accepted.

Metadata Files

BLAM CMDI

Audio files

WAV with Linear Pulse Code Modulation (LPCM) audio (preferred sampling rate 48 kHz and bit depth 16 bit)

Video Files

MP4 : Video codec h.264 (preferred profile: main, level: 4.0, 1080p, 30fps), Audio encoding AAC (LC) (preferred sampling rate 48 kHz and bit rate 128–384 kbps). MOV : Video codec h.264 (preferred profile: main, level: 4.0, 1080p, 30fps), Audio encoding LPCM (preferred sampling rate 48 kHz and bit depth 16 bit).

Annotations

Elan annotations

Praat TextGrid

Exmeralda transcriptions

TEI (in particular ISO 24624:2016)

FleX XML

Toolbox Files

Written resources

Text files (plain UTF–8 encoded)

PDF/A

XHTML

Still Images

TIFF

JPEG2000

PNG

JPEG

Additional metadata

CMDI Metadata

other XML Metadata formats

CSV encoded metadata (preferably with W3C Metadata Vocabulary for Tabular Data annotations)