Table of Contents
Voice recordings are privacy-sensitive data; please use them respectfully and for academic purposes only.
NiCLS
Enhanced amplitude modulations contribute to the Lombard intelligibility benefit: Evidence from the Nijmegen Corpus of Lombard Speech. The Journal of the Acoustical Society of America, 147(2), 721-730, doi:10.1121/10.0000646.(2020).
- Nijmegen Corpus of Lombard Speech [NiCLS]
- Lead author: Hans Rutger Bosker
- https://hdl.handle.net/1839/21ee5744-b5dc-4eed-9693-c37e871cdaf6
- Dutch
- 42 talkers (37 F/5 M); each reading a folk story of 56 utterances; in both plain speech (read in quiet) and Lombard speech (read in noise over headphones)
- 3968 wav files (1984 Lombard-plain pairs) + forced-alignment TextGrids
- CC BY-NC-ND 4.0 license
PiNCeR
Control of speaking rate is achieved by switching between qualitatively distinct cognitive ‘gaits’: Evidence from simulation. Psychological Review, 127(2), 281-304, doi:10.1037/rev0000172.(2020).
- Picture Naming at Cued Rates [PiNCeR] corpus
- Lead author: Joe Rodd
- https://hdl.handle.net/1839/7c210d30-bb55-4cbe-9eeb-baf18570460c
- Dutch
- 25 talkers (21 F/4 M) naming disyllabic pictures arranged on a ‘clock face’; produced at three different cued rates: fast, medium, slow
- with (manual) word-level and (forced-aligned) syllable-level annotations; with eye-tracking data
- CC BY 4.0 license
Minimal stress pairs
- List of minimal pairs differing only in lexical stress
- An example from English: FOREbear [noun] ‘ancestor’ vs. forBEAR [verb] ’to be patient’
- Lead author: Hans Rutger Bosker
- Download the Excel file here
- Dutch, English, German, Italian, Romanian, Spanish (each language in an individual sheet)
- The two members of a pair are primarily distinguished by stress. However, there may also occassionally be subtle segmental differences involved (e.g., vowel reduction in unstressed syllables in English and German)
- Sources and further references listed at the top of each individual sheet; thanks to everyone who helped out!
- CC BY 4.0 license
Other corpora
Many other speech, video, and picture corpora are publicly available nowadays. Please see Other resources for some examples that we ourselves have used in the past.