Seminar für Sprachwissenschaft

The following gives a short description and link to a corpus that is created by the Quantitative Linguistics group.


The Karl Eberhards Corpus of spontaneously spoken southern German in dialogues - audio and articulatory recordings

Details zur KEC

The current paper presents a corpus containing 40 dialogues of spontaneously spoken southern German, including half an hour of articulography for 20 of the speakers. Speakers were seated in separate recording chambers, mimicking a telephone call, and recorded on individual audio channels. The corpus provides manually corrected word boundaries and automatically aligned segment boundaries, as well as part of speech tags. Annotations are provided in the Praat format. In addition to audio recordings, speakers filled out a detailed questionnaire, assessing among others their audio-visual consumption habits.