Facial expressions, vocal intonation, gestures and other non-vocal cues play an important role in communication. However, there are not many freely available audiovisual conversational corpora which can be employed to study such a relevant topic. Here there is a short list of some of them. If you are compiling a corpus or have a corpus available and you wish to add it to the list, please write to Zoraida Callejas at:
SCARE: A Situated Corpus with Annotated Referring Expressions (ENGLISH)
Each session in this corpus records the joint problem-solving of a pair of human partners working through a treasure-hunt style task in a virtual world. The corpus includes: quicktime movies, audio files, video time-aligned transcriptions and annotation, audio-aligned word-level transcriptions, positional information from the virtual world’s log and state-change information on items that could be manipulated in the virtual world. The corpus is freely available for research and educational use only, and is especially suited for investigating task-oriented dialog or deictic expressions in English. Corpus web page (more information & download): http://slate.cse.ohio-state.edu/quake-corpora/scare/
The CHILDES Database (MULTILINGUAL)
The CHILDES database contains transcript and media data collected from conversations between young children and their playmates and caretakers. Conversations with older children and adults are available from TalkBank. The corpus includes: audio and video files and transcriptions in CHAT and CA/CHAT formats. The conversations are divided in the following categories: American English, British English, Bilingual (bilingual language acquisition), Clinical (Data from adults and children with language disorders in various languages), Narrative (data on narrations by adults and children in various languages), Germanic Languages (data on the acquisition of Germanic languages), Romance Languages (data on the acquisition of Romance languages), Slavic Languages, East Asian Languages, Celtic Languages and Other (data on the acquisition of other languages). The corpus is distributed under the GPLv2 license. Corpus web page (more information & download): http://childes.psy.cmu.edu/
IFA Dialog Video corpus (DUTCH)
The IFA Dialog Video corpus is a collection of annotated video recordings of friendly Face-to-Face dialogs. It is modelled on the Face-to-Face dialogs in the Spoken Dutch Corpus (CGN). For this corpus 20 dialog conversations of 15 minutes we recorded and annotated, in total 5 hours of speech. The corpus includes: compressed video (MPEG-4, DivX3, OGV), speech files (RIF/WAV, SPEEX, Ogg Vorbis, MP3) and annotations. The annotations are comprised of: word and phoneme aligned orthographic transcription, part-of-Speech labelling, gaze direction, readable dialog transcripts, SMIL xml files and metadata on the recordings and the speakers. The corpus is distributed under the GPLv2 license. The procedures and design of the corpus were adapted to make this corpus useful for other researchers of Dutch speech. Corpus web page (more information & download): http://www.fon.hum.uva.nl/IFA-SpokenLanguageCorpora/IFADVcorpus/
