The MSP-AVATAR corpus is a motion capture database that explores the role of discourse functions in nonverbal human interactions. The database comprises three sessions of recordings of spontaneous dyadic interactions between six actors. The scenarios are designed to elicit different types of discourse-related gestures from the actors, who were selected from the UT Dallas art department.
The MSP-AVATAR corpus is being recorded as part of our NSF project "EAGER: Investigating the Role of Discourse Context in Speech-Driven Facial Animations" (NSF IIS: 1352950), which studies the benefits of using discourse and dialog contextual information to generate believable, human-like behaviors for conversational agents (CAs).
Generating a conversational agent requires a careful analysis of the gestures and speech that people produce during interactions. The MSP-AVATAR corpus is a rich resource for this purpose, since it includes spontaneous interactions targeting several discourse functions. We expect to use it to investigate the effect of context on nonverbal human interaction.
The recordings include audio, video, and motion capture data from the actors. The motion capture data cover the upper-body skeleton and the facial area. The categories of discourse functions were carefully chosen. The discourse functions considered are contrast, confirmation-negation, question, uncertainty, order, suggest, warn, inform, large-small (reference to size), and deictic.
Subjects are presented with a slide that describes a scenario along with some typical gestures assumed to be associated with the corresponding discourse function. They are told to behave naturally and to use body language to convey their meaning, incorporating the presented gestures or any other gestures that feel natural. For further information on the corpus, please read:
We are currently cleaning the motion capture data for analysis. We plan to share the corpus with the research community in the future.
