In this paper we present the multimodal data collected for developing a system able to influence the behavior of small groups in an informal and non goal-oriented conversation scenario. The prototype system looks like a table in a museum cafeteria and it is aimed at inducing the people sitting around to talk about their visit to the museum. To this aim, the system provides visual cues to foster participants’ engagement in the conversation. The cues are contextualized by automatically monitoring the group dynamics and by continuously planning and executing minimalist strategies based on the participants’ speaking activity and visual attention. In the paper, we shortly describe the system, its main components and functionalities. We then present the two data collections carried out to gather multimodal data to tune the basic perceptual modules of the system (voice activity detector and face tracker) and to improve the presentation engine of the visual cues.
Multimodal Corpora for an Automatic System Fostering Participants’ Engagement in Informal Conversations around a Museum Café Table
Mana, Nadia;Cappelletti, Alessandro;Stock, Oliviero;Zancanaro, Massimo
2011-01-01
Abstract
In this paper we present the multimodal data collected for developing a system able to influence the behavior of small groups in an informal and non goal-oriented conversation scenario. The prototype system looks like a table in a museum cafeteria and it is aimed at inducing the people sitting around to talk about their visit to the museum. To this aim, the system provides visual cues to foster participants’ engagement in the conversation. The cues are contextualized by automatically monitoring the group dynamics and by continuously planning and executing minimalist strategies based on the participants’ speaking activity and visual attention. In the paper, we shortly describe the system, its main components and functionalities. We then present the two data collections carried out to gather multimodal data to tune the basic perceptual modules of the system (voice activity detector and face tracker) and to improve the presentation engine of the visual cues.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.