A Multimodal Scheduler for Synchronized Humanoid Robot Gesture and Speech
TL;DRAbstract
In order to engage in natural and fluent human-robot interaction, humanoid robot companions must be able to produce speech-accompanying non-verbal behavior including hand and arm gestures. In human communication, gestures are considered an integral part of the human thinking process. Accordingly, they are found to be finely synchronized with the accompanying linguistic affiliate [1]. Many researchers have emphasized the importance of this temporal synchrony in terms of coexpressiveness (e.g. [2], [3]). However, for a humanoid robot required to generate speech and gesture, an appropriate synchronization of the two modalities still poses a major challenge. In many existing approaches used for virtual conversational agents or robotic platforms, synchronization of different modalities is either achieved only approximately or by solely adapting one modality to the other, e.g. by adjusting gesture speed to the timing of running speech. Given the limitations of robotic platforms, e.g. motor v
Chat with Paper
AI Agents for this Paper
In order to engage in natural and fluent human-robot interaction, humanoid robot companions must be able to produce speech-accompanying non-verbal behavior including hand and arm gestures. In human communication, gestures are considered an integral part of the human thinking process. Accordingly, they are found to be finely synchronized with the accompanying linguistic affiliate [1]. Many researchers have emphasized the importance of this temporal synchrony in terms of coexpressiveness (e.g. [2], [3]). However, for a humanoid robot required to generate speech and gesture, an appropriate synchronization of the two modalities still poses a major challenge. In many existing approaches used for virtual conversational agents or robotic platforms, synchronization of different modalities is either achieved only approximately or by solely adapting one modality to the other, e.g. by adjusting gesture speed to the timing of running speech. Given the limitations of robotic platforms, e.g. motor v
Keywords
Chat
Click to start Chat