Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

We present diffusion models for audio-driven synthesis of high-quality 3D motion, with dancing and co-speech gesticulation as example applications. Our architecture uses Conformers and translation-invariant self attention. Optional style control is provided through classifier-free guidance. We also demonstrate results on path-driven locomotion and a novel formulation of diffusion-model ensembles.

Additional Images:

: 2023 Technical Papers: Alexanderson_Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

Overview Page:

SIGGRAPH 2023: Technical Papers

Submit a story:

If you would like to submit a story about this presentation, please contact us: historyarchives@siggraph.org

ACM SIGGRAPH HISTORY ARCHIVES

“Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models” by Alexanderson, Nagy, Beskow and Henter

Conference:

Type(s):

Title:

Session/Category Title:

Presenter(s)/Author(s):

Moderator(s):

Abstract:

Additional Images:

Overview Page:

Submit a story:

Sponsored by: