“Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models” by Alexanderson, Nagy, Beskow and Henter

Next: “Literacy LABELS: Emergent Literacy Application... »

« Previous: “Listen Up! Real-Time Auditory Interfaces for...

Conference:

Type(s):

Technical Papers

Title:

Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

Session/Category Title: Character Animation: Knowing What To Do With Your Hands

Presenter(s)/Author(s):

Simon Alexanderson

Rajmund Nagy

Jonas Beskow

Gustav Eje Henter

Moderator(s):

Michael Neff

Abstract:

We present diffusion models for audio-driven synthesis of high-quality 3D motion, with dancing and co-speech gesticulation as example applications. Our architecture uses Conformers and translation-invariant self attention. Optional style control is provided through classifier-free guidance. We also demonstrate results on path-driven locomotion and a novel formulation of diffusion-model ensembles.

Additional Images:

©Simon Alexanderson, Rajmund Nagy, Jonas Beskow, and Gustav Eje Henter

Overview Page:

SIGGRAPH 2023: Technical Papers

Sponsored by:

All artwork and text on this site are the exclusive copyrighted works of the artist or author. ALL RIGHTS RESERVED. Any unlawful redistribution or reproduction of images featured on this site without prior express written authorization of the copyright owner is strictly prohibited.