“Simplifying Facial Animation using Deep Learning based Phoneme Recognition” by Ivanov and Havaldar – ACM SIGGRAPH HISTORY ARCHIVES

“Simplifying Facial Animation using Deep Learning based Phoneme Recognition” by Ivanov and Havaldar

  • ©

Conference:


Type(s):


Interest Area:


    Production & Animation

Title:

    Simplifying Facial Animation using Deep Learning based Phoneme Recognition

Session/Category Title:   Big Rigs: Advances in Rigging


Presenter(s)/Author(s):



Abstract:


    We present a machine learning framework for facial animation that is simple, easy to implement and integrates well into an artist friendly workflow. The framework employs a pre-trained deep learning model used for phoneme extraction. Each phoneme is mapped to artist defined face shapes resulting in sparse key frames generating quick and synchronized looking dialogue animations, which can optionally be enhanced using familiar artist friendly workflows.

References:


    [1] Tero Karras, Timo Aila, Samuli Laine, Antti Herva, and Jaakko Lehtinen. 2017. Audio-driven facial animation by joint end-to-end learning of pose and emotion. ACM Trans. Graph. 36, 4 (2017), 94:1–94:12. https://doi.org/10.1145/3072959.3073658

Additional Images:

©

ACM Digital Library Publication:



Overview Page:



Submit a story:

If you would like to submit a story about this presentation, please contact us: historyarchives@siggraph.org