“From Style to Identity: AI Pipelines for Visual and Character Coherence in Film” by Zhang – ACM SIGGRAPH HISTORY ARCHIVES

“From Style to Identity: AI Pipelines for Visual and Character Coherence in Film” by Zhang

  • 2025 Posters_Zhang_From Style to Identity

Conference:


Type(s):


Title:

    From Style to Identity: AI Pipelines for Visual and Character Coherence in Film

Session/Category Title:

    Images, Video & Computer Vision

Presenter(s)/Author(s):



Abstract:


    We introduce a modular, open-source pipeline that combines multiple custom-trained LoRA and ControlNet models to disentangle style and identity, enabling fast, visually and narratively consistent AI-generated short films,validated through two award-winning multi-scene productions.

References:


    [1] ComfyUI. 2025. ComfyUI. GitHub repository. Retrieved from github.com/comfyanonymous/ComfyUI
    [2 Edward J. Hu, Yelong Shen, Phil Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Lu Wang, and Weizhu Chen. 2021. LoRA: Low-Rank Adaptation of Large Language Models. arXiv preprint arXiv:2106.09685.
    [3] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. 2022. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 10684–10695.


ACM Digital Library Publication:



Overview Page:



Submit a story:

If you would like to submit a story about this presentation, please contact us: historyarchives@siggraph.org