From Style to Identity: AI Pipelines for Visual and Character Coherence in Film

Zhiyu Zhang

Conference:

SIGGRAPH 2025

Type(s):

Posters

Title:

From Style to Identity: AI Pipelines for Visual and Character Coherence in Film

Session/Category Title:

Images, Video & Computer Vision

Presenter(s)/Author(s):

Zhiyu Zhang

Abstract:

We introduce a modular, open-source pipeline that combines multiple custom-trained LoRA and ControlNet models to disentangle style and identity, enabling fast, visually and narratively consistent AI-generated short films, validated through two award-winning multi-scene productions.
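
The abstract does not say how the adapters are combined. As a rough illustration only: LoRA [2] expresses a fine-tune as a low-rank update BA added to a frozen weight matrix, so separate style and identity adapters can in principle be applied to the same base weights with independent strengths. The sketch below assumes this simple additive combination. The names `apply_loras`, `style_lora`, and `identity_lora` and the tiny matrices are hypothetical, and ControlNet conditioning is not shown.

```python
# Illustrative sketch only: hypothetical names and toy values,
# not the poster's actual pipeline or any library's API.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def apply_loras(base, adapters):
    """Return base + sum(scale * B @ A) over all (B, A, scale) adapters."""
    merged = [row[:] for row in base]  # copy so the base weights stay frozen
    for b, a, scale in adapters:
        delta = matmul(b, a)  # low-rank update, same shape as base
        for i, row in enumerate(delta):
            for j, value in enumerate(row):
                merged[i][j] += scale * value
    return merged

base = [[1.0, 0.0], [0.0, 1.0]]

# Rank-1 adapters: B is 2x1, A is 1x2, followed by a strength (assumed values).
style_lora = ([[1.0], [0.0]], [[0.0, 2.0]], 0.5)
identity_lora = ([[0.0], [1.0]], [[4.0, 0.0]], 0.25)

merged = apply_loras(base, [style_lora, identity_lora])
style_only = apply_loras(base, [style_lora])
```

Because each adapter carries its own strength, the style contribution can be changed or removed without touching the identity contribution, which is the sense in which the two are kept separable in this toy setting.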

References:

[1] ComfyUI. 2025. ComfyUI. GitHub repository. Retrieved from github.com/comfyanonymous/ComfyUI
[2] Edward J. Hu, Yelong Shen, Phil Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Lu Wang, and Weizhu Chen. 2021. LoRA: Low-Rank Adaptation of Large Language Models. arXiv preprint arXiv:2106.09685.
[3] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. 2022. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 10684–10695.

ACM Digital Library Publication:

From Style to Identity: AI Pipelines for Visual and Character Coherence in Film

Overview Page:

SIGGRAPH 2025: Posters

ACM SIGGRAPH HISTORY ARCHIVES