“PictorialAttributes: Depicting Multiple Attributes with Realistic Imaging” by Dahary, Liu, Patashnik and Cohen-Or
Conference:
Type(s):
Title:
- PictorialAttributes: Depicting Multiple Attributes with Realistic Imaging
Session/Category Title: Rendering & Displays
Presenter(s)/Author(s):
Abstract:
Traditional visualizations often use abstract graphics, limiting understanding and memorability. Existing methods for pictorial visualization are more engaging, but often create disjointed compositions. To address this, we propose PictorialAttributes, a technique utilizing LLMs and diffusion models to depict data attributes. Examples show its promise for compelling and informative pictorial visualizations.
References:
[1]
Michelle A Borkin, Azalea A Vo, Zoya Bylinskii, Phillip Isola, Shashank Sunkavalli, Aude Oliva, and Hanspeter Pfister. 2013. What makes a visualization memorable?IEEE transactions on visualization and computer graphics 19, 12 (2013), 2306?2315.
[2]
Omer Dahary, Or Patashnik, Kfir Aberman, and Daniel Cohen-Or. 2024. Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation. arXiv preprint arXiv:2403.16990 (2024).
[3]
Long Lian, Boyi Li, Adam Yala, and Trevor Darrell. 2023. LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models. arXiv preprint arXiv:2305.13655 (2023).
[4]
Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, 2022. Training language models to follow instructions with human feedback. Advances in neural information processing systems 35 (2022), 27730?27744.
[5]
Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas M?ller, Joe Penna, and Robin Rombach. 2023. Sdxl: Improving latent diffusion models for high-resolution image synthesis. arXiv preprint arXiv:2307.01952 (2023).
[6]
Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, and Tat-Seng Chua. 2023. LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation. arXiv preprint arXiv:2308.05095 (2023).
[7]
Hugo Romat, Nathalie Henry Riche, Christophe Hurter, Steven Drucker, Fereshteh Amini, and Ken Hinckley. 2020. Dear pictograph: Investigating the role of personalization and immersion for consuming and enjoying visualizations. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1?13.
[8]
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj?rn Ommer. 2022. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 10684?10695.
[9]
Yang Shi, Pei Liu, Siji Chen, Mengdi Sun, and Nan Cao. 2022. Supporting expressive and faithful pictorial visualization design with visual style transfer. IEEE Transactions on Visualization and Computer Graphics 29, 1 (2022), 236?246.
[10]
Haijun Xia, Nathalie Henry Riche, Fanny Chevalier, Bruno De Araujo, and Daniel Wigdor. 2018. DataInk: Direct and creative data-oriented drawing. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1?13.
[11]
Jiayi Eris Zhang, Nicole Sultanum, Anastasia Bezerianos, and Fanny Chevalier. 2020. DataQuilt: Extracting visual elements from images to craft pictorial visualizations. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1?13.
[12]
Dewei Zhou, You Li, Fan Ma, Zongxin Yang, and Yi Yang. 2024. MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis. arXiv preprint arXiv:2402.05408 (2024).