For the last decade, there has been a push to use multi-dimensional (latent) spaces to represent concepts; and yet how to manipulate these concepts or reason with them remains largely unclear. Some recent methods exploit multiple latent representations and their connection, making this research question even more entangled. Our goal is to understand how operations in the latent space affect the underlying concepts. We hence explore the task of concept blending through diffusion models. Diffusion models are based on a connection between a latent representation of textual prompts and a latent space that enables image reconstruction and generation. This task allows us to try different text-based combination strategies, and evaluate them visually. Our conclusion is that concept blending through space manipulation is possible, although the best strategy depends on the context.
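As a rough illustration of what a text-based combination strategy in latent space could look like, the sketch below linearly interpolates two prompt embeddings. This is an assumption made for illustration only: the abstract does not specify which strategies the authors tested, the function name `blend_embeddings` is hypothetical, and the toy vectors stand in for the much larger embeddings a real text encoder would produce.

```python
from typing import List, Sequence


def blend_embeddings(a: Sequence[float], b: Sequence[float], weight: float = 0.5) -> List[float]:
    """Linearly interpolate two equal-length embedding vectors.

    weight = 0.0 returns `a`, weight = 1.0 returns `b`.
    Hypothetical helper; not taken from the paper.
    """
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return [(1.0 - weight) * x + weight * y for x, y in zip(a, b)]


# Toy 4-dimensional stand-ins for two prompt embeddings (made-up numbers).
first_concept = [1.0, 0.0, 2.0, -1.0]
second_concept = [0.0, 1.0, 0.0, 1.0]

# An equal-weight blend; in a real pipeline this vector would condition
# the diffusion model in place of a single prompt embedding.
blended = blend_embeddings(first_concept, second_concept, weight=0.5)
```

In practice the blended vector would be passed to the image generator as its conditioning input, and the result judged visually, which matches the evaluation approach the abstract describes.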
Olearo, L., Longari, G., Melzi, S., Raganato, A., Penaloza, R. (2024). How to Blend Concepts in Diffusion Models. In Proceedings of The Eighth Image Schema Day co-located with The 23rd International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024) (pp. 1-12). CEUR-WS.
How to Blend Concepts in Diffusion Models
Olearo L.; Longari G.; Melzi S.; Raganato A.; Penaloza R.
2024
File | Size | Format | |
---|---|---|---|
Olearo-2024-AI IA 2024-AAM.pdf (open access). Description: deposited in arXiv. Attachment type: Author’s Accepted Manuscript, AAM (post-print). License: Creative Commons | 2.81 MB | Adobe PDF | View/Open |
Olearo-2024-AI IA 2024-VoR.pdf (open access). Attachment type: Publisher’s Version (Version of Record, VoR). License: Creative Commons | 1.72 MB | Adobe PDF | View/Open |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.