Multimodal learning has recently emerged as a powerful paradigm for financial forecasting, enabling the integration of heterogeneous data sources such as market time series, textual news, and relational graphs. This survey presents a unified taxonomy for multimodal financial forecasting models, structured along four key dimensions: input modalities, modelling architectures, fusion strategies, and predictive tasks. Using this taxonomy, we conduct a systematic review of 35 representative works published between 2018 and 2025, highlighting methodological trends, design choices, and performance patterns. Our analysis identifies persistent challenges, including temporal misalignment, modality imbalance, missing or noisy data, and limited cross-market generalization. We also discuss emerging trends and promising research directions, such as adaptive fusion, incomplete modality learning, and the integration of large language models and temporal graph neural networks, and analyse how architectural and fusion design choices impact practical considerations such as interpretability and deployability, aiming to bridge methodological innovation with domain-specific requirements.

D'Amico, S., Mercorio, F., Nobani, N., Sperlì, G., Ventre, C. (2026). Learning Across Modalities: A Systematic Survey of Multimodal Models for Financial Analysis. INFORMATION FUSION [10.1016/j.inffus.2026.104249].

Learning Across Modalities: A Systematic Survey of Multimodal Models for Financial Analysis

D'Amico, Simone;Mercorio, Fabio;Nobani, Navid;
2026

Abstract

Multimodal learning has recently emerged as a powerful paradigm for financial forecasting, enabling the integration of heterogeneous data sources such as market time series, textual news, and relational graphs. This survey presents a unified taxonomy for multimodal financial forecasting models, structured along four key dimensions: input modalities, modelling architectures, fusion strategies, and predictive tasks. Using this taxonomy, we conduct a systematic review of 35 representative works published between 2018 and 2025, highlighting methodological trends, design choices, and performance patterns. Our analysis identifies persistent challenges, including temporal misalignment, modality imbalance, missing or noisy data, and limited cross-market generalization. We also discuss emerging trends and promising research directions, such as adaptive fusion, incomplete modality learning, and the integration of large language models and temporal graph neural networks, and analyse how architectural and fusion design choices impact practical considerations such as interpretability and deployability, aiming to bridge methodological innovation with domain-specific requirements.
Articolo in rivista - Articolo scientifico
Fusion Strategies; Multimodal Analysis; Market predictive analysis; Stock Market analysis
English
24-feb-2026
2026
none
D'Amico, S., Mercorio, F., Nobani, N., Sperlì, G., Ventre, C. (2026). Learning Across Modalities: A Systematic Survey of Multimodal Models for Financial Analysis. INFORMATION FUSION [10.1016/j.inffus.2026.104249].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/594430
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact