In the banking and finance sectors, members of the business units focused on Trend and Risk Analysis daily process internal and external visually-rich documents including text, images, and tables. Given a facet (i.e., topic) of interest, they are particularly interested in retrieving the top trending keywords related to it and then use them to annotate the most relevant document elements (e.g., text paragraphs, images or tables). In this paper, we explore the use of both open-source and proprietary Large Language Models to automatically generate lists of facet-relevant keywords, automatically produce free-text descriptions of both keywords and multimedia document content, and then annotate documents by leveraging textual similarity approaches. The preliminary results, achieved on English and Italian documents, show that OpenAI GPT-4 achieves superior performance in keyword description generation and multimedia content annotation, while the open-source Meta AI Llama2 model turns out to be highly competitive in generating additional keywords.

Gallipoli, G., Papicchio, S., Vaiani, L., Cagliero, L., Miola, A., Borghi, D. (2024). Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models. In Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing (pp.130-136). European Language Resources Association (ELRA).

Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models

Miola A.;
2024

Abstract

In the banking and finance sectors, members of the business units focused on Trend and Risk Analysis daily process internal and external visually-rich documents including text, images, and tables. Given a facet (i.e., topic) of interest, they are particularly interested in retrieving the top trending keywords related to it and then use them to annotate the most relevant document elements (e.g., text paragraphs, images or tables). In this paper, we explore the use of both open-source and proprietary Large Language Models to automatically generate lists of facet-relevant keywords, automatically produce free-text descriptions of both keywords and multimedia document content, and then annotate documents by leveraging textual similarity approaches. The preliminary results, achieved on English and Italian documents, show that OpenAI GPT-4 achieves superior performance in keyword description generation and multimedia content annotation, while the open-source Meta AI Llama2 model turns out to be highly competitive in generating additional keywords.
paper
Large Language Models; Trend and Risk analysis; Visually-Rich Document Understanding;
English
Joint Workshop of the 7th Financial Technology and Natural Language Processing, 5th Knowledge Discovery from Unstructured Data in Financial Services and 4th Economics and Natural Language Processing, FinNLP-KDF-ECONLP 2024 - 20 May 2024
2024
Chen, CC; Liu, X; Hahn, U; Nourbakhsh, A; Ma, Z; Smiley, C; Hoste, V; Das, SR; Li, M; Ghassemi, M; Huang, HH; Takamura, H; Chen, HH
Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing
9782493814197
2024
130
136
https://aclanthology.org/2024.finnlp-1.13/
open
Gallipoli, G., Papicchio, S., Vaiani, L., Cagliero, L., Miola, A., Borghi, D. (2024). Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models. In Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing (pp.130-136). European Language Resources Association (ELRA).
File in questo prodotto:
File Dimensione Formato  
Gallipoli-2024-FinNLP-KDF-ECONLP 2024-VoR.pdf

accesso aperto

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 1.07 MB
Formato Adobe PDF
1.07 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/573841
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
Social impact