Bicocca Open Archive

In the banking and finance sectors, members of the business units focused on Trend and Risk Analysis daily process internal and external visually-rich documents including text, images, and tables. Given a facet (i.e., topic) of interest, they are particularly interested in retrieving the top trending keywords related to it and then use them to annotate the most relevant document elements (e.g., text paragraphs, images or tables). In this paper, we explore the use of both open-source and proprietary Large Language Models to automatically generate lists of facet-relevant keywords, automatically produce free-text descriptions of both keywords and multimedia document content, and then annotate documents by leveraging textual similarity approaches. The preliminary results, achieved on English and Italian documents, show that OpenAI GPT-4 achieves superior performance in keyword description generation and multimedia content annotation, while the open-source Meta AI Llama2 model turns out to be highly competitive in generating additional keywords.

Gallipoli, G., Papicchio, S., Vaiani, L., Cagliero, L., Miola, A., Borghi, D. (2024). Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models. In Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing (pp.130-136). European Language Resources Association (ELRA).

Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models

Gallipoli G.;Papicchio S.;Vaiani L.;Cagliero L.;Miola A.;Borghi D.

2024

Abstract

In the banking and finance sectors, members of the business units focused on Trend and Risk Analysis daily process internal and external visually-rich documents including text, images, and tables. Given a facet (i.e., topic) of interest, they are particularly interested in retrieving the top trending keywords related to it and then use them to annotate the most relevant document elements (e.g., text paragraphs, images or tables). In this paper, we explore the use of both open-source and proprietary Large Language Models to automatically generate lists of facet-relevant keywords, automatically produce free-text descriptions of both keywords and multimedia document content, and then annotate documents by leveraging textual similarity approaches. The preliminary results, achieved on English and Italian documents, show that OpenAI GPT-4 achieves superior performance in keyword description generation and multimedia content annotation, while the open-source Meta AI Llama2 model turns out to be highly competitive in generating additional keywords.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Large Language Models; Trend and Risk analysis; Visually-Rich Document Understanding;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				Joint Workshop of the 7th Financial Technology and Natural Language Processing, 5th Knowledge Discovery from Unstructured Data in Financial Services and 4th Economics and Natural Language Processing, FinNLP-KDF-ECONLP 2024 - 20 May 2024
			
	Anno del convegno
	
				2024
			
	Curatori della monografia
	
				Chen, CC; Liu, X; Hahn, U; Nourbakhsh, A; Ma, Z; Smiley, C; Hoste, V; Das, SR; Li, M; Ghassemi, M; Huang, HH; Takamura, H; Chen, HH
			
	Titolo degli atti
	
				Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing
			
	ISBN del volume degli atti
	
				9782493814197
			
	Collana o serie
	
				INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS
			
	Data di pubblicazione
	
				2024
			
	Pagina iniziale
	
				130
			
	Pagina finale
	
				136
			
	URL alternativo
	
				https://aclanthology.org/2024.finnlp-1.13/
			
	Fulltext
	
				open
			
	Citazione
	
				Gallipoli, G., Papicchio, S., Vaiani, L., Cagliero, L., Miola, A., Borghi, D. (2024). Keyword-based Annotation of Visually-Rich Document Content for Trend and Risk Analysis using Large Language Models. In Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing (pp.130-136). European Language Resources Association (ELRA).
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
Gallipoli-2024-FinNLP-KDF-ECONLP 2024-VoR.pdf accesso aperto Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 1.07 MB Formato Adobe PDF Visualizza/Apri	1.07 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/573841

Citazioni

1

ND

Social impact