The capabilities of LLMs represent a pivotal step in transforming how we manage and interact with information and data. We witness an increasingly pervasive use of such models in various computational tasks. In some preliminary works, attempts to integrate Knowledge Graphs and Large Language Models (LLMs) can be identified, in particular, to perform the classic tasks related to the construction of Knowledge Graphs through semantic annotation of texts. Nowadays, tables are widely used and play a crucial role in creating, organising, and sharing information that could be used to produce factual knowledge to be integrated into a Knowledge Graph. However, table-to-KG techniques through LLM have not been extensively investigated. This paper presents stEELlm, an innovative Semantic Table Interpretation approach obtained by fine-tuning the Mixtral 8x7B model. Conducted experiments demonstrate the capabilities of our model to successfully create semantic annotations of heterogeneous datasets, a scenario where classic approaches based on heuristics tend to fail.

Cremaschi, M., D'Adda, F., Maurino, A. (2025). stEELlm: An LLM for Generating Semantic Annotations of Tabular Data. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY [10.1145/3719206].

stEELlm: An LLM for Generating Semantic Annotations of Tabular Data

Cremaschi, Marco
;
D'Adda, Fabio;Maurino, Andrea
2025

Abstract

The capabilities of LLMs represent a pivotal step in transforming how we manage and interact with information and data. We witness an increasingly pervasive use of such models in various computational tasks. In some preliminary works, attempts to integrate Knowledge Graphs and Large Language Models (LLMs) can be identified, in particular, to perform the classic tasks related to the construction of Knowledge Graphs through semantic annotation of texts. Nowadays, tables are widely used and play a crucial role in creating, organising, and sharing information that could be used to produce factual knowledge to be integrated into a Knowledge Graph. However, table-to-KG techniques through LLM have not been extensively investigated. This paper presents stEELlm, an innovative Semantic Table Interpretation approach obtained by fine-tuning the Mixtral 8x7B model. Conducted experiments demonstrate the capabilities of our model to successfully create semantic annotations of heterogeneous datasets, a scenario where classic approaches based on heuristics tend to fail.
Articolo in rivista - Articolo scientifico
Large Language Models; Knowledge Graphs; Pre-training; Fine-tuning; Prompt Engineering; Semantic Table Interpretation
English
21-feb-2025
2025
none
Cremaschi, M., D'Adda, F., Maurino, A. (2025). stEELlm: An LLM for Generating Semantic Annotations of Tabular Data. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY [10.1145/3719206].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/554902
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact