Ink classification in historical documents using hyperspectral imaging and machine learning methods

Buzzelli M.;
2025

Abstract

Ink identification using only spectral reflectance information poses significant challenges due to material degradation, aging, and spectral overlap between ink classes. This study explores the use of hyperspectral imaging and machine learning techniques to classify three distinct types of inks: pure metallo-gallate, carbon-containing, and non-carbon-containing inks. Six supervised classification models, including five traditional algorithms (Support Vector Machines, K-Nearest Neighbors, Linear Discriminant Analysis, Random Forest, and Partial Least Squares Discriminant Analysis) and one Deep Learning (DL)-based model, were evaluated. The methodology integrates data fusion from different imaging systems, sample extraction, ground truth creation, and a post-processing step to increase uniformity. The evaluation was performed using both mock-up samples and historical documents, achieving micro-averaged accuracy above 90% for all models. The best performance was obtained using the DL-based model (98% F1-score), followed by the Support Vector Machine model. In the case study documents, the overall performance of the traditional model was better. This study highlights the potential of hyperspectral imaging combined with machine learning for non-invasive ink identification and mapping, even under challenging conditions, contributing to the conservation and analysis of historical manuscripts.
Journal article - Scientific article
Cultural heritage; Data fusion; Historical documents; Hyperspectral imaging; Ink classification; Machine learning approach; Material identification;
Language: English
Date: 27 Feb 2025
Year: 2025
Volume: 335
Issue: 5 July 2025
Article number: 125916
Access: open
Lopez-Baldomero, A., Buzzelli, M., Moronta-Montero, F., Martinez-Domingo, M., Valero, E. (2025). Ink classification in historical documents using hyperspectral imaging and machine learning methods. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 335(5 July 2025) [10.1016/j.saa.2025.125916].
Files in this product:
López-Baldomero et al-2025-Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy-VoR.pdf

Access: open access
Attachment type: Publisher’s Version (Version of Record, VoR)
License: Creative Commons
Size: 5.25 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/588406
Citations
  • Scopus 13
  • Web of Science (ISI) 10