The emergence of computer vision foundation models, inspired by the success of task-agnostic pretrained representations in Natural Language Processing (NLP), is revolutionizing the field. These models produce features that excel in downstream tasks even without fine-tuning. Last year, DINOv2 emerged, surpassing previous state-of-the-art general-purpose features on computer vision benchmarks, both at the image and pixel levels. In this work, we focus on what type of color information is embedded in DINOv2 features, and to assess their performance in computer vision tasks where color is a critical cue—for instance, recognizing the color of vehicles for traffic monitoring, detecting skin tones in biometric applications, or assessing product color attributes in fashion and e-commerce. Furthermore, we also propose a training-free feature transformation that increases color selectivity in DINOv2 features, i.e. their ability to respond differently to various colors in an image, boosting the performance on several classes of the color vision tasks considered.

Bianco, S. (2025). Enhancing color selectivity in foundation models for downstream color vision tasks. NEUROCOMPUTING, 645(7 September 2025) [10.1016/j.neucom.2025.130471].

Enhancing color selectivity in foundation models for downstream color vision tasks

Bianco S.
2025

Abstract

The emergence of computer vision foundation models, inspired by the success of task-agnostic pretrained representations in Natural Language Processing (NLP), is revolutionizing the field. These models produce features that excel in downstream tasks even without fine-tuning. Last year, DINOv2 emerged, surpassing previous state-of-the-art general-purpose features on computer vision benchmarks, both at the image and pixel levels. In this work, we focus on what type of color information is embedded in DINOv2 features, and to assess their performance in computer vision tasks where color is a critical cue—for instance, recognizing the color of vehicles for traffic monitoring, detecting skin tones in biometric applications, or assessing product color attributes in fashion and e-commerce. Furthermore, we also propose a training-free feature transformation that increases color selectivity in DINOv2 features, i.e. their ability to respond differently to various colors in an image, boosting the performance on several classes of the color vision tasks considered.
Articolo in rivista - Articolo scientifico
Color sensitivity; Color vision; DINOv2; foundation models; Vision transformer;
English
23-mag-2025
2025
645
7 September 2025
130471
open
Bianco, S. (2025). Enhancing color selectivity in foundation models for downstream color vision tasks. NEUROCOMPUTING, 645(7 September 2025) [10.1016/j.neucom.2025.130471].
File in questo prodotto:
File Dimensione Formato  
Bianco-2025-Neurocomputing-VoR.pdf

accesso aperto

Descrizione: This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/bync-nd/4.0/)
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 3.23 MB
Formato Adobe PDF
3.23 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/568706
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact