Bicocca Open Archive

Bianco, S. (2025). Enhancing color selectivity in foundation models for downstream color vision tasks. NEUROCOMPUTING, 645 (7 September 2025), article 130471 [10.1016/j.neucom.2025.130471].

Enhancing color selectivity in foundation models for downstream color vision tasks

Bianco S.

2025

Abstract

The emergence of computer vision foundation models, inspired by the success of task-agnostic pretrained representations in Natural Language Processing (NLP), is revolutionizing the field. These models produce features that excel in downstream tasks even without fine-tuning. DINOv2, released in 2023, surpassed the previous state-of-the-art general-purpose features on computer vision benchmarks at both the image and pixel levels. In this work, we investigate what type of color information is embedded in DINOv2 features and assess their performance in computer vision tasks where color is a critical cue, for instance recognizing the color of vehicles for traffic monitoring, detecting skin tones in biometric applications, or assessing product color attributes in fashion and e-commerce. We also propose a training-free feature transformation that increases color selectivity in DINOv2 features, i.e., their ability to respond differently to various colors in an image, which boosts performance on several classes of the color vision tasks considered.

Subtype: Journal article - Scientific article
Keywords: Color sensitivity; Color vision; DINOv2; foundation models; Vision transformer
Content language: English
Ahead-of-print date or first online publication date: 23 May 2025
Publication date: 2025
Journal: NEUROCOMPUTING
Volume number: 645
Issue: 7 September 2025
Article number: 130471
Article DOI: https://dx.doi.org/10.1016/j.neucom.2025.130471
Full text: open
Appears in types: 01 - Journal article

Files in this item:

File: Bianco-2025-Neurocomputing-VoR.pdf
Access: open access
Description: This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Attachment type: Publisher's Version (Version of Record, VoR)
License: Creative Commons
Size: 3.23 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/568706
