Bicocca Open Archive

Image cropping aims at the selection of the relevant part of an image maximizing its aesthetic quality and composition. The part of the image that needs to be removed is highly dependent on user preferences and can be related to image aesthetics, composition, informativeness, or other criteria. Since the concept of the perfect crop does not exist, but there are several cropping possibilities, recent cropping algorithms are trained to rank a set of crop candidates based on their compositional quality. To this end, several benchmark databases have been released that provide for each image a series of human-annotated crop candidates with corresponding scores. Many of the image cropping methods rely on a single criterion to define the best crop or crops in an image. However, a single criterion misses the complexity of human opinions which can differ in personal preferences and backgrounds. Motivated by this, we formulate the cropping problem as a ranking problem of candidate crop regions using a grid anchor based approach and multiple criteria. To evaluate the goodness of a crop region, we design a cropping method by combining three efficient and lightweight neural networks specifically designed to evaluate the quality of a crop in terms of aesthetics, composition, and semantics. Our results on standard datasets show that using more criteria yields better crops than state-of-the-art approaches. This result is also confirmed by a subjective study on user preferences that involved a panel of users.

Celona, L., Ciocca, G., Napoletano, P. (2021). A grid anchor based cropping approach exploiting image aesthetics, geometric composition, and semantics. EXPERT SYSTEMS WITH APPLICATIONS, 186 [10.1016/j.eswa.2021.115852].

A grid anchor based cropping approach exploiting image aesthetics, geometric composition, and semantics

Celona, Luigi;Ciocca, Gianluigi;Napoletano, Paolo

2021

Abstract

Image cropping aims at the selection of the relevant part of an image maximizing its aesthetic quality and composition. The part of the image that needs to be removed is highly dependent on user preferences and can be related to image aesthetics, composition, informativeness, or other criteria. Since the concept of the perfect crop does not exist, but there are several cropping possibilities, recent cropping algorithms are trained to rank a set of crop candidates based on their compositional quality. To this end, several benchmark databases have been released that provide for each image a series of human-annotated crop candidates with corresponding scores. Many of the image cropping methods rely on a single criterion to define the best crop or crops in an image. However, a single criterion misses the complexity of human opinions which can differ in personal preferences and backgrounds. Motivated by this, we formulate the cropping problem as a ranking problem of candidate crop regions using a grid anchor based approach and multiple criteria. To evaluate the goodness of a crop region, we design a cropping method by combining three efficient and lightweight neural networks specifically designed to evaluate the quality of a crop in terms of aesthetics, composition, and semantics. Our results on standard datasets show that using more criteria yields better crops than state-of-the-art approaches. This result is also confirmed by a subjective study on user preferences that involved a panel of users.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Deep learning; Image aesthetics; Image composition; Image cropping; Semantic content;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				6-set-2021
			
	Data di pubblicazione
	
				2021
			
	Rivista
	
				EXPERT SYSTEMS WITH APPLICATIONS
			
	Numero del volume
	
				186
			
	Article number
	
				115852
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.eswa.2021.115852
			
	Fulltext
	
				reserved
			
	Citazione
	
				Celona, L., Ciocca, G., Napoletano, P. (2021). A grid anchor based cropping approach exploiting image aesthetics, geometric composition, and semantics. EXPERT SYSTEMS WITH APPLICATIONS, 186 [10.1016/j.eswa.2021.115852].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Celona-2021-Expert Systems with Applications-VoR.pdf Solo gestori archivio Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Tutti i diritti riservati Dimensione 3.1 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	3.1 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/325845

Citazioni

5

2

Social impact