Bicocca Open Archive

In this manuscript, we review the work we undertake to build a large-scale benchmark dataset for an understudied Information Retrieval task called Semantic Query Labeling. This task is particularly relevant for search tasks that involve structured documents, such as Vertical Search, and consists of automatically recognizing the parts that compose a query and unfolding the relations between the query terms and the documents' fields. We first motivate the importance of building novel evaluation datasets for less popular Information Retrieval tasks. Then, we give an in-depth description of the procedure we followed to build our dataset.

Bassani, E., Pasi, G. (2021). On building benchmark datasets for understudied information retrieval tasks: The case of semantic query labeling. In Proceedings of the 11th Italian Information Retrieval Workshop 2021. CEUR-WS.

On building benchmark datasets for understudied information retrieval tasks: The case of semantic query labeling

Bassani E.;Pasi G.

2021

Abstract

In this manuscript, we review the work we undertake to build a large-scale benchmark dataset for an understudied Information Retrieval task called Semantic Query Labeling. This task is particularly relevant for search tasks that involve structured documents, such as Vertical Search, and consists of automatically recognizing the parts that compose a query and unfolding the relations between the query terms and the documents' fields. We first motivate the importance of building novel evaluation datasets for less popular Information Retrieval tasks. Then, we give an in-depth description of the procedure we followed to build our dataset.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Dataset; Semantic query labeling; Structured document search; Vertical search;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				11th Italian Information Retrieval Workshop 2021 -
			
	Anno del convegno
	
				2021
			
	Curatori della monografia
	
				Anelli, VW; Di Noia, T; Ferro, N; Narducci, F
			
	Titolo degli atti
	
				Proceedings of the 11th Italian Information Retrieval Workshop 2021
			
	Collana o serie
	
				CEUR WORKSHOP PROCEEDINGS
			
	Data di pubblicazione
	
				2021
			
	Numero del volume
	
				2947
			
	URL alternativo
	
				https://ceur-ws.org/Vol-2947/
			
	Fulltext
	
				none
			
	Citazione
	
				Bassani, E., Pasi, G. (2021). On building benchmark datasets for understudied information retrieval tasks: The case of semantic query labeling. In Proceedings of the 11th Italian Information Retrieval Workshop 2021. CEUR-WS.
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/557169

Citazioni

1

ND

Social impact