In this paper the problem of indexing heterogeneous structured documents and of retrieving semi-structured documents is considered. We propose a flexible paradigm for both indexing such documents and formulating user queries specifying soft constraints on both documents structure and content. At the indexing level we propose a model that achieves flexibility by constructing personalised document representations based on users views of the documents. This is obtained by allowing users to specify their preferences on the documents sections that they estimate to bear the most interesting information, as well as to linguistically quantify the number of sections which determine the global potential interest of the documents. At the query language level, a flexible query language for expressing soft selection conditions on both the documents structure and content is proposed.

Bordogna, G., Pasi, G. (2005). Personalized Indexing and Retrieval of Heterogeneous Structured Documents. INFORMATION RETRIEVAL, 8(2), 301-318 [10.1007/s10791-005-5664-x].

Personalized Indexing and Retrieval of Heterogeneous Structured Documents

PASI, GABRIELLA
2005

Abstract

In this paper the problem of indexing heterogeneous structured documents and of retrieving semi-structured documents is considered. We propose a flexible paradigm for both indexing such documents and formulating user queries specifying soft constraints on both documents structure and content. At the indexing level we propose a model that achieves flexibility by constructing personalised document representations based on users views of the documents. This is obtained by allowing users to specify their preferences on the documents sections that they estimate to bear the most interesting information, as well as to linguistically quantify the number of sections which determine the global potential interest of the documents. At the query language level, a flexible query language for expressing soft selection conditions on both the documents structure and content is proposed.
Articolo in rivista - Articolo scientifico
personalized, indexing, information retrieval, structured documents
English
2005
8
2
301
318
none
Bordogna, G., Pasi, G. (2005). Personalized Indexing and Retrieval of Heterogeneous Structured Documents. INFORMATION RETRIEVAL, 8(2), 301-318 [10.1007/s10791-005-5664-x].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/15012
Citazioni
  • Scopus 16
  • ???jsp.display-item.citation.isi??? 6
Social impact