In this paper, the definition of a Meta search engine model named SOFIA (SOft Fusion of Information Access) is proposed that applies a soft and flexible fusion of the ranked lists of documents retrieved by distinct search engines available over the Internet. The peculiarity of the fusion is that the final rank is not determined by a linear combination of the ranks in the lists, as generally happens in most metasearch engines. Instead, a linguistic quantifier modeled by an Induced Ordered Average (IOWA) operator that allows us to realize soft fusions in between that of the intersection and the union of the lists respectively expresses the fusion criterion. Flexibility is obtained by allowing user to specify his/her retrieval attitude that can be either recall or precision oriented, and by computing distinct fitness scores of the search engines based on a relevance feedback mechanism. These scores are used to define the IOWA reorder vector. In this way, the search engines with highest fitness determine more heavily the ranking of the documents in the fused list.
Bordogna, G., Pasi, G. (2004). A model for a SOft Fusion of Information Accesses on the web. FUZZY SETS AND SYSTEMS, 148(1), 105-118 [10.1016/j.fss.2004.03.008].
A model for a SOft Fusion of Information Accesses on the web
PASI, GABRIELLA
2004
Abstract
In this paper, the definition of a Meta search engine model named SOFIA (SOft Fusion of Information Access) is proposed that applies a soft and flexible fusion of the ranked lists of documents retrieved by distinct search engines available over the Internet. The peculiarity of the fusion is that the final rank is not determined by a linear combination of the ranks in the lists, as generally happens in most metasearch engines. Instead, a linguistic quantifier modeled by an Induced Ordered Average (IOWA) operator that allows us to realize soft fusions in between that of the intersection and the union of the lists respectively expresses the fusion criterion. Flexibility is obtained by allowing user to specify his/her retrieval attitude that can be either recall or precision oriented, and by computing distinct fitness scores of the search engines based on a relevance feedback mechanism. These scores are used to define the IOWA reorder vector. In this way, the search engines with highest fitness determine more heavily the ranking of the documents in the fused list.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.