Porrini, R., Palmonari, M., Cruz, I. (2018). Facet Annotation Using Reference Knowledge Bases. In Web Conference 2018: Proceedings of the World Wide Web Conference (WWW2018) (pp. 1215-1224). ACM [10.1145/3178876.3186020].
Facet Annotation Using Reference Knowledge Bases
Riccardo Porrini; Matteo Palmonari; I. Cruz
2018
Abstract
Faceted interfaces are omnipresent on the web to support data exploration and filtering. A facet is a triple: a domain (e.g., Book), a property (e.g., author, language), and a set of property values (e.g., {Austen, Beauvoir, Coelho, Dostoevsky, Eco, Kerouac, Suskind, ...}, {French, English, German, Italian, Portuguese, Russian, ...}). Given a property (e.g., language), selecting one or more of its values (English and Italian) returns the domain entities (of type Book) that match the given values (the books that are written in English or Italian). To implement faceted interfaces in a way that is scalable to very large datasets, it is necessary to automate facet extraction. Prior work associates a facet domain with a set of homogeneous values, but does not annotate the facet property. In this paper, we annotate the facet property with a predicate from a reference Knowledge Base (KB) so as to maximize the semantic similarity between the property and the predicate. We define semantic similarity in terms of three new metrics: specificity, coverage, and frequency. Our experimental evaluation uses the DBpedia and YAGO KBs and shows that, for the facet annotation problem, we obtain better results than a state-of-the-art approach for the annotation of web tables, as modified to annotate a set of values.

File | Size | Format
---|---|---
facet-annotation.pdf (archive administrators only) | 1.9 MB | Adobe PDF
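
The facet model described in the abstract (a domain, a property, and a set of property values, with selection returning the matching domain entities) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Facet` class, the `select` function, the dictionary layout of the entities, and the toy book records are all assumptions made for illustration. It only shows the selection semantics stated in the abstract, where picking several values of one property matches entities having any of them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Facet:
    """A facet as a triple: domain, property, and set of property values.

    Hypothetical structure for illustration; the field names are assumptions.
    """
    domain: str
    prop: str
    values: frozenset


def select(entities, facet, chosen):
    """Return the entities of the facet's domain whose property value is
    one of the chosen values (values of one property combine with OR)."""
    chosen = set(chosen)
    unknown = chosen - facet.values
    if unknown:
        raise ValueError(f"values not in facet: {sorted(unknown)}")
    return [
        e for e in entities
        if e.get("type") == facet.domain and e.get(facet.prop) in chosen
    ]


# Toy data, assumed for this example only.
books = [
    {"type": "Book", "title": "Pride and Prejudice", "author": "Austen", "language": "English"},
    {"type": "Book", "title": "The Name of the Rose", "author": "Eco", "language": "Italian"},
    {"type": "Book", "title": "The Second Sex", "author": "Beauvoir", "language": "French"},
]

language_facet = Facet(
    domain="Book",
    prop="language",
    values=frozenset({"French", "English", "German", "Italian", "Portuguese", "Russian"}),
)

# Selecting English and Italian returns the books written in either language.
matches = select(books, language_facet, {"English", "Italian"})
titles = [b["title"] for b in matches]
```

In this sketch the property name is already known. The problem the paper addresses is the step before this one: choosing a KB predicate to label the property when only the domain and the values are given.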
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.