Recommender systems are emerging as an interesting application scenario for Linked Data (LD). In fact, by exploiting the knowledge encoded in LD datasets, a new generation of semantics-aware recommendation engines have been developed in the last years. As Linked Data is often very rich and contains many information that may result irrelevant and noisy for a recommendation task, an initial step of feature selection is always required in order to select the most meaningful portion of the original dataset. Many approaches have been proposed in the literature for feature selection that exploit different statistical dimensions of the original data. In this paper we investigate the role of the semantics encoded in an ontological hierarchy when exploited to select the most relevant properties for a recommendation task. In particular, we compare an approach based on schema summarization with a \classical" one, i.e., Information Gain. We evaluated the performance of the two methods in terms of accuracy and aggregate diversity by setting up an experimental testbed relying on the Movielens dataset.

Ragone, A., Tomeo, P., Magarelli, C., Di Noia, T., Palmonari, M., Maurino, A., et al. (2017). Schema-summarization in Linked-Data-based feature selection for recommender systems. In Proceedings of the ACM Symposium on Applied Computing (pp.330-335). Association for Computing Machinery [10.1145/3019612.3019837].

Schema-summarization in Linked-Data-based feature selection for recommender systems

Ragone, Azzurra;Magarelli, Corrado;Palmonari, Matteo;Maurino, Andrea;
2017

Abstract

Recommender systems are emerging as an interesting application scenario for Linked Data (LD). In fact, by exploiting the knowledge encoded in LD datasets, a new generation of semantics-aware recommendation engines have been developed in the last years. As Linked Data is often very rich and contains many information that may result irrelevant and noisy for a recommendation task, an initial step of feature selection is always required in order to select the most meaningful portion of the original dataset. Many approaches have been proposed in the literature for feature selection that exploit different statistical dimensions of the original data. In this paper we investigate the role of the semantics encoded in an ontological hierarchy when exploited to select the most relevant properties for a recommendation task. In particular, we compare an approach based on schema summarization with a \classical" one, i.e., Information Gain. We evaluated the performance of the two methods in terms of accuracy and aggregate diversity by setting up an experimental testbed relying on the Movielens dataset.
paper
Information gain; Linked Data; Ontology summarization; Data profiling
English
32nd Annual ACM Symposium on Applied Computing, SAC 2017
2017
Proceedings of the ACM Symposium on Applied Computing
9781450344869
2017
128005
330
335
128005
reserved
Ragone, A., Tomeo, P., Magarelli, C., Di Noia, T., Palmonari, M., Maurino, A., et al. (2017). Schema-summarization in Linked-Data-based feature selection for recommender systems. In Proceedings of the ACM Symposium on Applied Computing (pp.330-335). Association for Computing Machinery [10.1145/3019612.3019837].
File in questo prodotto:
File Dimensione Formato  
Schema-summarization in Linked-Data-based feature selection for recommender systems.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 361.5 kB
Formato Adobe PDF
361.5 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/195340
Citazioni
  • Scopus 22
  • ???jsp.display-item.citation.isi??? ND
Social impact