The amount of scholarly data available on the web is steadily increasing, enabling different types of analytics which can provide important insights into research activity. To make sense of and explore this large-scale body of knowledge, we need an accurate, comprehensive and up-to-date ontology of research topics. Unfortunately, human-crafted classifications do not satisfy these criteria, as they evolve too slowly and tend to be too coarse-grained. Current automated methods for generating ontologies of research areas also present a number of limitations: i) they do not consider the wealth of indirect statistical and semantic relationships which can help to understand the relation between two topics – e.g., the fact that two research areas are associated with a similar set of venues or technologies; ii) they do not distinguish between different kinds of hierarchical relationships; and iii) they cannot effectively handle ambiguous topics characterized by a noisy set of relationships. In this paper, we present Klink-2, a novel approach which improves on our earlier work on automatic generation of semantic topic networks and addresses the aforementioned limitations by taking advantage of a variety of knowledge sources available on the web. In particular, Klink-2 analyses networks of research entities (including papers, authors, venues, and technologies) to infer three kinds of semantic relationships between topics. It also identifies ambiguous keywords (e.g., “ontology”) and separates them into the appropriate distinct topics – e.g., “ontology/philosophy” vs. “ontology/semantic web”. Our experimental evaluation shows that the ability of Klink-2 to integrate a large number of data sources and to generate topics with accurate contextual meaning yields significant improvements over other algorithms in terms of both precision and recall.

Osborne, F., Motta, E. (2015). Klink-2: Integrating multiple web sources to generate semantic topic networks. In The Semantic Web - ISWC 2015. ISWC 2015 (pp.408-424). Springer Verlag [10.1007/978-3-319-25007-6_24].

Klink-2: Integrating multiple web sources to generate semantic topic networks

Osborne, F.; Motta, E.
2015

Type: Paper
Keywords: Bibliographic data; Data mining; Ontology learning; Scholarly data; Scholarly ontologies
Language: English
Conference: 14th International Semantic Web Conference, ISWC 2015, 11 October 2015 through 15 October 2015
Editors: Arenas, M; Corcho, O; Simperl, E; Strohmaier, M; d’Aquin, M; Srinivas, K; Groth, P; Dumontier, M; Heflin, J; Thirunarayan, K; Staab, S
Book title: The Semantic Web - ISWC 2015. ISWC 2015
ISBN: 978-3-319-25006-9
Year: 2015
Volume: 9366
Pages: 408-424
URL: https://link.springer.com/chapter/10.1007/978-3-319-25007-6_24

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/381549