Given the potential rise in the amount of user-generated content on social network, research efforts towards Information Extraction have significantly increased, giving leeway to the emergence of numerous Named Entity Recognition (NER) systems. Based on varying application scenarios and/or requirements, different NER systems use different entity classification schemas/ontologies to classify the discovered entity mentions into entity types. Indeed, comparisons and integrations among NER systems become complex. The situation is further worsened due to varying granularity levels of such ontologies used to train the NER systems. This problem has been approached in the state of the art by developing a deterministic manual mapping between concepts belonging to different ontologies. In this paper, we discuss the limitations of these methods and, inspired by a transfer learning paradigm, we propose a novel approach named LearningToAdapt (L2A) to mitigate them. L2A learns to transfer an input probability distribution over a set of ontology types defined in a source domain, into a probability distribution over the types of a new ontology in a target domain. By using the inferred probability distribution, we are able to re-classify the entity mentions using the most probable type in the target domain. Experiments conducted with benchmark data show remarkable performance, suggesting L2A as a promising approach for domain adaptation of NER systems.
Fersini, E., Manchanda, P., Messina, E., Nozza, D., Palmonari, M. (2018). Adapting named entity types to new ontologies in a microblogging environment. In Recent Trends and Future Technology in Applied Intelligence (pp.783-795). Springer Verlag [10.1007/978-3-319-92058-0_76].
Adapting named entity types to new ontologies in a microblogging environment
Fersini, E
;Manchanda, P;Messina E.;Nozza D;Palmonari, M
2018
Abstract
Given the potential rise in the amount of user-generated content on social network, research efforts towards Information Extraction have significantly increased, giving leeway to the emergence of numerous Named Entity Recognition (NER) systems. Based on varying application scenarios and/or requirements, different NER systems use different entity classification schemas/ontologies to classify the discovered entity mentions into entity types. Indeed, comparisons and integrations among NER systems become complex. The situation is further worsened due to varying granularity levels of such ontologies used to train the NER systems. This problem has been approached in the state of the art by developing a deterministic manual mapping between concepts belonging to different ontologies. In this paper, we discuss the limitations of these methods and, inspired by a transfer learning paradigm, we propose a novel approach named LearningToAdapt (L2A) to mitigate them. L2A learns to transfer an input probability distribution over a set of ontology types defined in a source domain, into a probability distribution over the types of a new ontology in a target domain. By using the inferred probability distribution, we are able to re-classify the entity mentions using the most probable type in the target domain. Experiments conducted with benchmark data show remarkable performance, suggesting L2A as a promising approach for domain adaptation of NER systems.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.