Cross-Lingual Mapping (CLM) establishes semantic relations between source and target concepts to align two re- sources lexicalized in different languages, e.g., ontologies, thesauri, or concept inventories, or to enrich a multilingual resource. In this paper, we focus on purely lexical matching algorithms to support CLM between lexically-rich resources, where concepts can be identified by synsets. The key idea of these algorithms is to use the results of word translations as evidence to map synsets lexicalized in different languages. We propose a new cross-lingual similarity measure inspired by a classification-based mapping semantics. Then we ap- ply a novel local similarity optimization method to select the best matches for each source synset. To evaluate our approach we use wordnets in four different languages, which have been manually mapped to the English WordNet. Re- sults show that despite our method uses only lexical information about the concepts, it obtains good performance and significantly outperforms several baseline methods.
ABU HELOU, M., Palmonari, M. (2015). Cross-lingual lexical matching with word translation and local similarity optimization. In Proceedings of the 11th International Conference on Semantic Systems (SEMANTICS '15) (pp.97-104). Association for Computing Machinery [10.1145/2814864.2814888].
Cross-lingual lexical matching with word translation and local similarity optimization
ABU HELOU, MAMOUNPrimo
;PALMONARI, MATTEO LUIGIUltimo
2015
Abstract
Cross-Lingual Mapping (CLM) establishes semantic relations between source and target concepts to align two re- sources lexicalized in different languages, e.g., ontologies, thesauri, or concept inventories, or to enrich a multilingual resource. In this paper, we focus on purely lexical matching algorithms to support CLM between lexically-rich resources, where concepts can be identified by synsets. The key idea of these algorithms is to use the results of word translations as evidence to map synsets lexicalized in different languages. We propose a new cross-lingual similarity measure inspired by a classification-based mapping semantics. Then we ap- ply a novel local similarity optimization method to select the best matches for each source synset. To evaluate our approach we use wordnets in four different languages, which have been manually mapped to the English WordNet. Re- sults show that despite our method uses only lexical information about the concepts, it obtains good performance and significantly outperforms several baseline methods.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.