Language embeddings are a promising approach for handling natural language expressions. Current embeddings encompass a large language corpus, and need to be retrained to deal with specific sub-domains. On the other hand, these embeddings often disregard even basic domain knowledge, making them specially fragile when handling technical, specific, knowledge domains, and requiring costly retraining. To alleviate this issue, we propose a combined approach where the embedding is seen as a model of a logical knowledge base. Through a continuous learning approach, the embedding improves its satisfaction of the knowledge base, and in turn produces better training examples by labelling previously unseen text. In this position paper we describe the general framework for this continuous learning, along with its main features.

Tenti, P., Pasi, G., Penaloza, R. (2021). Complementing language embeddings with knowledge bases for specific domains. In 3rd International Workshop on Data Meets Applied Ontologies in Explainable AI, DAO-XAI 2021. CEUR-WS.

Complementing language embeddings with knowledge bases for specific domains

Tenti P.;Pasi G.;Penaloza R.
2021

Abstract

Language embeddings are a promising approach for handling natural language expressions. Current embeddings encompass a large language corpus, and need to be retrained to deal with specific sub-domains. On the other hand, these embeddings often disregard even basic domain knowledge, making them specially fragile when handling technical, specific, knowledge domains, and requiring costly retraining. To alleviate this issue, we propose a combined approach where the embedding is seen as a model of a logical knowledge base. Through a continuous learning approach, the embedding improves its satisfaction of the knowledge base, and in turn produces better training examples by labelling previously unseen text. In this position paper we describe the general framework for this continuous learning, along with its main features.
paper
Knowledge Bases; Language embedding; Natural Language Understanding; Neuro-Symbolic Learning
English
3rd International Workshop on Data Meets Applied Ontologies in Explainable AI, DAO-XAI 2021 - 18 September 2021through 19 September 2021
2021
3rd International Workshop on Data Meets Applied Ontologies in Explainable AI, DAO-XAI 2021
2021
2998
none
Tenti, P., Pasi, G., Penaloza, R. (2021). Complementing language embeddings with knowledge bases for specific domains. In 3rd International Workshop on Data Meets Applied Ontologies in Explainable AI, DAO-XAI 2021. CEUR-WS.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/414317
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact