Personalization in Information Retrieval (IR) is a topic studied by the research community since a long time. Nevertheless, the availability of high-quality, real-world datasets for large-scale experiments and model evaluation remains limited. This paper helps to fill this gap by introducing SE-PQA (StackExchange - Personalized Question Answering), a new curated dataset designed for the development and evaluation of personalized models in the domain of community Question Answering (cQA). SE-PQA encompasses over one million queries and two million answers, annotated with a rich set of features that capture the social interactions among users on a cQA platform. We provide reproducible baseline methods for the cQA task based on the resource, including deep learning and personalized approaches. The results of the preliminary experiments conducted show the appropriateness of SE-PQA to train effective cQA models; they also show that personalization remarkably improves the effectiveness of all the methods tested.

Kasela, P., Braga, M., Pasi, G., Perego, R. (2024). SE-PQA: StackExchange Personalized Community Question Answering. In Proceedings of the 14th Italian Information Retrieval Workshop (pp.99-102). CEUR-WS.

SE-PQA: StackExchange Personalized Community Question Answering

Kasela P.;Braga M.;Pasi G.;
2024

Abstract

Personalization in Information Retrieval (IR) is a topic studied by the research community since a long time. Nevertheless, the availability of high-quality, real-world datasets for large-scale experiments and model evaluation remains limited. This paper helps to fill this gap by introducing SE-PQA (StackExchange - Personalized Question Answering), a new curated dataset designed for the development and evaluation of personalized models in the domain of community Question Answering (cQA). SE-PQA encompasses over one million queries and two million answers, annotated with a rich set of features that capture the social interactions among users on a cQA platform. We provide reproducible baseline methods for the cQA task based on the resource, including deep learning and personalized approaches. The results of the preliminary experiments conducted show the appropriateness of SE-PQA to train effective cQA models; they also show that personalization remarkably improves the effectiveness of all the methods tested.
paper
Personalization; Question Answering; Resource; User Model;
English
14th Italian Information Retrieval Workshop - September 5-6, 2024
2024
Roitero, K; Viviani, M; Maddalena, E; Mizzaro, S
Proceedings of the 14th Italian Information Retrieval Workshop
2024
3802
99
102
https://ceur-ws.org/Vol-3802/
none
Kasela, P., Braga, M., Pasi, G., Perego, R. (2024). SE-PQA: StackExchange Personalized Community Question Answering. In Proceedings of the 14th Italian Information Retrieval Workshop (pp.99-102). CEUR-WS.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/557163
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact