This article presents a longitudinal, trilingual (French, Spanish and Italian) school writing corpus. Its aim is to compare the evolution of literacy skills between these three languages in primary school. Actually, only CP (grade 1) has been entirely treated but the same pupils will be followed up to CM2 (grade 5). The exploitation of this corpus proposes a tool-based approach based on comparisons between the version of the texts produced by the pupils and a so-called ‹standardised› version. This approach makes it possible to produce a linguistic description of the achievements and problems that pupils face in their texts. In particular, this article focus on segmentation between words, highlighting both the frequency of the phenomenon and the categories of words in which it occurs. The comparative analysis reveals the presence of similar characteristics in the trilingual corpus

Farina, E., Ponton, C., Gutierrez-Caceres, R., Teruggi, L., Brissaud, C. (2021). SCOLINTER: A trilingual corpus. The example of word segmentation [SCOLINTER: Un corpus trilingue. L'exemple de la segmentation en mots]. LANGUE FRANÇAISE, 211(3), 37-50 [10.3917/LF.211.0037].

SCOLINTER: A trilingual corpus. The example of word segmentation [SCOLINTER: Un corpus trilingue. L'exemple de la segmentation en mots]

Farina, E;Teruggi, L;
2021

Abstract

This article presents a longitudinal, trilingual (French, Spanish and Italian) school writing corpus. Its aim is to compare the evolution of literacy skills between these three languages in primary school. Actually, only CP (grade 1) has been entirely treated but the same pupils will be followed up to CM2 (grade 5). The exploitation of this corpus proposes a tool-based approach based on comparisons between the version of the texts produced by the pupils and a so-called ‹standardised› version. This approach makes it possible to produce a linguistic description of the achievements and problems that pupils face in their texts. In particular, this article focus on segmentation between words, highlighting both the frequency of the phenomenon and the categories of words in which it occurs. The comparative analysis reveals the presence of similar characteristics in the trilingual corpus
Articolo in rivista - Articolo scientifico
school corpora, natural language processing, word segmentation;
French
2021
211
3
37
50
none
Farina, E., Ponton, C., Gutierrez-Caceres, R., Teruggi, L., Brissaud, C. (2021). SCOLINTER: A trilingual corpus. The example of word segmentation [SCOLINTER: Un corpus trilingue. L'exemple de la segmentation en mots]. LANGUE FRANÇAISE, 211(3), 37-50 [10.3917/LF.211.0037].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/350786
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact