Bicocca Open Archive

In this paper a new graph-based model is proposed for the representation of textual documents. Graph-structures are obtained from textual documents by making use of the well-known Part-Of-Speech (POS) tagging technique. More specifically, a simple rule-based (re)classifier is used to map each tag onto graph vertices and edges. As a result, a decomposition of textual documents is obtained where tokens are automatically parsed and attached to either a vertex or an edge. It is shown how textual documents can be aggregated through their graph-structures and finally, it is shown how vertex-ranking methods can be used to find relevant tokens. © 2013. The authors - Published by Atlantis Press.
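The pipeline summarized in the abstract (POS tags mapped onto vertices and edges, graphs aggregated across documents, vertices ranked) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the rule sets `VERTEX_TAGS` and `EDGE_TAGS`, the helper names `build_graph`, `merge_graphs` and `rank_vertices`, the hand-tagged input (Penn Treebank style tags), and the use of plain degree counting as a stand-in for a vertex-ranking method.

```python
from collections import defaultdict

# Hypothetical rules: which POS tags map to vertices and which to edges.
# Illustrative assumptions only, not the (re)classifier from the paper.
VERTEX_TAGS = {"NN", "NNS", "NNP", "NNPS"}        # nouns become vertices
EDGE_TAGS = {"VB", "VBD", "VBZ", "VBP", "IN"}     # verbs/prepositions become edges


def build_graph(tagged_tokens):
    """Turn (token, tag) pairs into {(source, target): [edge tokens]}.

    Edge tokens link the nearest vertex token before them to the nearest
    vertex token after them; tokens matching neither rule set are ignored.
    """
    edges = defaultdict(list)
    last_vertex = None
    pending = []  # edge tokens seen since the last vertex token
    for token, tag in tagged_tokens:
        word = token.lower()
        if tag in VERTEX_TAGS:
            if last_vertex is not None and pending:
                edges[(last_vertex, word)].extend(pending)
            last_vertex, pending = word, []
        elif tag in EDGE_TAGS:
            pending.append(word)
    return dict(edges)


def merge_graphs(graphs):
    """Aggregate document graphs by taking the union of their labelled edges."""
    merged = defaultdict(list)
    for graph in graphs:
        for pair, labels in graph.items():
            merged[pair].extend(labels)
    return dict(merged)


def rank_vertices(graph):
    """Rank vertices by the number of edge tokens attached to them (degree)."""
    degree = defaultdict(int)
    for (source, target), labels in graph.items():
        degree[source] += len(labels)
        degree[target] += len(labels)
    # Highest degree first; ties broken alphabetically for a stable order.
    return sorted(degree.items(), key=lambda item: (-item[1], item[0]))


# Two tiny documents, tagged by hand for the sake of the example.
doc1 = [("The", "DT"), ("cat", "NN"), ("chased", "VBD"), ("the", "DT"), ("mouse", "NN")]
doc2 = [("The", "DT"), ("cat", "NN"), ("sat", "VBD"), ("on", "IN"), ("the", "DT"), ("mat", "NN")]

merged = merge_graphs([build_graph(doc1), build_graph(doc2)])
print(rank_vertices(merged))
```

In this toy setup the token shared by both documents accumulates the most attached edge tokens and therefore ranks first. A real system would obtain the tags from a POS tagger and would likely use a more refined ranking method than degree.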

Bronselaer, A., Pasi, G. (2013). An approach to graph-based analysis of textual documents. In 8th Conference of the European Society for Fuzzy Logic and Technology, EUSFLAT 2013 - Advances in Intelligent Systems Research (pp.634-641). Atlantis Press [10.2991/eusflat.2013.96].

An approach to graph-based analysis of textual documents

Bronselaer, A.; Pasi, Gabriella

2013



Contribution type	Paper
Keywords	Graph model; Multi Document Summarization; Text Analysis; Computational Theory and Mathematics; Information Systems
Content language	English
Conference name	8th Conference of the European Society for Fuzzy Logic and Technology, EUSFLAT 2013, 11-13 September 2013
Conference year	2013
Proceedings title	8th Conference of the European Society for Fuzzy Logic and Technology, EUSFLAT 2013 - Advances in Intelligent Systems Research
ISBN of the proceedings volume	9781629932194
Publication date	2013
Volume number	32
First page	634
Last page	641
DOI of the contribution	https://dx.doi.org/10.2991/eusflat.2013.96
Full text	open
Appears in types	02 - Conference contribution

Files in this item:

File	Size	Format
8458.pdf (open access; attachment type: Publisher’s Version (Version of Record, VoR); license: Creative Commons)	1.51 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/58512
