Bicocca Open Archive

Persistence probability is a statistical index for detecting one or more communities embedded in a network. However, despite its straightforward definition (the probability that a random walk remains in a group of nodes after a time period), it is seldom used, possibly because of the difficulty of developing an efficient algorithm to calculate it. Proposed here is a new mathematical programming model to find the community with the largest persistence probability. The model is formulated as integer fractional programming and then is reduced to mixed-integer linear programming with appropriate variable substitutions. Nevertheless, the exact problem is solved in a reasonable time only for small networks, so heuristic procedures are developed to approximate the optimal solution for large networks. First, a randomized greedy-ascent method is elaborated, taking advantage of a particular data structure to generate feasible solutions quickly. After analyzing the greedy output and determining where the optimal solution is eventually located, improvement procedures are implemented based on a local exchange but applying different long-term diversification principles, and based on tree neighborhood search and random restart. When applied to simulated graphs to determine the reliability and effectiveness of the proposed method, it accurately reproduces the clustering characteristics found in real networks. Finally, it is applied to two real networks, and comparing the results to those from two well-known community-detection procedures confirms that this index complements the other methods well.

Avellone, A., Benati, S., Grassi, R., Rizzini, G. (2024). On finding the community with maximum persistence probability. 4OR, 22(4), 435-463 [10.1007/s10288-023-00559-z].

On finding the community with maximum persistence probability

Avellone, A;Benati, S;Grassi, R;Rizzini, G

2024

Abstract

Persistence probability is a statistical index for detecting one or more communities embedded in a network. However, despite its straightforward definition (the probability that a random walk remains in a group of nodes after a time period), it is seldom used, possibly because of the difficulty of developing an efficient algorithm to calculate it. Proposed here is a new mathematical programming model to find the community with the largest persistence probability. The model is formulated as integer fractional programming and then is reduced to mixed-integer linear programming with appropriate variable substitutions. Nevertheless, the exact problem is solved in a reasonable time only for small networks, so heuristic procedures are developed to approximate the optimal solution for large networks. First, a randomized greedy-ascent method is elaborated, taking advantage of a particular data structure to generate feasible solutions quickly. After analyzing the greedy output and determining where the optimal solution is eventually located, improvement procedures are implemented based on a local exchange but applying different long-term diversification principles, and based on tree neighborhood search and random restart. When applied to simulated graphs to determine the reliability and effectiveness of the proposed method, it accurately reproduces the clustering characteristics found in real networks. Finally, it is applied to two real networks, and comparing the results to those from two well-known community-detection procedures confirms that this index complements the other methods well.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				05C85; 05C90; 90C11; 90C35; 90C59; Community index; Heuristics; Integer fractional programming; Persistence probability;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				26-dic-2023
			
	Data di pubblicazione
	
				2024
			
	Rivista
	
				4OR
			
	Numero del volume
	
				22
			
	Fascicolo
	
				4
			
	Pagina iniziale
	
				435
			
	Pagina finale
	
				463
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1007/s10288-023-00559-z
			
	Fulltext
	
				none
			
	Citazione
	
				Avellone, A., Benati, S., Grassi, R., Rizzini, G. (2024). On finding the community with maximum persistence probability. 4OR, 22(4), 435-463 [10.1007/s10288-023-00559-z].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/455889

Citazioni

1

1

Social impact