Bicocca Open Archive

A comprehensive literature on the Traveling Salesman Problem (TSP) is available, and this problem has become a valuable benchmark to test new heuristic methods for general Combinatorial Optimisation problems. For this reason, recently developed Deep Learning-driven heuristics have been tried on the TSP. These Deep Learning frameworks use the city coordinates as inputs, and are trained using reinforcement learning to predict a distribution over the TSP feasible solutions. The aim of the present work is to show how easy-to-calculate Combinatorial Optimization concepts can improve the performances of such systems. In particular, we show how passing Minimum Spanning Tree information during training can lead to significant improvements to the quality of TSP solutions. As a side result, we also propose a Deep Learning architecture able to predict in real time the optimal length of a TSP instance. The proposed architectures have been tested on random 2D Euclidean graphs with 50 and 100 nodes, showing significant results.

Mele, U., Chou, X., Gambardella, L., Montemanni, R. (2021). Reinforcement Learning and Additional Rewards for the Traveling Salesman Problem. In ACM International Conference Proceeding Series (pp.198-204). Association for Computing Machinery [10.1145/3463858.3463885].

Reinforcement Learning and Additional Rewards for the Traveling Salesman Problem

Mele, UJ;Chou, X;Gambardella, LM;Montemanni, R

2021

Abstract

A comprehensive literature on the Traveling Salesman Problem (TSP) is available, and this problem has become a valuable benchmark to test new heuristic methods for general Combinatorial Optimisation problems. For this reason, recently developed Deep Learning-driven heuristics have been tried on the TSP. These Deep Learning frameworks use the city coordinates as inputs, and are trained using reinforcement learning to predict a distribution over the TSP feasible solutions. The aim of the present work is to show how easy-to-calculate Combinatorial Optimization concepts can improve the performances of such systems. In particular, we show how passing Minimum Spanning Tree information during training can lead to significant improvements to the quality of TSP solutions. As a side result, we also propose a Deep Learning architecture able to predict in real time the optimal length of a TSP instance. The proposed architectures have been tested on random 2D Euclidean graphs with 50 and 100 nodes, showing significant results.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Combinatorial Optimization; Deep Learning; Minimum Spanning Tree Problem; Reinforcement Learning; Traveling Salesman Problem;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				8th International Conference on Industrial Engineering and Applications, ICIEA 2021-Europe - 8 January 2021 through 11 January 2021
			
	Anno del convegno
	
				2021
			
	Titolo degli atti
	
				ACM International Conference Proceeding Series
			
	ISBN del volume degli atti
	
				9781450389921
			
	Data di pubblicazione
	
				2021
			
	Pagina iniziale
	
				198
			
	Pagina finale
	
				204
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1145/3463858.3463885
			
	Fulltext
	
				none
			
	Citazione
	
				Mele, U., Chou, X., Gambardella, L., Montemanni, R. (2021). Reinforcement Learning and Additional Rewards for the Traveling Salesman Problem. In ACM International Conference Proceeding Series (pp.198-204). Association for Computing Machinery [10.1145/3463858.3463885].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/467067

Citazioni

5

2

Social impact