Bicocca Open Archive

The growing demand for machine learning tools to solve hard tasks, from natural language processing to image understanding, recently shifted the attention to understand and possibly to explain the behaviour of deep learning. Deep neural networks represent today the state-of-the-art in many applications that have been shown to be solved by data-driven approaches. However, they are also well known for their complexity, which hinders the interpretation of their functioning. To address this issue, researchers have lately focused either on understanding the optimization algorithms or on extracting information from a trained model; in this context we propose the Activation Pattern Diagram (APD) as a new tool to analyse neural networks by mainly focusing on the input data. The APD is a graphical representation of how a dataset is learned by a neural network with piecewise linear activation functions, such as the ReLU activation. By analysing the evolution of the diagram during the training procedure, the APD sheds light on the learning process and how data influences it. Additionally, we introduce a way to plot the APD to help the visualization and interpretation of the diagram.

Craighero, F., Angaroni, F., Graudenzi, A., Stella, F., Antoniotti, M. (2020). Understanding deep learning with activation pattern diagrams. In CEUR Workshop Proceedings (pp.119-126). CEUR-WS.

Understanding deep learning with activation pattern diagrams

Craighero F.;Angaroni F.;Graudenzi A.;Stella F.;Antoniotti M.

2020

Abstract

The growing demand for machine learning tools to solve hard tasks, from natural language processing to image understanding, recently shifted the attention to understand and possibly to explain the behaviour of deep learning. Deep neural networks represent today the state-of-the-art in many applications that have been shown to be solved by data-driven approaches. However, they are also well known for their complexity, which hinders the interpretation of their functioning. To address this issue, researchers have lately focused either on understanding the optimization algorithms or on extracting information from a trained model; in this context we propose the Activation Pattern Diagram (APD) as a new tool to analyse neural networks by mainly focusing on the input data. The APD is a graphical representation of how a dataset is learned by a neural network with piecewise linear activation functions, such as the ReLU activation. By analysing the evolution of the diagram during the training procedure, the APD sheds light on the learning process and how data influences it. Additionally, we introduce a way to plot the APD to help the visualization and interpretation of the diagram.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Activation Patterns; Neural Networks; Piecewise Linear Functions; Visualization;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				2020 Italian Workshop on Explainable Artificial Intelligence, XAI.it 2020
			
	Anno del convegno
	
				2020
			
	Titolo degli atti
	
				CEUR Workshop Proceedings
			
	Collana o serie
	
				CEUR WORKSHOP PROCEEDINGS
			
	Data di pubblicazione
	
				2020
			
	Numero del volume
	
				2742
			
	Pagina iniziale
	
				119
			
	Pagina finale
	
				126
			
	Fulltext
	
				none
			
	Citazione
	
				Craighero, F., Angaroni, F., Graudenzi, A., Stella, F., Antoniotti, M. (2020). Understanding deep learning with activation pattern diagrams. In CEUR Workshop Proceedings (pp.119-126). CEUR-WS.
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/302793

Citazioni

0

ND

Social impact