The growing demand for machine learning tools to solve hard tasks, from natural language processing to image understanding, recently shifted the attention to understand and possibly to explain the behaviour of deep learning. Deep neural networks represent today the state-of-the-art in many applications that have been shown to be solved by data-driven approaches. However, they are also well known for their complexity, which hinders the interpretation of their functioning. To address this issue, researchers have lately focused either on understanding the optimization algorithms or on extracting information from a trained model; in this context we propose the Activation Pattern Diagram (APD) as a new tool to analyse neural networks by mainly focusing on the input data. The APD is a graphical representation of how a dataset is learned by a neural network with piecewise linear activation functions, such as the ReLU activation. By analysing the evolution of the diagram during the training procedure, the APD sheds light on the learning process and how data influences it. Additionally, we introduce a way to plot the APD to help the visualization and interpretation of the diagram.

Craighero, F., Angaroni, F., Graudenzi, A., Stella, F., & Antoniotti, M. (2020). Understanding deep learning with activation pattern diagrams. In CEUR Workshop Proceedings (pp.119-126). CEUR-WS.

Understanding deep learning with activation pattern diagrams

Craighero F.;Angaroni F.;Graudenzi A.;Stella F.;Antoniotti M.
2020

Abstract

The growing demand for machine learning tools to solve hard tasks, from natural language processing to image understanding, recently shifted the attention to understand and possibly to explain the behaviour of deep learning. Deep neural networks represent today the state-of-the-art in many applications that have been shown to be solved by data-driven approaches. However, they are also well known for their complexity, which hinders the interpretation of their functioning. To address this issue, researchers have lately focused either on understanding the optimization algorithms or on extracting information from a trained model; in this context we propose the Activation Pattern Diagram (APD) as a new tool to analyse neural networks by mainly focusing on the input data. The APD is a graphical representation of how a dataset is learned by a neural network with piecewise linear activation functions, such as the ReLU activation. By analysing the evolution of the diagram during the training procedure, the APD sheds light on the learning process and how data influences it. Additionally, we introduce a way to plot the APD to help the visualization and interpretation of the diagram.
No
paper
Scientifica
Activation Patterns; Neural Networks; Piecewise Linear Functions; Visualization
English
2020 Italian Workshop on Explainable Artificial Intelligence, XAI.it 2020
Craighero, F., Angaroni, F., Graudenzi, A., Stella, F., & Antoniotti, M. (2020). Understanding deep learning with activation pattern diagrams. In CEUR Workshop Proceedings (pp.119-126). CEUR-WS.
Craighero, F; Angaroni, F; Graudenzi, A; Stella, F; Antoniotti, M
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/10281/302793
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
Social impact