Similarity has always been a key aspect in computer science and statistics. Any time two element vectors are compared, many different similarity approaches can be used, depending on the final goal of the comparison (Euclidean distance, Pearson correlation coefficient, Spearman’s rank correlation coefficient, and others). But if the comparison has to be applied to more complex data samples, with features having different dimensionality and types which might need compression before processing, these measures would be unsuitable. In these cases, a siamese neural network may be the best choice: it consists of two identical artificial neural networks each capable of learning the hidden representation of an input vector. The two neural networks are both feedforward perceptrons, and employ error back-propagation during training; they work parallelly in tandem and compare their outputs at the end, usually through a cosine distance. The output generated by a siamese neural network execution can be considered the semantic similarity between the projected representation of the two input vectors. In this overview we first describe the siamese neural network architecture, and then we outline its main applications in a number of computational fields since its appearance in 1994. Additionally, we list the programming languages, software packages, tutorials, and guides that can be practically used by readers to implement this powerful machine learning model.

Chicco, D. (2021). Siamese Neural Networks: An Overview. In H. Cartwright (a cura di), Artificial Neural Networks (pp. 73-94). Humana Press Inc. [10.1007/978-1-0716-0826-5_3].

Siamese Neural Networks: An Overview

Chicco, D
Primo
2021

Abstract

Similarity has always been a key aspect in computer science and statistics. Any time two element vectors are compared, many different similarity approaches can be used, depending on the final goal of the comparison (Euclidean distance, Pearson correlation coefficient, Spearman’s rank correlation coefficient, and others). But if the comparison has to be applied to more complex data samples, with features having different dimensionality and types which might need compression before processing, these measures would be unsuitable. In these cases, a siamese neural network may be the best choice: it consists of two identical artificial neural networks each capable of learning the hidden representation of an input vector. The two neural networks are both feedforward perceptrons, and employ error back-propagation during training; they work parallelly in tandem and compare their outputs at the end, usually through a cosine distance. The output generated by a siamese neural network execution can be considered the semantic similarity between the projected representation of the two input vectors. In this overview we first describe the siamese neural network architecture, and then we outline its main applications in a number of computational fields since its appearance in 1994. Additionally, we list the programming languages, software packages, tutorials, and guides that can be practically used by readers to implement this powerful machine learning model.
Capitolo o saggio
Artificial neural networks; Deep learning; Neural networks; Overview; Review; Semantic similarity; Siamese networks; Siamese neural networks; Survey;
English
Artificial Neural Networks
Cartwright, H
18-ago-2020
2021
9781071608258
2190
Humana Press Inc.
73
94
Chicco, D. (2021). Siamese Neural Networks: An Overview. In H. Cartwright (a cura di), Artificial Neural Networks (pp. 73-94). Humana Press Inc. [10.1007/978-1-0716-0826-5_3].
reserved
File in questo prodotto:
File Dimensione Formato  
Chicco-2021-Artificial Neural Networks-VoR.pdf

Solo gestori archivio

Descrizione: Contributo in libro
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Tutti i diritti riservati
Dimensione 541.14 kB
Formato Adobe PDF
541.14 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/433779
Citazioni
  • Scopus 326
  • ???jsp.display-item.citation.isi??? 253
Social impact