We consider the 2-Wasserstein space of probability measures supported on the unit-circle, and propose a framework for Principal Component Analysis (PCA) for data living in such a space. We build on a detailed investigation of the optimal transportation problem for measures on the unit-circle which might be of independent interest. In particular, building on previously obtained results, we derive an expression for optimal transport maps in (almost) closed form and propose an alternative definition of the tangent space at an absolutely continuous probability measure, together with fundamental characterizations of the associated exponential and logarithmic maps. PCA is performed by mapping data on the tangent space at the Wasserstein barycentre, which we approximate via an iterative scheme, and for which we establish a sufficient a posteriori condition to assess its convergence. Our methodology is illustrated on several simulated scenarios and a real data analysis of measurements of optical nerve thickness.

Beraha, M., Pegoraro, M. (2024). Wasserstein principal component analysis for circular measures. STATISTICS AND COMPUTING, 34(5), 1-17 [10.1007/s11222-024-10473-x].

Wasserstein principal component analysis for circular measures

Beraha M.;
2024

Abstract

We consider the 2-Wasserstein space of probability measures supported on the unit-circle, and propose a framework for Principal Component Analysis (PCA) for data living in such a space. We build on a detailed investigation of the optimal transportation problem for measures on the unit-circle which might be of independent interest. In particular, building on previously obtained results, we derive an expression for optimal transport maps in (almost) closed form and propose an alternative definition of the tangent space at an absolutely continuous probability measure, together with fundamental characterizations of the associated exponential and logarithmic maps. PCA is performed by mapping data on the tangent space at the Wasserstein barycentre, which we approximate via an iterative scheme, and for which we establish a sufficient a posteriori condition to assess its convergence. Our methodology is illustrated on several simulated scenarios and a real data analysis of measurements of optical nerve thickness.
Articolo in rivista - Articolo scientifico
Directional Data; Distributional data analysis; Optimal transport; PCA; Weak Riemannian structure;
English
24-ago-2024
2024
34
5
1
17
171
open
Beraha, M., Pegoraro, M. (2024). Wasserstein principal component analysis for circular measures. STATISTICS AND COMPUTING, 34(5), 1-17 [10.1007/s11222-024-10473-x].
File in questo prodotto:
File Dimensione Formato  
Beraha-2024-Statistics and Computing-VoR.pdf

accesso aperto

Descrizione: CC BY 4.0 This article is licensed under a Creative Commons Attribution 4.0 International License To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 3.41 MB
Formato Adobe PDF
3.41 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/545392
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact