Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typically lies in lower-dimensional structures. These lower dimensional objects provide useful insights, with interpretation favored by sparse structures. Sparsity, in addition, is beneficial in terms of regularization and, thus, to avoid over-fitting. By exploiting Bayesian shrinkage priors, we devise a computationally convenient approach for high-dimensional matrix factorization. The dependence between row and column entities is modeled by inducing flexible sparse patterns within factors. The availability of external information is accounted for in such a way that structures are allowed while not imposed. Inspired by boosting algorithms, we pair the proposed approach with a numerical strategy relying on a sequential inclusion and estimation of low-rank contributions, with a data-driven stopping rule. Practical advantages of the proposed approach are demonstrated by means of a simulation study and the analysis of soccer heatmaps obtained from new generation tracking data. Supplementary materials for this article are available online.

Schiavon, L., Nipoti, B., Canale, A. (2024). Accelerated Structured Matrix Factorization. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1-11 [10.1080/10618600.2023.2301072].

Accelerated Structured Matrix Factorization

Nipoti B.;
2024

Abstract

Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typically lies in lower-dimensional structures. These lower dimensional objects provide useful insights, with interpretation favored by sparse structures. Sparsity, in addition, is beneficial in terms of regularization and, thus, to avoid over-fitting. By exploiting Bayesian shrinkage priors, we devise a computationally convenient approach for high-dimensional matrix factorization. The dependence between row and column entities is modeled by inducing flexible sparse patterns within factors. The availability of external information is accounted for in such a way that structures are allowed while not imposed. Inspired by boosting algorithms, we pair the proposed approach with a numerical strategy relying on a sequential inclusion and estimation of low-rank contributions, with a data-driven stopping rule. Practical advantages of the proposed approach are demonstrated by means of a simulation study and the analysis of soccer heatmaps obtained from new generation tracking data. Supplementary materials for this article are available online.
Articolo in rivista - Articolo scientifico
Dimensionality reduction; Heatmap; Infinite factor models; Shrinkage priors; Side information; Soccer tracking data;
English
7-feb-2024
2024
1
11
none
Schiavon, L., Nipoti, B., Canale, A. (2024). Accelerated Structured Matrix Factorization. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1-11 [10.1080/10618600.2023.2301072].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/477692
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact