We construct confidence regions in high dimensions by inverting the globaltest statistics, and use them to choose the tuning parameter for penalized regression. The selected model corresponds to the point in the confidence region of the parameters that minimizes the penalty, making it the least complex model that still has acceptable fit according to the test that defines the confidence region. As the globaltest is particularly powerful in the presence of many weak predictors, it connects well to ridge regression, and we thus focus on ridge penalties in this paper. The confidence region method is quick to calculate, intuitive, and gives decent predictive potential. As a tuning parameter selection method it may even outperform classical methods such as cross-validation in terms of mean squared error of prediction, especially when the signal is weak. We illustrate the method for linear models in simulation study and for Cox models in real gene expression data of breast cancer samples.

Xu, N., Solari, A., Goeman, J. (2021). Globaltest confidence regions and their application to ridge regression. BIOMETRICAL JOURNAL, 63(7 (October 2021)), 1351-1365 [10.1002/bimj.202000063].

Globaltest confidence regions and their application to ridge regression

Solari A.;
2021

Abstract

We construct confidence regions in high dimensions by inverting the globaltest statistics, and use them to choose the tuning parameter for penalized regression. The selected model corresponds to the point in the confidence region of the parameters that minimizes the penalty, making it the least complex model that still has acceptable fit according to the test that defines the confidence region. As the globaltest is particularly powerful in the presence of many weak predictors, it connects well to ridge regression, and we thus focus on ridge penalties in this paper. The confidence region method is quick to calculate, intuitive, and gives decent predictive potential. As a tuning parameter selection method it may even outperform classical methods such as cross-validation in terms of mean squared error of prediction, especially when the signal is weak. We illustrate the method for linear models in simulation study and for Cox models in real gene expression data of breast cancer samples.
Articolo in rivista - Articolo scientifico
confidence regions; high dimensional; tuning parameter selection;
English
27-mag-2021
2021
63
7 (October 2021)
1351
1365
none
Xu, N., Solari, A., Goeman, J. (2021). Globaltest confidence regions and their application to ridge regression. BIOMETRICAL JOURNAL, 63(7 (October 2021)), 1351-1365 [10.1002/bimj.202000063].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/330357
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact