Data correlation is an old great problem in multivariate analysis. In this paper a new correlation index, called K, is proposed to evaluate the correlation content into the data. Their mathematical properties are simple and their behavior is tested on some theoretical cases and compared with other correlation indices on 31 real data sets. From the proposed K correlation index, two functions are derived with the aim to estimate the significant number of principal components to retain in Principal Component Analysis. An extensive comparison with several other methods is also performed on real data sets. The obtained results show that the two functions give a number of significant principal components which can be interpreted as the maximum theoretical number and the safest number, respectively

Todeschini, R. (1997). Data correlation, number of significant principal components and shape of molecules. The K correlation index. ANALYTICA CHIMICA ACTA, 348(1-3), 419-430 [10.1016/S0003-2670(97)00290-0].

Data correlation, number of significant principal components and shape of molecules. The K correlation index

Todeschini, R.
1997

Abstract

Data correlation is an old great problem in multivariate analysis. In this paper a new correlation index, called K, is proposed to evaluate the correlation content into the data. Their mathematical properties are simple and their behavior is tested on some theoretical cases and compared with other correlation indices on 31 real data sets. From the proposed K correlation index, two functions are derived with the aim to estimate the significant number of principal components to retain in Principal Component Analysis. An extensive comparison with several other methods is also performed on real data sets. The obtained results show that the two functions give a number of significant principal components which can be interpreted as the maximum theoretical number and the safest number, respectively
Articolo in rivista - Articolo scientifico
multivariate correlation, molecular shape, principal components
English
1997
348
1-3
419
430
none
Todeschini, R. (1997). Data correlation, number of significant principal components and shape of molecules. The K correlation index. ANALYTICA CHIMICA ACTA, 348(1-3), 419-430 [10.1016/S0003-2670(97)00290-0].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/10202
Citazioni
  • Scopus 59
  • ???jsp.display-item.citation.isi??? 53
Social impact