A robust estimator for a wide family of mixtures of linear regression is presented. Robustness is based on the joint adoption of the cluster weighted model and of an estimator based on trimming and restrictions. The selected model provides the conditional distribution of the response for each group, as in mixtures of regression, and further supplies local distributions for the explanatory variables. A novel version of the restrictions has been devised, under this model, for separately controlling the two sources of variability identified in it. This proposal avoids singularities in the log-likelihood, caused by approximate local collinearity in the explanatory variables or local exact fits in regressions, and reduces the occurrence of spurious local maximizers. In a natural way, due to the interaction between the model and the estimator, the procedure is able to resist the harmful influence of bad leverage points along the estimation of the mixture of regressions, which is still an open issue in the literature. The given methodology defines a well-posed statistical problem, whose estimator exists and is consistent to the corresponding solution of the population optimum, under widely general conditions. A feasible EM algorithm has also been provided to obtain the corresponding estimation. Many simulated examples and two real datasets have been chosen to show the ability of the procedure, on the one hand, to detect anomalous data, and, on the other hand, to identify the real cluster regressions without the influence of contamination.

García Escudero, L., Gordaliza, A., Greselin, F., Ingrassia, S., Mayo Iscar, A. (2017). Robust estimation of mixtures of regressions with random covariates, via trimming and constraints. STATISTICS AND COMPUTING, 27(2), 377-402 [10.1007/s11222-016-9628-3].

Robust estimation of mixtures of regressions with random covariates, via trimming and constraints

GRESELIN, FRANCESCA
;
2017

Abstract

A robust estimator for a wide family of mixtures of linear regression is presented. Robustness is based on the joint adoption of the cluster weighted model and of an estimator based on trimming and restrictions. The selected model provides the conditional distribution of the response for each group, as in mixtures of regression, and further supplies local distributions for the explanatory variables. A novel version of the restrictions has been devised, under this model, for separately controlling the two sources of variability identified in it. This proposal avoids singularities in the log-likelihood, caused by approximate local collinearity in the explanatory variables or local exact fits in regressions, and reduces the occurrence of spurious local maximizers. In a natural way, due to the interaction between the model and the estimator, the procedure is able to resist the harmful influence of bad leverage points along the estimation of the mixture of regressions, which is still an open issue in the literature. The given methodology defines a well-posed statistical problem, whose estimator exists and is consistent to the corresponding solution of the population optimum, under widely general conditions. A feasible EM algorithm has also been provided to obtain the corresponding estimation. Many simulated examples and two real datasets have been chosen to show the ability of the procedure, on the one hand, to detect anomalous data, and, on the other hand, to identify the real cluster regressions without the influence of contamination.
Articolo in rivista - Articolo scientifico
Cluster weighted modeling Mixture of regressions Robustness Trimming Constrained estimation
English
3-feb-2016
2017
27
2
377
402
reserved
García Escudero, L., Gordaliza, A., Greselin, F., Ingrassia, S., Mayo Iscar, A. (2017). Robust estimation of mixtures of regressions with random covariates, via trimming and constraints. STATISTICS AND COMPUTING, 27(2), 377-402 [10.1007/s11222-016-9628-3].
File in questo prodotto:
File Dimensione Formato  
Robust estimation of mixtures of regressions with random covariates, via trimming and constraints.pdf

Solo gestori archivio

Descrizione: pre-print on Arkiv
Tipologia di allegato: Submitted Version (Pre-print)
Dimensione 508.04 kB
Formato Adobe PDF
508.04 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
143525.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 1.34 MB
Formato Adobe PDF
1.34 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/107931
Citazioni
  • Scopus 16
  • ???jsp.display-item.citation.isi??? 18
Social impact