Abstract
Mixture regression models (MRMs) are widely used to capture the heterogeneity of relationships between the response variable and one or more predictors coming from several non-homogeneous groups. Since the conventional MRMs are quite sensitive to departures from normality caused by extra skewness and possible heavy tails, various extensions built on more flexible distributions have been put forward in the last decade. The class of normal mean-variance mixture (NMVM) distributions that arise from scaling both the mean and variance of a normal random variable with a common mixing distribution encompasses many prominent (symmetric or asymmetrical) distributions as special cases. A unified approach to robustifying MRMs is proposed by considering the class of NMVM distributions for component errors. An expectation conditional maximization either (ECME) algorithm, which incorporates membership indicators and the latent scaling variables as the missing data, is developed for carrying out maximum likelihood (ML) estimation of model parameters. Four simulation studies are conducted to examine the finite-sample property of ML estimators and the robustness of the proposed model against outliers for contaminated and noisy data. The usefulness and superiority of our methodology are demonstrated through applications to two real datasets.
Original language | English |
---|---|
Article number | 107661 |
Number of pages | 19 |
Journal | Computational Statistics and Data Analysis |
Volume | 180 |
Early online date | 21 Nov 2022 |
DOIs | |
Publication status | Published - 1 Apr 2023 |
Externally published | Yes |
Keywords
- EM-type algorithm
- Mixture regression model
- Normal mean-variance mixture
- Outliers
- Robustness
- Skewness