Abstract
Non‐random sampling is a source of bias in empirical research. It is common for the outcomes of interest (e.g. wage distribution) to be skewed in the source population. Sometimes, the outcomes are further subjected to sample selection, which is a type of missing data, resulting in partial observability. Thus, methods based on complete cases for skew data are inadequate for the analysis of such data and a general sample selection model is required. Heckman proposed a full maximum likelihood estimation method under the normality assumption for sample selection problems, and parametric and non‐parametric extensions have been proposed. We generalize Heckman selection model to allow for underlying skew‐normal distributions. Finite‐sample performance of the maximum likelihood estimator of the model is studied via simulation. Applications illustrate the strength of the model in capturing spurious skewness in bounded scores, and in modelling data where logarithm transformation could not mitigate the effect of inherent skewness in the outcome variable.
Original language | English |
---|---|
Pages (from-to) | 172-190 |
Number of pages | 19 |
Journal | Scandinavian Journal of Statistics |
Volume | 43 |
Issue number | 1 |
Early online date | 30 Jul 2015 |
DOIs | |
Publication status | Published - 16 Feb 2016 |
Keywords
- generalized skew-normal distribution
- missing data
- non-random sample
- sample selection