A comparison of incomplete-data methods for categorical data

Authors	D.W. van der Palm L.A. van der Ark J.K. Vermunt
Publication date	2016
Journal	Statistical Methods in Medical Research
Volume \| Issue number	25 \| 2
Pages (from-to)	754-774
Organisations	Faculty of Social and Behavioural Sciences (FMG) Faculty of Social and Behavioural Sciences (FMG) - Research Institute of Child Development and Education (RICDE)
Abstract	We studied four methods for handling incomplete categorical data in statistical modeling: (1) maximum likelihood estimation of the statistical model with incomplete data, (2) multiple imputation using a loglinear model, (3) multiple imputation using a latent class model, (4) and multivariate imputation by chained equations. Each method has advantages and disadvantages, and it is unknown which method should be recommended to practitioners. We reviewed the merits of each method and investigated their effect on the bias and stability of parameter estimates and bias of the standard errors. We found that multiple imputation using a latent class model with many latent classes was the most promising method for handling incomplete categorical data, especially when the number of variables used in the imputation model is large.
Document type	Article
Language	English
Published at	https://doi.org/10.1177/0962280212465502
Downloads	A comparison of incomplete-data methods for categorical data (Final published version)
Permalink to this page

Back

UvA-DARE