Missing data assumptions and methods in a smoking cessation study
A sizable percentage of subjects do not respond to follow-up attempts in smoking cessation studies. The usual procedure in the smoking cessation literature is to assume that non-respondents have resumed smoking. This study used data from a study with a high follow-up rate to assess the degree of bias that may be caused by different methods of imputing missing data. Design and methods
Based on a large data set with very little missing follow-up information at 12 months, a simulation study was undertaken to compare and contrast missing data imputation methods (assuming smoking, propensity score matching and optimal matching) under various assumptions as to how the missing data arose (randomly generated missing values, increased non-response from smokers and a hybrid of the two). Findings
Missing data imputation methods all resulted in some degree of bias which increased with the amount of missing data. Conclusion
None of the missing data imputation methods currently available can compensate for bias when there are substantial amounts of missing data.
Document Type: Research Article
Affiliations: 1: Iowa State University, Department of Statistics and Center for Survey Statistics and Methodology, Ames, IA, USA and 2: Mayo Clinic College of Medicine, Division of Biostatistics, Rochester, MN, USA
Publication date: 01 March 2010