Simultaneous Edit-Imputation for Continuous Microdata

Alan Francis Karr; HJ Kim; LH Cox; Alan Francis Karr; JP Reiter; QL Wang

Simultaneous Edit-Imputation for Continuous Microdata

Kim, HJ., Cox, LH., Karr, A., Reiter, JP., & Wang, QL. (2015). Simultaneous Edit-Imputation for Continuous Microdata. Journal of the American Statistical Association, 110(511), 987-999. https://doi.org/10.1080/01621459.2015.1040881

Copy citation

Abstract

Many statistical organizations collect data that are expected to satisfy linear constraints; as examples, component variables should sum to total variables, and ratios of pairs of variables should be bounded by expert-specified constants. When reported data violate constraints, organizations identify and replace values potentially in error in a process known as edit-imputation. To date, most approaches separate the error localization and imputation steps, typically using optimization methods to identify the variables to change followed by hot deck imputation. We present an approach. that fully integrates editing and imputation for continuous microdata under linear constraints. Our approach relies on a Bayesian hierarchical model that includes (i) a flexible joint probability model for the underlying true values of the data with support only on the set of values that satisfy all editing constraints, (ii) a model for latent indicators of the variables that are in error, and (iii) a model for the reported responses for variables in error. We illustrate the potential advantages of the Bayesian editing approach over existing approaches using simulation studies. We apply the model to edit faulty data from the 2007 U.S. Census of Manufactures. Supplementary materials for this article are available online.

Recent Publications

Article

The early motor questionnaire facilitates the remote assessment of normative motor development in infancy and toddlerhood

January 01, 2025

Article

Adult vaccination coverage in the United States

December 31, 2024

Article

Outcomes of substance use and sexual power among adolescent girls and young women in Cape Town

December 31, 2024

Article

The impact of violations of expected utility theory on choices in the face of multiple risks

December 01, 2024

View All Publications