Beat the heap: An imputation strategy for valid inferences from rounded income data
Abstract
"Questions on income in surveys are prone to two sources of errors that can cause bias if not addressed adequately at the analysis stage. On the one hand, income is considered sensitive information, and response rates on income questions generally tend to be lower than response rates for other nonsensitive questions. On the other hand, respondents usually do not remember their exact income and thus tend to provide a rounded estimate. The negative effects of item nonresponse are well studied, and most statistical agencies have developed sophisticated imputation methods to correct for this potential source of bias. However, to our knowledge, the effects of rounding are hardly ever considered in practice, despite the fact that several studies have found strong evidence that most of the respondents round their reported income values. In this article, we illustrate the substantial impact that rounding can have on important measures derived from the income variable, such as the poverty rate. To obtain unbiased estimates, we propose a two-stage imputation strategy that estimates the posterior probability for rounding given the observed income values at the first stage and reimputes the observed income values given the rounding probabilities at the second stage. A simulation study shows that the proposed imputation model can help overcome the possible negative effects of rounding. We also present results based on the household income variable from the German panel study Labour Market and Social Security." (Author's abstract, IAB-Doku) ((en))
Cite article
Drechsler, J. & Kiesl, H. (2016): Beat the heap: An imputation strategy for valid inferences from rounded income data. In: Journal of Survey Statistics and Methodology, Vol. 4, No. 1, p. 22-42. DOI:10.1093/jssam/smv032
Further information
earlier released (possibly different) as: IAB-Discussion Paper , 02/2014