Estimation of a linear regression under microaggregation with the response variable as a sorting variable
Microaggregation is a popular statistical disclosure control technique for continuous data. Its basic principle is to group the observations in a data set and to replace each observation by the mean of its group. However, while reducing the disclosure risk of data files, the technique also affects the results of statistical analyses. This paper examines the impact of microaggregation on multiple linear regression with continuous variables. We show that parameter estimates are biased if the dependent variable is used as the sorting variable to form the groups. Using this result, we develop a consistent estimator that removes the aggregation bias, and derive its asymptotic covariance matrix.
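The grouping-and-averaging principle described above can be illustrated with a small simulation. The following is a minimal sketch, not the paper's procedure: the function names (`microaggregate_by_response`, `ols_coefficients`), the fixed group size of 3, the simulated model y = 1 + 2x + e, and the sample size are all assumptions chosen for illustration. It sorts the observations by the response, replaces every variable by its group mean, and compares the naive least-squares fit on the original and on the aggregated data. The bias-corrected estimator developed in the paper is not reproduced here.

```python
import numpy as np


def microaggregate_by_response(X, y, group_size=3):
    """Sort by y, form consecutive groups, replace each row by its group mean.

    Hypothetical helper for illustration; if len(y) is not a multiple of
    group_size, the last group is simply smaller.
    """
    order = np.argsort(y, kind="stable")
    X_agg = np.empty_like(X, dtype=float)
    y_agg = np.empty_like(y, dtype=float)
    for start in range(0, len(y), group_size):
        idx = order[start:start + group_size]
        X_agg[idx] = X[idx].mean(axis=0)  # group mean of every regressor
        y_agg[idx] = y[idx].mean()        # group mean of the response
    return X_agg, y_agg


def ols_coefficients(X, y):
    """Ordinary least squares with an intercept; returns [intercept, slopes...]."""
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


# Simulated data (assumed model, not taken from the paper): y = 1 + 2*x + e
rng = np.random.default_rng(0)
n = 30_000
X = rng.normal(size=(n, 1))
y = 1.0 + 2.0 * X[:, 0] + rng.normal(size=n)

X_agg, y_agg = microaggregate_by_response(X, y, group_size=3)

beta_raw = ols_coefficients(X, y)          # fit on the original data
beta_agg = ols_coefficients(X_agg, y_agg)  # naive fit on microaggregated data

print("slope on original data:      ", beta_raw[1])
print("slope on microaggregated data:", beta_agg[1])
```

In this setup the naive slope from the aggregated data should differ noticeably from the slope estimated on the original data, which is the kind of aggregation bias the abstract refers to. Group means preserve the overall sample means, so the distortion enters through the variances and covariances rather than through the location of the data.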