Re-identification Methods for Masked Microdata

April 21, 2004

Written by:

William E. Winkler

RRS2004-03

Abstract

Statistical agencies often mask (or distort) microdata in public-use files so that the confidentiality of information associated with individual entities is preserved. The intent of many of the masking methods is to cause only minor distortions in some of the distributions of the data and possibly no distortion in a few aggregate or marginal statistics. In record linkage (as in nearest neighbor methods), metrics are used to determine how close the value of a variable in one record is to the value of the corresponding variable in another record. If a sufficient number of variables in one record have values that are close to values in another record, then the records may be a match and correspond to the same entity. This paper shows that it is possible to create metrics for which re-identification is straightforward in many situations where masking is currently done. We begin by demonstrating how to quickly construct metrics for continuous variables that have been micro-aggregated one at a time using conventional methods. We extend the methods to situations where rank swapping is performed and discuss the situation where several continuous variables are micro-aggregated simultaneously. We close by indicating how metrics might be created for situations of synthetic microdata satisfying several sets of analytic constraints.
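
The idea of micro-aggregating continuous variables one at a time and then linking masked records back to their sources with a closeness metric can be illustrated with a small sketch. The abstract does not specify the paper's metrics, so everything here is an assumption made for illustration: the function names (`microaggregate`, `mask_file`, `reidentify`), the group size `k`, the range-scaled sum of absolute differences used as the metric, the simulated uniform data, and the premise that the intruder holds the unmasked values.

```python
import random


def microaggregate(values, k=3):
    """Univariate micro-aggregation: sort the values, form groups of at
    least k consecutive values, and replace each value by its group mean."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    masked = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start + k
        # Fold a short trailing remainder into the last group so that
        # every group has at least k members.
        if len(order) - end < k:
            end = len(order)
        group = order[start:end]
        mean = sum(values[i] for i in group) / len(group)
        for i in group:
            masked[i] = mean
        start = end
    return masked


def mask_file(records, k=3):
    """Micro-aggregate each variable (column) independently of the others."""
    columns = list(zip(*records))
    masked_columns = [microaggregate(list(col), k) for col in columns]
    return [list(row) for row in zip(*masked_columns)]


def reidentify(original, masked):
    """For each masked record, return the index of the nearest original
    record under a range-scaled sum of absolute differences (an assumed,
    illustrative metric, not the one from the paper)."""
    n_vars = len(original[0])
    spans = []
    for j in range(n_vars):
        column = [r[j] for r in original]
        spans.append((max(column) - min(column)) or 1.0)

    def dist(a, b):
        return sum(abs(a[j] - b[j]) / spans[j] for j in range(n_vars))

    return [
        min(range(len(original)), key=lambda i: dist(original[i], m))
        for m in masked
    ]


# Simulated data: 200 records with 5 independent continuous variables.
rng = random.Random(0)
original = [[rng.random() for _ in range(5)] for _ in range(200)]
masked = mask_file(original, k=3)
guesses = reidentify(original, masked)
rate = sum(g == i for i, g in enumerate(guesses)) / len(original)
print(f"re-identified {rate:.0%} of records")
```

Because each variable is grouped separately, a masked value stays close to its original value in every variable at once, while an unrelated record is rarely close in all of them. That is the intuition behind the sketch; the simulated variables are independent and uniform, so the result says nothing about any particular real file.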

Others in Series

Working Paper

Improving EM Algorithm Estimates for Record Linkage Parameters

February 18, 2004

Working Paper

An Adaptive String Comparator for Record Linkage

February 19, 2004

Working Paper

Tabular Statistical Disclosure Control: Optimization Techniques in Suppression and Controlled Tabular Adjustment

September 23, 2004

Related Information

WORKING PAPER

Statistical Research Reports and Studies

Page Last Revised - October 28, 2021
