U.S. flag

An official website of the United States government

Skip Header


Cleaning and Using Administrative Lists: Enhanced Practices and Computational Algorithms for Record Linkage and Modeling/Editing/Imputation

Written by:
RRS2018-05

Abstract

The national statistical institutes (NSIs), including the U.S. Census Bureau, developed the original methods for processing survey data. The NSIs developed systematic, efficient generalized methods/software based on the Fellegi-Holt model of statistical data editing (1976) and the Fellegi-Sunter model of record linkage (1969). The generalized software is suitable for processing files for businesses, survey institutes, and administrative organizations. Early systems at five NSIs yielded high quality results but were often much too slow for even a hundred thousand records. New computational algorithms yield drastic hundred-plus fold speed increases over algorithms used outside the NSIs and previously within the NSIs. The software is suitable for processing hundreds of millions or billions of records in a few days

Related Information


Page Last Revised - October 28, 2021
Is this page helpful?
Thumbs Up Image Yes Thumbs Down Image No
NO THANKS
255 characters maximum 255 characters maximum reached
Thank you for your feedback.
Comments or suggestions?

Top

Back to Header