U.S. flag

An official website of the United States government

Skip Header


Additional Results from a Nationwide Matching of 2000 Census Data

Written by:
RRS2008-02

Abstract

A nationwide unduplication procedure is being considered for the 2010 Census. One potential problem is the possibility of finding large numbers of false positives, especially when matching above the county level. To help evaluate the extent of this problem, the matching and modeling procedures are being run on the data from the 2000 Census.

This report provides an overview of the results from Within Response Modeling, which evaluates households with multiple links, and of an analysis of the resulting Residual Person links. As expected, name frequency does not seem to have much effect for links accepted in Within Response Modeling, while most of the problem with apparent false matches in the Residual Person links seems to be concentrated in the most common surnames and the most common Hispanic surnames, especially for matches outside the state. In contrast, for given names there does not appear to be a strong effect of name frequency on false matches.

Related Information


Page Last Revised - October 28, 2021
Is this page helpful?
Thumbs Up Image Yes Thumbs Down Image No
NO THANKS
255 characters maximum 255 characters maximum reached
Thank you for your feedback.
Comments or suggestions?

Top

Back to Header