U.S. flag

An official website of the United States government

Skip Header


Recommendations Regarding the Use of the 2020 Post-Enumeration Survey Coverage Results in the Vintage 2023 Population Estimates

Written by:

Estimated reading time: 13 minutes

Each year, the U.S. Census Bureau’s population estimates are used to allocate federal funding, as controls to improve the data from demographic surveys, as denominators to calculate vital rates and other key statistical indicators, and for many other uses. Because the population estimates play such a critical role in the nation’s statistical system, we continuously strive to improve the methods we use to produce them.

Census Bureau experts have been examining whether to use Post-Enumeration Survey (PES) results – estimates of undercounts and overcounts in the census – to improve the annual population estimates. This is a topic the Census Bureau has also explored in prior decades.

Each decade, the Census Bureau conducts a census of the U.S. population. Typically, we then use the decennial census counts as the base for building the yearly estimates of the population by age, sex, race and Hispanic origin at the national, state and county levels. We also estimate annual population totals of cities and towns based on the results of the decennial census. Communities and government entities use this annual time series of population estimates to inform research, planning and funding over the next decade.

This decade, the COVID-19 pandemic delayed the 2020 Census results needed to produce the 2021 estimates, so we created a “blended base” to build the yearly estimates. The blended base integrates several sources of data, rather than relying strictly on the 2020 Census results. More information on this work is available in the America Counts story Post-Pandemic Challenges and Opportunities for Population Estimates.

The blended base innovations prompted the Census Bureau to create the Base Evaluation and Research Team (BERT), consisting of Census Bureau subject matter experts focused on exploring ways to further improve the estimates base.

One question the team has explored is whether we could use the 2020 PES results to adjust the April 1, 2020, base population for the estimates because the 2020 PES results showed that some populations had statistically significant undercounts and overcounts in the 2020 Census at the national level:

  • The Hispanic or Latino population had a 4.99% undercount.
  • The Black or African American population had a 3.30% undercount.
  • The American Indian or Alaska Native population living on reservations had a 5.64% undercount.
  • The White alone, non-Hispanic population had a 1.64% overcount.
  • The Asian population had a 2.62% overcount.

It’s worth noting that PES results by race and Hispanic origin are only available for the household population at the national level.

The PES was designed to estimate the accuracy of the census by independently surveying a sample of the population. It was not, however, designed to adjust the annual population estimates.

Therefore, after a year-and-a-half of researching the issue, the PES staff and coverage experts on BERT recommend that the Census Bureau should:

  • Continue to use the PES to help inform the strengths and limitations of the 2020 Census counts as well as products derived from the 2020 counts, such as the population estimates, but not use the 2020 PES results to adjust the population estimates base.
  • Start planning early for a coverage measurement program that can adjust the population estimates base if there are large coverage errors in the 2030 Census.

In the sections below, we expand on each of these recommendations.

The Census Bureau should not use the 2020 PES results to adjust the population estimates base.

After 18 months of exploring how to use the PES results and considering the various constraints, BERT determined that there are no viable strategies for using the estimates of coverage directly to adjust the estimates base.

While the PES is helpful in identifying coverage issues at the national level, it is not able to identify them as accurately at lower levels of geography because of its design. The PES sample size was simply too small.

The key challenge with incorporating the 2020 PES in the population estimates base is that the PES net coverage error results by age, sex, race and Hispanic origin are only available for the nation and not for states and counties. This is because of the limited sample size of the PES and the overall approach to producing estimates of net coverage error. It’s these estimates of net coverage error that would form the basis of an adjustment to the population estimates. And they would be needed at the county level. That’s because the annual population estimates provide estimates for detailed demographic groups down to the county level.

We’ve faced these challenges before. In prior decades, the Census Bureau has encountered similar constraints that prevented us from adjusting the population estimates based on the PES results. We describe some of the history in more detail with the next recommendation.

We’ve considered a synthetic approach. In geographic areas where we have little or no PES sample data, we could use “synthetic estimation” to produce net coverage error estimates for demographic subgroups. This means we could model the estimates using averages across areas with a similar demographic composition.

To do this, we would assume that the coverage rates for individuals in a certain state or county are the same as individuals with similar characteristics at the national level. Basically, the coverage rate for a state or county would be determined by the characteristics of the people in that area rather than a unique area-level effect.  

This approach inevitably is accompanied by a systematic error or bias. The difference between the true coverage rate and the synthetic coverage rate is called synthetic bias. Prior to 2020, the PES state-level estimates were synthetic estimates because the estimation models did not include any state-specific effects. This is discussed more in the blog The Post-Enumeration Survey: Measuring Coverage Error.

For 2020, we implemented an improvement. The PES reduced the synthetic bias of the state estimates by including state-specific effects in the estimation models. The following graph shows the impact of this methodological decision.

This graph shows the PES net coverage error estimates for the total state household population from 1990 to 2020.

The PES state estimates for 1990, 2000 and 2010 were all synthetic estimates. This means there were no state-specific effects in the models used to estimate net coverage error. In other words, the state net coverage error estimates were determined by:

  • The characteristics of the people in the states.
  • The national-level estimates of census coverage for these characteristics.

An impact of the synthetic estimation is that the state estimates were driven by national and regional trends. In 2020, the estimation models included state effects that reduced the synthetic bias but also increased the variability in the net coverage error estimates.

Let’s focus on Washington, D.C., represented in the graph by the dashed line. From 1990 to 2010, Washington, D.C., had a point estimate of negative net coverage error, though these estimated net undercounts were not necessarily statistically significant. This makes sense because of its large Black or African American and renter populations – two groups historically undercounted at the national level. The synthetic estimates in these years reflected those national-level coverage patterns.

These national coverage patterns persisted in 2020, but the models for 2020 included a state effect. Using these models, the net coverage error estimate for Washington, D.C., shows an overcount (which was not statistically significantly different from zero) in 2020. The state effect for Washington, D.C., clearly overpowered the national-level effects for race and homeownership. Perhaps the Black or African American population and renters interacted with the census differently here than in the rest of the United States.

While we believe this reduction in synthetic bias in the state estimates is an improvement in the methodology, the tradeoff is that some state estimates of net coverage error had large sampling errors. Thus, even at the state level, there is much uncertainty in the PES estimates. Using the PES results to adjust the population estimates base at the state level would infuse this uncertainty into the population estimates at a magnitude that could overtake the adjustment being made.

Furthermore, this change in methodology for the state estimates shows how different the synthetic estimates of census coverage and estimates that include state effects can be. Because the 2020 PES sample size was too small to support models with effects for substate geographies or subnational demographic groups, any estimates of census coverage for counties or subnational demographic groups would be subject to synthetic bias in addition to substantial sampling variability. This would make the resulting population estimates unacceptably imprecise.

Instead, we recommend continuing to use the PES to help inform the strengths and limitations of the 2020 Census counts. This also provides insight on the strengths and limitations of the population estimates to the extent that they are derived from the counts.

The Census Bureau should start planning early for a coverage measurement program that can adjust the population estimates base if there are large coverage errors in the 2030 Census.

The 2020 PES was never designed to produce results that could be used to adjust the official population estimates for coverage errors in the 2020 Census.

Since the 1990 Census, various technical limitations have led the Census Bureau to determine that the PES results – while informative at the national and state levels – were insufficiently robust for improving the population estimates base for lower levels of geography.

  • After the 1990 Census, the Committee to Adjust the Postcensal Estimates (CAPE), a senior level group of Census Bureau statisticians and demographers, decided that an adjustment using the PES would improve the accuracy of the population estimates at the national level. But it recommended against using the PES to adjust the population estimates base because it was unclear if it would also improve the accuracy of the state and county estimates. CAPE’s findings are discussed in a 1993 Federal Register notice (download the pdf on the left of the Federal Register page; the discussion begins on p. 69 of the printed document, which is p. 77 of the pdf). The 1990 PES had a sample size of approximately 170,000 addresses and included both housing units and noninstitutional group quarters.
  • The initial plan for the 2000 Census was to conduct a PES with a sample size of 750,000 housing units and use the results to adjust the census and the base for the population estimates for coverage errors. However, a U.S. Supreme Court decision in 1999 prevented the Census Bureau from using a sample statistic to adjust the census results for the purpose of apportionment. The plans for the 2000 PES were ultimately revised and the sample size was reduced to 300,000 housing units. The Supreme Court decision did not preclude the Census Bureau from adjusting the base for the intercensal population estimates. However, the Census Bureau decided to use the unadjusted census counts for the intercensal population estimates base, citing various technical concerns.
  • The 2010 PES was originally planned with a sample size of 300,000 housing units, but this was reduced in 2009 to 170,000 housing units as part of an effort to lower nonsampling errors through additional quality control operations. The 2010 PES produced net coverage error estimates for the 50 states and Washington, D.C., as well as counties and places with populations of a least 100,000 people. The estimates for states, counties and places were all synthetic estimates.
  • The 2020 PES kept the sample size at 170,000 housing units, but changed the methodology for state estimates and did not release county or place estimates. The 2020 PES faced many of the same COVID-19 pandemic-related challenges and delays as the 2020 Census. For example, the primary interview operation was delayed from June to September 2020. Also, the data collection period ran longer than expected and was even reopened to give people more time to respond. Despite these challenges, the 2020 PES provided measures of census coverage for demographic groups and census operations. However, the 2020 PES was not designed to adjust the population estimates, especially at lower levels of geography where the sample size was too small to support high-quality coverage estimates.

BERT, based on its findings from PES staff and coverage experts, recommends the Census Bureau start planning now for a coverage measurement program that could be used to adjust the population estimates base for potential coverage errors in the 2030 Census.

This could include changes to the sample size and overall design of the estimation methodology — if the PES program is planning to field a traditional post-enumeration survey — or including the population estimates base as a use-case if a different approach to coverage estimation is being considered.

Another possibility is for the Census Bureau to explore whether we can use administrative records to produce estimates of net coverage error for the 2030 Census.

We often get questions about whether a specific city or neighborhood was undercounted in the 2020 Census. We even get questions about undercounts for specific population groups in a city or county.

Again, the coverage programs for the 2020 Census were not able to produce reliable estimates at these lower levels of geography. In fact, if we wanted to produce accurate net coverage error estimates for every city or neighborhood using a post-enumeration survey, we would need to essentially conduct a second census, which is not practical.

The key is for the Census Bureau to start planning now so the coverage measurement program is ready in time for the 2030 Census, and so the early stages of research are underway.

Looking Ahead

BERT and PES staff have shared these recommendations with senior leaders at the Census Bureau who have approved the approach. We have also presented these recommendations to a panel of external experts and received positive and supportive feedback. Additionally, we’re sharing the recommendations with the public through this blog and a webinar.

While the Census Bureau is unable to directly use the 2020 PES results to adjust the population estimates, we will continue to use the 2020 PES research to identify populations that could benefit from new sources or techniques to improve the population estimates base. For example, BERT is also researching how using administrative records might inform future coverage adjustments to the population estimates base.

Meanwhile, the “blended base” approach is helping us improve the annual population estimates by accounting for coverage issues in the 2020 Census. Specifically, the blended base incorporates national data by age and sex from the 2020 Demographic Analysis (DA) estimates. We have confidence in these estimates at the national level particularly because they include the full resident population, whereas the PES only includes the household population.  

By including the age and sex results from the 2020 DA, the blended base approach increases the number of young children in the population estimates base relative to what was enumerated in the 2020 Census. (Young children age 0 to 4 had a larger undercount than any other age group.) In this way, the blended base approach is having a mitigating effect on the undercount of young children in the 2020 Census. Given that a large percentage of young children are Black or African American and Hispanic or Latino, this method also indirectly mitigates the undercounts for these populations.

The BERT research on how to improve the annual population estimates is ongoing, and the team will continue to keep the public informed of its progress.

This article was filed under:

   
Page Last Revised - December 18, 2023
Is this page helpful?
Thumbs Up Image Yes Thumbs Down Image No
NO THANKS
255 characters maximum 255 characters maximum reached
Thank you for your feedback.
Comments or suggestions?

Top

Back to Header