U.S. flag

An official website of the United States government

Skip Header


2020 Census Processing Updates

Written by:

In keeping with the U.S. Census Bureau’s long-established commitment to being entirely transparent in the production of our statistics and data products, I’m writing to provide an update on data processing for the 2020 Census. In every decennial census, we are the first to identify and analyze the quality of our data, including the extent to which we overcount or undercount key population groups in our country. We cannot do this in detail until we complete the Post-Enumeration Survey later this year, however we know a lot already about the accuracy and completeness of our population counts in the 2020 Census. I blogged with some initial impressions in early November, and we’ve made a lot of progress since then. But as reported in the media, some issues have surfaced as well. Most of these issues are typical and are similar to those we’ve encountered in prior decennial censuses. Others are novel to planned improvements for the 2020 Census, and some are related to the difficulties experienced collecting data during the COVID-19 pandemic.

The main consequence of these issues regards the schedule. The Census Bureau takes its constitutional and statutory duties very seriously. Even with the pandemic delaying data collection, we hoped to deliver the state population counts for apportionment by the statutory deadline of Dec. 31, 2020. Even with data collection not ending until mid-October, this was technically possible as long as we didn’t encounter any significant processing issues. However, we were also realistic knowing that all prior decennial censuses encountered such issues. We devoted additional resources, including staff working weekends and holidays, to meet the deadline. Even with these additional resources, we knew that we would need to stay on what we referred to as the “happy path,” where each stage of processing would be completed with no issues in order to process the data in two and a half months rather than the five months that our final Operational Plan called for. The path we actually experienced was much more like those we’ve historically experienced in prior censuses than the “happy path” we had hoped would allow us to deliver the apportionment data on time. The result is that our current schedule points to April 30, 2021, for the completion of the apportionment counts.

The desire to meet the statutory deadline meant that there was unprecedented attention by Census Bureau and U.S. Department of Commerce leadership and by outside observers on the data processing schedule. It is important to know that while we had the goal of finishing by the statutory deadline, or as close to it as possible, the Census Bureau’s most important objective — the objective that has driven our entire approach to the 2020 Census — is to deliver a complete and accurate census. That is, to count every person residing in the country once, only once, and in the right place. To achieve this objective, ALL processing issues we find are carefully researched, a fix is developed and tested, and then implemented. Because this can be a time-consuming process, the “happy path” that would have met the statutory deadline was not achievable.

The issues we’ve uncovered are varied in their underlying cause, their magnitude, and the complexity of their remedies. Coming soon will be more detailed information about these issues and how we’re addressing them from experts far more qualified to comment on them than I. But I want to talk briefly about some broad classes of issues we’re seeing, some more concerning that others.

First, there are what one might call “standard” problems that arise in processing any large survey. These might include what we’ve been referring to as processing anomalies such as basic errors in processing code, mismatches between code and business rules, errors in data handoffs between systems, and misalignment of processing business rules between phases. These make up the majority of the issues we’ve encountered in processing the 2020 Census. It is common to encounter issues like these when one runs the entire country through post-collection processing and these issues are relatively straightforward to address. Thus, we’re not concerned about them impacting the quality of the final data. And while we try to apply lessons learned from prior censuses and surveys to minimize the prevalence and impact of these types of issues, design changes to the 2020 Census intended to make it easier for everyone to respond can create new and unanticipated complexities for data processing as the number of response modes (e.g., internet, phone, and paper self-response; administrative records; and visits by enumerators) has increased relative to 2010.

The other class of issues arise from the nature of the responses to the 2020 Census. Again, we’ve experienced most of these before, but some have been exacerbated by design changes for 2020 and, most importantly, by the pandemic. These include whether households respond (response rates), the completeness of their responses (I mentioned the issue of item nonresponse in my November blog), and the ability of our enumerators to contact nonresponding households (e.g., in areas impacted by natural disasters) and the cooperation of those households.   

Enumerating Group Quarters (GQs) facilities is a challenge in every decennial census, but we are seeing additional complications brought on by the COVID-19 pandemic. GQs are facilities such as college dormitories, prisons and nursing homes. We delayed this and other field operations due to the pandemic. This delay, and the fact that some facilities emptied in the spring due to the pandemic, has caused issues with our GQs enumeration. Even though these issues affect a relatively small part of the total count, they can have a big impact on the count for the communities in which they’re located. As a result, we re-contacted thousands of facilities and have brought in new data sources such as the Integrated Postsecondary Education Data System (for college dormitories) to resolve these issues.

Another issue we experience with every decennial census is duplicate responses. The addition of the internet response option and the ability to respond without a Census ID# have increased duplicate responses, as we expected. Our data processing in the 2010 Census was able to handle this challenge, and we are again well-equipped to handle duplicates in the 2020 Census.

Counting everyone once, only once, and in the right place is a daunting challenge even in the best of circumstances, and the circumstances presented for the 2020 Census were not the best. At the end of the day, the key question is, did those circumstances impact the fitness of the data we will release based on the 2020 Census? Knowing that the COVID-19 pandemic might pose data quality issues for the 2020 Census, I chartered the 2020 Data Quality Executive Guidance Group last April to ensure that we had the right focus and resources dedicated to detecting and addressing data quality issues. Since then, I and other senior leaders have met regularly with various teams working the 2020 Census to discuss a range of quality-related issues. During data collection, these discussions centered on how game-time decisions to cope with the pandemic, hurricanes, wildfires and civil unrest might affect data quality and what we could do to mitigate any impacts to data quality. During post-collection processing, we’ve reviewed processing anomalies, discussed remedies, and reviewed early quality indicators. Most importantly, we’ve ensured that our dedicated staff have the time and resources to do the job right. To increase the transparency of our efforts, we will be releasing additional blogs from Census Bureau experts that dive more deeply into the issues discussed above. Also, we are working with a team of experts from the American Statistical Association on quality indicators (I mentioned our intention to engage them in my November blog) and members of  JASON who, since the beginning of data collections, continue to review our processes, procedures and key decisions. JASON is an independent group of technical experts that advise the federal government on sensitive matters in science and technology. These efforts will give the public an unprecedented behind-the-scenes look at the 2020 Census and should provide additional confidence in assessing the fitness of the 2020 Census data.

As we complete each major stage of processing and release 2020 Census data products to the public, we will be releasing quality indicators appropriate for each release. Later this month, we will begin processing the Census Unedited File (CUF) from which the apportionment counts are tabulated. For the release of the apportionment data by the end of April, we plan to release state-level quality indicators based on the CUF. Once the CUF is complete, we will begin processing the Census Edited File from which the Public Law 94-171 redistricting data are tabulated. We hope to have an update on this schedule soon. Here again, we are developing appropriate quality indicators to accompany this release. Please continue to watch for more updates from my colleagues in the coming weeks.

Related blogs


Random Samplings Blog
Updates to OMB’s Race/Ethnicity Standards
OMB published the results of its review of SPD 15 and issued updated standards for collecting and reporting race and ethnicity data across federal agencies.


Random Samplings Blog
Upcoming 2020 Census Coverage Estimates
The U.S. Census Bureau released coverage estimates for the 2020 Census.


Random Samplings Blog
The Post-Enumeration Survey: Measuring Coverage Error
Although we undertake extensive efforts to accurately count everyone in the decennial census, sometimes people are missed or duplicated.


Random Samplings Blog
Using Demographic Benchmarks to Help Evaluate 2020 Census Results
One of the primary methods of evaluating the quality of a census is comparing the results to other population benchmarks.


Random Samplings Blog
Programa de Evaluaciones y Experimentos del Censo del 2020
Este blog describe la serie de evaluaciones formales que miden diferentes aspectos de las operaciones del censo y los desafíos.


Random Samplings Blog
2020 Census Program for Evaluations, Experiments, and Assessments
This blog describes the series of formal evaluations and assessments that measure different aspects of census operations and specific challenges.


Random Samplings Blog
Improvements to the 2020 Census Race and Hispanic Origin Question Designs, Data Processing, and Coding Procedures
This blog discusses how we improved the census questions on race and Hispanic origin, also known as ethnicity, between 2010 and 2020.


Random Samplings Blog
Improvements to the 2020 Census Race and Hispanic Origin Question Designs, Data Processing, and Coding Procedures
This blog discusses how we improved the census questions on race and Hispanic origin, also known as ethnicity, between 2010 and 2020.


Random Samplings Blog
How We Complete the Census When Demographic and Housing Characteristics Are Missing
Although we strive to obtain all demographic and housing data from every individual in the census, missing data are part of every census process.


Random Samplings Blog
Censo del 2020: Métricas de calidad, Publicación 2
Este blog proporciona datos destacados del segundo grupo de métricas operacionales de calidad del Censo del 2020.


Random Samplings Blog
2020 Census Operational Quality Metrics: Release 2
Today we released the second round of 2020 Census operational quality metrics.


Random Samplings Blog
Examining Operational Quality Metrics
The Census Bureau is taking a multifaceted approach to studying the quality of the 2020 Census, so as to produce a more complete and informative picture.


Random Samplings Blog
Comparisons to Benchmarks as a Measure of Quality
Data quality is multidimensional and so approaching it from multiple angles produces a more insightful and holistic picture of a dataset.


Random Samplings Blog
2020 Census Data Review
For the 2020 Census, we are conducting one of the most comprehensive reviews in recent census history.


Random Samplings Blog
Revisión de los datos del Censo del 2020
En este blog hablamos sobre cómo estamos realizando una de las revisiones de datos más completas en la historia reciente del censo, para el Censo del 2020.


Random Samplings Blog
Completing the Census When Households or Group Quarters Don't Respond
As we continue to process 2020 Census responses, people have asked what happens when we don’t get a response from an address.


Random Samplings Blog
Cómo completamos el censo cuando los hogares no responden
Mientras continuamos procesando las respuestas al Censo del 2020, las personas han preguntado qué sucede cuando no obtenemos una respuesta de una dirección.


Random Samplings Blog
Administrative Records and the 2020 Census
Each decade we are asked, “Why don’t you just use the information the government already has about me for the census? Why ask me again?”


Random Samplings Blog
Los registros administrativos y el Censo del 2020
Este blog describe cómo el Censo del 2020 usó los registros administrativos para contar a las personas que no respondieron.


Random Samplings Blog
Introduction to Quality Indicators: Operational Metrics
In the coming weeks, the U.S. Census Bureau will release the first set of results from the 2020 Census. Our goal for every census is to count everyone once, only once, and in the right place.


Random Samplings Blog
2020 Census Group Quarters
As we continue processing 2020 Census results, we’d like to provide more information on how we count people living in group quarters (GQs).


Random Samplings Blog
Encontrar ‘anomalías’ demuestra que los controles de calidad funcionan
El 9 de marzo de 2021, la Oficina del Censo de los EE. UU. publicó un blog (en inglés) sobre las “anomalías” que encontramos al procesar los datos del Censo del 2020.


Random Samplings Blog
Finding 'Anomalies' Illustrates 2020 Census Quality Checks Are Working
We’re in the midst of data processing for the 2020 Census. As Acting Census Bureau Director Ron Jarmin acknowledged in a recent blog, we’ve discovered some “anomalies” along the way that we’re looking into and resolving.


Random Samplings Blog
Adapting Field Operations to Meet Unprecedented Challenges
As we process census responses and analyze the quality of the 2020 Census, it’s helpful to look back at some of the unprecedented challenges we faced during this census.


Random Samplings Blog
Adaptación de las operaciones de campo para enfrentar desafíos
La oficina del Censo de los EE. UU. compartió información en una publicación de blog el 1 de marzo de 2021, acerca de cómo la realización de un censo es una tarea enorme, incluso en circunstancias ideales.


Random Samplings Blog
Ensuring a Robust and Accurate Data Quality Analysis in the 2020 Census
Asking outside experts to review our work is standard operating procedure at the U.S. Census Bureau. It underscores our commitment to quality and transparency.


Random Samplings Blog
Timeline for Releasing Redistricting Data
We expect to deliver the redistricting data to the states and the public by Sept. 30, 2021.


Random Samplings Blog
Census Data Processing 101
Michael Thieme describes how census data processing works to ensure the census is accurate.


Director's Blog
2020 Census Processing Updates
I’m writing to provide an update on data processing for the 2020 Census.


Random Samplings Blog
Update on 2020 Census Data Processing and Quality
The Census Bureau has begun processing the data collected for the 2020 Census. Data collection for the decennial census is always a herculean task and 2020 was no exception.

Related Information


Page Last Revised - October 8, 2021
Is this page helpful?
Thumbs Up Image Yes Thumbs Down Image No
NO THANKS
255 characters maximum 255 characters maximum reached
Thank you for your feedback.
Comments or suggestions?

Top

Back to Header