About the EHealth Program

Good policy and improved health start with quality data. The Enhancing Health Data (EHealth) Program uses an advanced data integration process to link disparate heath, demographic, and economic data. This process produces nationwide resources that decision makers can use to improve healthcare outcomes and healthcare delivery on a large scale.

We structure our efforts around three main areas of work that collectively help to answer some of our nation’s most pressing healthcare questions:

  1. Repurpose existing data to produce cost-effective statistics, minimize burden on patients and clinicians, and improve data quality for new applications through record linkage.
  2. Leverage partnerships to produce collaborative work with associations, government agencies, and academic institutions, address real-world data challenges through strategic collaborations, and integrate partner electronic health records to produce novel insights.
  3. Modernize federal statistics to overcome survey and traditional analytical limitations by applying data science methods, pilot innovative solutions for maintaining privacy of integrated data, and deliver broad capabilities reusable beyond health applications.
How do we ensure sensitive data are protected?

The EHealth Program works with anonymized health data in a protected environment accessed only by approved users to produce innovative statistics and data products. The creation of blended electronic health records (EHR) and Census Bureau data expands the type and quality of data products and discoveries that can be created. This allows the public and policymakers to improve decision making by implementing policies that directly impact individuals’ access to care, quality of care, and health outcomes.

The secure disclosure of data derived from EHRs to the Census Bureau satisfies the data owner’s obligation to protect these data as required by the Health Insurance Portability and Accountability Act (HIPAA). Additionally, Title 13 of the U.S. Code authorizes the Census Bureau to collect information from other entities and requires the Census Bureau to keep the information confidential and use it only for statistical purposes. All analyses are conducted within a secure computing environment and data access is limited to the number of individuals necessary to achieve the objectives. All users must successfully complete annual data handling training and are sworn for life to protect the confidentiality of the data. Any and all aggregate statistics using linked EHR and Census Bureau microdata are produced employing robust disclosure avoidance methods in full compliance with the Census Bureau’s confidentiality requirements as required by law (Title 13 U.S.C., Section 9). Finally, all data products are reviewed and approved by the Census Bureau’s Disclosure Review Board (DRB) prior to public release. 

Page Last Revised - May 28, 2025