Skip to content

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
  • Home
  • About
    • NIH Collaboratory
      • Coordinating Center
      • NIH Collaboratory Trials
      • Core Working Groups
      • Steering Committee
      • Distributed Research Network
      • Our Impact
    • Living Textbook
      • Table of Contents
      • How to Use This Site
  • Resources
    • Data and Resource Sharing
    • Training Resources
    • Tools for Researchers
    • Publications
    • Knowledge Repository
  • Webinar
  • Podcast
  • News
    • News Feed
    • Calendar
    • Subscribe
return to home
Subscribe to Newsletter go to twitter feed go to linkedin go to blue sky feed
Search
NIH Collaboratory
Living Textbook of
Pragmatic Clinical Trials

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
home button

Rethinking Clinical Trials

A Living Textbook of Pragmatic Clinical Trials

  • Design
    • What is a Pragmatic Clinical Trial?
    • Decentralized Pragmatic Clinical Trials
    • Developing a Compelling Grant Application
    • Experimental Designs and Randomization Schemes
    • Endpoints and Outcomes
    • Analysis Plan
    • Using Electronic Health Record Data
    • Building Partnerships and Teams to Ensure a Successful Trial
    • Intervention Delivery and Complexity
    • Patient Engagement
  • Data, Tools & Conduct
    • Assessing Feasibility
    • Acquiring Real-World Data
    • Assessing Fitness-for-Use of Real-World Data
    • Study Startup
    • Participant Recruitment
    • Monitoring Intervention Fidelity and Adaptations
    • Patient-Reported Outcomes
    • Clinical Decision Support
    • Mobile Health
    • Electronic Health Records–Based Phenotyping
    • Navigating the Unknown
  • Dissemination & Implementation
    • Data Sharing and Embedded Research
    • Dissemination Approaches for Different Audiences
    • Implementation
    • End-of-Trial Decision-Making
  • Ethics & Regulatory
    • Privacy Considerations
    • Identifying Those Engaged in Research
    • Collateral Findings
    • Consent, Disclosure, and Non-Disclosure
    • Data and Safety Monitoring
    • Ethical Considerations of Data Sharing in Pragmatic Clinical Trials
    • Ethics for AI and ML
    • IRB Responsibilities and Procedures

Electronic Health Record Data Extraction

CHAPTER SECTIONS

Analysis Plan


Section 6

Electronic Health Record Data Extraction

Expand Contributors

Patrick J. Heagerty, PhD
Elizabeth R. DeLong, PhD
For the NIH Pragmatic Trials Collaboratory Biostatistics and Study Design Core

Contributing Editors
Damon M. Seils, MA
Jonathan McCall, MS

Many pragmatic clinical trials, whether designed as cluster randomized trials or as individually randomized trials, rely on data extraction from the participant’s electronic health record (EHR). Although study data extraction allows pragmatic trials to be performed quickly and at less expense than traditional clinical trials that establish redundant parallel data capture systems, they also introduce methodological and logistical challenges, such as those described in the white paper, "Assessing Data Quality for Healthcare Systems Data Used in Clinical Research."

EHR data extraction also poses challenges for statistical analysis. Data gathered from EHRs (which, by definition, are not purposely designed or optimized to support research activities) may have higher rates of missingness and error than data captured with purpose-built systems and subjected to “cleaning” and validation. Missing data, including that caused by the dropout of whole clusters, pose special issues for pragmatic trials. Preliminary data capture and assessment will provide a guide as to whether the intended study is feasible, given the availability and quality of the data.

Previous Section Next Section

SECTIONS

CHAPTER SECTIONS

sections

  1. Introduction
  2. Intraclass Correlation
  3. Unequal Cluster Sizes
  4. Accounting for Residual Confounding in the Analysis
  5. Missing Data and Intention-to-Treat Analyses
  6. Electronic Health Record Data Extraction
  7. Unanticipated Changes
  8. Interim Reassessment of Sample Size in Cluster Randomized Trials
  9. Case Study: STOP CRC Trial

Resources

Using Electronic Health Record Data
Living Textbook chapter describing considerations for the use of EHR data in pragmatic trials

What Are the Key Factors in Using EHR Data for Endpoints and Outcomes?
Two-minute training module from the NIH Pragmatic Trials Collaboratory’s video library

What Are the Challenges of Using Data Directly From the EHRs?
Two-minute training module from the NIH Pragmatic Trials Collaboratory’s video library

Key Issues in Extracting Usable Data from Electronic Health Records for Pragmatic Clinical Trials
Guidance document from the Biostatistics and Study Design Core


Version History

April 30, 2024: Made nonsubstantive changes to the text and added items to the Resources sidebar as part of the annual content update (changes made by D. Seils).

June 23, 2022: Updated the name of the NIH Collaboratory in the contributors list and made nonsubstantive changes as part of the annual content update (changes made by D. Seils).

July 2, 2020: Minor corrections to layout and formatting (changes made by D. Seils).

May 1, 2020: Made nonsubstantive changes to the Resources sidebar as part of the annual content update (changes made by D. Seils).

January 16, 2019: Made nonsubstantive changes to the text as part of the annual content update (changes made by D. Seils).

Published August 25, 2017

current section :

Electronic Health Record Data Extraction

  1. Introduction
  2. Intraclass Correlation
  3. Unequal Cluster Sizes
  4. Accounting for Residual Confounding in the Analysis
  5. Missing Data and Intention-to-Treat Analyses
  6. Electronic Health Record Data Extraction
  7. Unanticipated Changes
  8. Interim Reassessment of Sample Size in Cluster Randomized Trials
  9. Case Study: STOP CRC Trial

Citation:

Heagerty PJ, DeLong ER; for the NIH Health Care Systems Research Collaboratory Biostatistics and Study Design Core. Analysis Plan: Electronic Health Record Data Extraction. In: Rethinking Clinical Trials: A Living Textbook of Pragmatic Clinical Trials. Bethesda, MD: NIH Pragmatic Trials Collaboratory. Available at: https://rethinkingclinicaltrials.org/chapters/design/analysis-plan-top/ehr-data-extraction/. Updated April 30, 2024. DOI: 10.28929/019.

Footer Menu

  • How to Use This Site
  • About NIH Collaboratory
  • Enrollment Reporting
  • Grand Rounds
  • Funding Statement
Link to Twitter Link to LinkedIn Link to Blue Sky Link to NIH Collaboratory email

Reference in this Web site to any specific commercial products, process, service, manufacturer, or company does not constitute its endorsement or recommendation by the U.S. Government or National Institutes of Health (NIH). NIH is not responsible for the contents of any “off-site” Web page referenced from this server.

Log in
Privacy Statement
WordPress is a content management system and should not be used to upload any PHI as it is not an environment for which we exercise oversight, meaning you the author are responsible for the content you post. Please use this system accordingly. Site Map