Analysis Plan

Section 6 Electronic Health Record Data Extraction

Contributors

Many pragmatic clinical trials, whether designed as cluster randomized trials or as individually randomized trials, rely on data extraction from the participant’s electronic health record (EHR). Although study data extraction allows pragmatic trials to be performed quickly and at less expense than traditional clinical trials that establish redundant parallel data capture systems, they also introduce methodological and logistical challenges, such as those described in the white paper, "Assessing Data Quality for Healthcare Systems Data Used in Clinical Research."

EHR data extraction also poses challenges for statistical analysis. Data gathered from EHRs (which, by definition, are not purposely designed or optimized to support research activities) may have higher rates of missingness and error than data captured with purpose-built systems and subjected to “cleaning” and validation. Missing data, including that caused by the dropout of whole clusters, pose special issues for pragmatic trials. Preliminary data capture and assessment will provide a guide as to whether the intended study is feasible, given the availability and quality of the data.

Previous Section Next Section

SECTIONS

CHAPTER SECTIONS

sections

Resources

Using Electronic Health Record Data
Living Textbook chapter describing considerations for the use of EHR data in pragmatic trials

What Are the Key Factors in Using EHR Data for Endpoints and Outcomes?
Two-minute training module from the NIH Pragmatic Trials Collaboratory’s video library

What Are the Challenges of Using Data Directly From the EHRs?
Two-minute training module from the NIH Pragmatic Trials Collaboratory’s video library

Key Issues in Extracting Usable Data from Electronic Health Records for Pragmatic Clinical Trials
Guidance document from the Biostatistics and Study Design Core

Version History

April 30, 2024: Made nonsubstantive changes to the text and added items to the Resources sidebar as part of the annual content update (changes made by D. Seils).

June 23, 2022: Updated the name of the NIH Collaboratory in the contributors list and made nonsubstantive changes as part of the annual content update (changes made by D. Seils).

July 2, 2020: Minor corrections to layout and formatting (changes made by D. Seils).

May 1, 2020: Made nonsubstantive changes to the Resources sidebar as part of the annual content update (changes made by D. Seils).

January 16, 2019: Made nonsubstantive changes to the text as part of the annual content update (changes made by D. Seils).

Published August 25, 2017

COVID-19 Resources

COVID-19 Resources

Rethinking Clinical Trials

A Living Textbook of Pragmatic Clinical Trials

Electronic Health Record Data Extraction

Analysis Plan

Section 6

Electronic Health Record Data Extraction

SECTIONS

sections

Resources

current section :

Electronic Health Record Data Extraction

Citation: