Skip to content

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
  • Home
  • About
    • NIH Collaboratory
      • Coordinating Center
      • NIH Collaboratory Trials
      • Core Working Groups
      • Steering Committee
      • Distributed Research Network
      • Our Impact
    • Living Textbook
      • Table of Contents
      • How to Use This Site
  • Resources
    • Data and Resource Sharing
    • Training Resources
    • Tools for Researchers
    • Publications
    • Knowledge Repository
  • Webinar
  • Podcast
  • News
    • News Feed
    • Calendar
    • Subscribe
return to home
Subscribe to Newsletter go to twitter feed go to linkedin go to blue sky feed
Search
NIH Collaboratory
Living Textbook of
Pragmatic Clinical Trials

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
home button

Rethinking Clinical Trials

A Living Textbook of Pragmatic Clinical Trials

  • Design
    • What is a Pragmatic Clinical Trial?
    • Decentralized Pragmatic Clinical Trials
    • Developing a Compelling Grant Application
    • Experimental Designs and Randomization Schemes
    • Endpoints and Outcomes
    • Analysis Plan
    • Using Electronic Health Record Data
    • Building Partnerships and Teams to Ensure a Successful Trial
    • Intervention Delivery and Complexity
    • Patient Engagement
  • Data, Tools & Conduct
    • Assessing Feasibility
    • Acquiring Real-World Data
    • Assessing Fitness-for-Use of Real-World Data
    • Study Startup
    • Participant Recruitment
    • Monitoring Intervention Fidelity and Adaptations
    • Patient-Reported Outcomes
    • Clinical Decision Support
    • Mobile Health
    • Electronic Health Records–Based Phenotyping
    • Navigating the Unknown
  • Dissemination & Implementation
    • Data Sharing and Embedded Research
    • Dissemination Approaches for Different Audiences
    • Implementation
    • End-of-Trial Decision-Making
  • Ethics & Regulatory
    • Privacy Considerations
    • Identifying Those Engaged in Research
    • Collateral Findings
    • Consent, Disclosure, and Non-Disclosure
    • Data and Safety Monitoring
    • Ethical Considerations of Data Sharing in Pragmatic Clinical Trials
    • Ethics for AI and ML
    • IRB Responsibilities and Procedures

Evaluating Fitness for Use

CHAPTER SECTIONS

Assessing Fitness for Use of Real-World Data Sources


Section 3

Evaluating Fitness for Use

Expand Contributors

Keith A. Marsolo, PhD
Rachel Richesson, MS, PhD, MPH
Bradley G. Hammill, DrPH, MA
Lesley Curtis, PhD

Contributing Editor

Karen Staman, MS

As described in the previous section, the FDA has provided initial guidance on assessing dataset fitness, outlining the broad concepts of relevance (eg, do the data apply to the question of interest and can they be used in the proposed analysis) and reliability (eg, are the data accurate and complete, is provenance known, and are the data traceable). It is essential to know that there is not a single approach to assessing dataset fitness because it depends on the study and the context in which the data are used.

The authors of a recent empirical study conducted a series of interviews and surveys with PCT study teams to explore their concerns, practices, and decision-making around the fitness for use of real-world data (RWD), with a focus on EHR data. Their analysis showed that, while many PCTs conducted fitness-for-use assessments, less than a quarter did so before choosing a data source. Fitness-for-use activities, findings, and resulting study design changes were not often publicly documented, and overall costs were barriers to assessments. From their analysis, the study authors developed considerations that could help researchers improve the characterization of RWD in PCTs, including (1) defining and articulating how study-specific fitness for use will be assessed, (2) conducting fitness-for-use assessments before the trial begins, and (3) sharing the results of fitness assessments and relevant challenges and facilitators.

An article in the Journal of the American Informatics Association (JAMIA) provides more detail on evaluating fitness-for-use of electronic health records in pragmatic clinical trials (Rahman et al, 2022).

The following case study illustrates the challenges associated with getting approvals for and obtaining RWD for prospective studies (especially internationally, for US-based researchers), and evaluating the general fitness-for use of RWD for trials.

Case Study: Harmony Outcomes EHR Ancillary Study (eHARMONY)

The Harmony Outcomes trial was designed to determine the effect of albiglutide, when added to standard blood glucose-lowering therapies, on major cardiovascular events in patients with type 2 diabetes. eHARMONY was an ancillary study conducted alongside the Harmony Outcomes trial with 3 objectives related to understanding the potential of RWD in clinical trials:

  • Understand how EHR data are used to facilitate trial recruitment and the barriers to that use
  • Evaluate the fitness of RWD data for use in populating baseline characteristics in the electronic case report form (eCRF)
  • Evaluate the fitness of RWD data for use in identifying clinical endpoints (Hammil et al. 2022)

Getting approvals for and obtaining RWD for prospective studies

Originally, this multinational ancillary study planned to include RWD from the United States and several European countries, including the National Hospital Discharge registry (Sweden); the National Health Service Register (Denmark); and the National Health Service hospital discharge data (UK). However, beginning in 2018, the General Data Protection Regulation (GDPR) placed new restrictions on the movement of individual-level data outside the European Union. As a result, planned approaches to international data flow in the study were no longer feasible. Instead, the ancillary study had to rely on Medicare insurance claims data and EHR data from selected US study sites. For the EHR strategy to work, it required sites with a data warehouse based on EHR data; the ability to organize EHR data into a common data format; and an integrated clinical, operational, and technical team. Many of the sites approached for this ancillary study chose not participate because they knew they could not perform this work.

In the eHARMONY ancillary study, assessing a site’s technical capabilities and the quality of their EHR data up front was often not possible. In the end, not all selected sites were able to contribute meaningful data to the study. Among the lessons learned were:

  • Standalone clinical research sites had very little extractable EHR data about patients.
  • Most lab results and medications were either not extractable or not mapped to a useful terminology.
  • Many sites did not have the ability to transform their data into a common format and had to send rudimentary data extracts to the ancillary study coordinating center. Sites participating in other research networks, such as PCORnet, had no difficulty with this task.

Evaluating the general fitness for use of RWD for trials

The study team compared EHR data and Medicare claims data with the trial-collected data and found that agreement varied by data domain.

Demographics EHR and claims information consistently agreed with the eCRF.
Medical history There were inconsistent results, but RWD often had low sensitivity and high specificity.
Medications EHR data had low sensitivity and high specificity; claims data had substantially higher sensitivity than EHR data.
Lab results EHR lab results were often missing but agreed with the eCRF when present.
Events There was a very small number of events in the ancillary study population; EHR data had low sensitivity and high specificity; and claims data had substantially higher sensitivity than the EHR.

The study authors had further recommendations for future studies considering incorporating RWD as a data source:

  • Define inclusion/exclusion and outcome concepts to be more RWD-friendly. Prevalent disease (eg, cerebrovascular disease, coronary artery disease) is easier to identify than historical clinical events (eg, stroke, myocardial infarction). Focus on what’s available in structured data (eg, hospitalization with primary diagnosis of myocardial infarction), avoiding detailed clinical results (eg, ECG findings).
  • When studies have both purpose-collected data (ie, trial database) and RWD, they should perform validation studies as they are able, to contribute to the evidence base for RWD in clinical research.

Watch the Grand Rounds presentation with Q&A: Leveraging RWD in a Multinational Trial: Results from the other eHARMONY (Harmony Outcomes EHR Ancillary Study).

 

Previous Section Next Section

SECTIONS

CHAPTER SECTIONS

sections

  1. Introduction
  2. Defining Fitness for Use
  3. Evaluating Fitness for Use
  4. Data Quality Measures
  5. Use of Medicare Data in PCTs
  6. Data Source Accuracy: Case Study from TRANSLATE-ACS
  7. Data Provenance
  8. Operationalizing Fitness-for-Use Assessments

Resources

FDA guidance on use of EHR and claims data (2021)

This guidance for industry provides sponsors, researchers, and other interested parties with considerations for the use of EHR and claims data for regulatory decision making.

Evaluating fitness-for-use of EHRs in PCTs (2022)

This article empirically explores how ePCTs that used real-world data (RWD) assessed study-specific fitness-for-use.

REFERENCES

back to top

Hammill BH, Leimberger JD, Lampron Z, Raman SR, O'Brien EC, Wurst KE , Mountcastle S, Cunnington M, Janmohamed S, Curtis LH. Fitness of real-world data for clinical trial data collection: Results and lessons from a HARMONY Outcomes ancillary study. Clinical Trials. 24:17407745221114298 DOI: 10.1177/17407745221114298 PMID: 35876156

 

Raman SR, O’Brien EC, Hammill BG, et al. 2022. Evaluating fitness-for-use of electronic health records in pragmatic clinical trials: reported practices and recommendations. Journal of the American Medical Informatics Association. 29:798–804. doi:10.1093/jamia/ocac004. PMID: 35171985

back to top


Version History

Updated January 31, 2023: Added resource: Assessing Fitness-for-Use of Clinical Data for PCTs

Updated October 24, 2022: Updated references (changes made by K. Staman).

Published August 26, 2022

current section :

Evaluating Fitness for Use

  1. Introduction
  2. Defining Fitness for Use
  3. Evaluating Fitness for Use
  4. Data Quality Measures
  5. Use of Medicare Data in PCTs
  6. Data Source Accuracy: Case Study from TRANSLATE-ACS
  7. Data Provenance
  8. Operationalizing Fitness-for-Use Assessments

Citation:

Marsolo K, Richesson R, Hammill B, et al.. Assessing Fitness for Use of Real-World Data Sources: Evaluating Fitness for Use. In: Rethinking Clinical Trials: A Living Textbook of Pragmatic Clinical Trials. Bethesda, MD: NIH Pragmatic Trials Collaboratory. Available at: https://rethinkingclinicaltrials.org/chapters/conduct/assessing-fitness-for-use-of-real-world-data-sources/evaluating-fitness-for-use/. Updated December 3, 2025. DOI: 10.28929/185.

Footer Menu

  • How to Use This Site
  • About NIH Collaboratory
  • Enrollment Reporting
  • Grand Rounds
  • Funding Statement
Link to Twitter Link to LinkedIn Link to Blue Sky Link to NIH Collaboratory email

Reference in this Web site to any specific commercial products, process, service, manufacturer, or company does not constitute its endorsement or recommendation by the U.S. Government or National Institutes of Health (NIH). NIH is not responsible for the contents of any “off-site” Web page referenced from this server.

Log in
Privacy Statement
WordPress is a content management system and should not be used to upload any PHI as it is not an environment for which we exercise oversight, meaning you the author are responsible for the content you post. Please use this system accordingly. Site Map