Skip to content

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
  • Home
  • About
    • NIH Collaboratory
      • Coordinating Center
      • NIH Collaboratory Trials
      • Core Working Groups
      • Steering Committee
      • Distributed Research Network
      • Our Impact
    • Living Textbook
      • Table of Contents
      • How to Use This Site
  • Resources
    • Data and Resource Sharing
    • Training Resources
    • Tools for Researchers
    • Publications
    • Knowledge Repository
  • Webinar
  • Podcast
  • News
    • News Feed
    • Calendar
    • Subscribe
return to home
Subscribe to Newsletter go to twitter feed go to linkedin go to blue sky feed
Search
NIH Collaboratory
Living Textbook of
Pragmatic Clinical Trials

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
home button

Rethinking Clinical Trials

A Living Textbook of Pragmatic Clinical Trials

  • Design
    • What is a Pragmatic Clinical Trial?
    • Decentralized Pragmatic Clinical Trials
    • Developing a Compelling Grant Application
    • Experimental Designs and Randomization Schemes
    • Endpoints and Outcomes
    • Analysis Plan
    • Using Electronic Health Record Data
    • Building Partnerships and Teams to Ensure a Successful Trial
    • Intervention Delivery and Complexity
    • Patient Engagement
  • Data, Tools & Conduct
    • Assessing Feasibility
    • Acquiring Real-World Data
    • Assessing Fitness-for-Use of Real-World Data
    • Study Startup
    • Participant Recruitment
    • Monitoring Intervention Fidelity and Adaptations
    • Patient-Reported Outcomes
    • Clinical Decision Support
    • Mobile Health
    • Electronic Health Records–Based Phenotyping
    • Navigating the Unknown
  • Dissemination & Implementation
    • Data Sharing and Embedded Research
    • Dissemination Approaches for Different Audiences
    • Implementation
    • End-of-Trial Decision-Making
  • Ethics & Regulatory
    • Privacy Considerations
    • Identifying Those Engaged in Research
    • Collateral Findings
    • Consent, Disclosure, and Non-Disclosure
    • Data and Safety Monitoring
    • Ethical Considerations of Data Sharing in Pragmatic Clinical Trials
    • Ethics for AI and ML
    • IRB Responsibilities and Procedures

Developing and Refining the Research Questions

CHAPTER SECTIONS

Using Electronic Health Record Data in Pragmatic Clinical Trials


Section 4

Developing and Refining the Research Questions

Expand Contributors
Rachel Richesson, MS, PhD, MPH
Richard Platt, MD, MSc
Gregory Simon, MD, MPH
Lesley Curtis, PhD
Reesa Laws, BS
Adrian Hernandez, MD, MSH
Jon Puro, MPA-HA
Doug Zatzick, MD
Erik van Eaton, MD, FACS
Vincent Mor, PhDKeith A Marsolo, PhD

Contributing Editor
Karen Staman, MS

As with any type of research study, PCTs begin with a scientific question. A clearly articulated research question defines the phenomena of interest, the purpose for using EHR data, the possible sources of data to detect that phenomena, and, more specifically, the data requirements, definitions, quality, and data collection plan. In practice, however, there is often an iterative process between defining the data requirements for the EHR and understanding what is actually available at a given institution. The researcher must consider what information is needed to answer the question, and in turn, the available data may then influence the research question (i.e., ongoing cycles of this conversation: Health System: What data do you need for the trial? Researcher: That depends… What data is available?) This dialogue may lead investigators to refine their research question slightly to one that is likely more “answerable” based upon what data are collected or available, and is part of the process of Building Partnerships to Ensure a Successful Trial. Anecdotally, many researchers see the potential for discovery in clinical data warehouses derived from EHRs, and are excited to use the available data to generate questions and answers. While this approach is understandable and practical in theory, the complexity of EHR architectures, and their inherent bias and possible error, could produce inaccurate data and lead to misinterpretation of results and erroneous conclusions. Consequently, we assert that the scientific research question should be the fundamental driver for the study design and hence the foundation for any PCT.

Good clinical research practice and ethics dictate that clinical trials collect the necessary data (and ONLY the necessary data) to answer a specific research question (FDA Good Clinical Practice Guidance 2018). In most PCTs, it is vitally important to identify and measure co-variates between study arms or clusters. In these cases, there need to be high quality (accurate, complete) data from a number of variables so that researchers can assess the comparability between groups. The objective is to achieve balance between the groups along as many dimensions that are relevant, important, and feasible.

Because developing a PCT that will use data from the EHR can be extremely complicated, the NIH Collaboratory has developed a set of papers and chapters that provide a deeper dive into many of the issues, including developing data definitions and phenotyping, acquiring real-world data, and assessing fitness-for use.

Previous Section Next Section

SECTIONS

CHAPTER SECTIONS

sections

  1. Introduction
  2. Interoperability
  3. Data as a Surrogate for Clinical Phenomena
  4. Developing and Refining the Research Questions
  5. Specific Uses for EHR Data in PCTs
  6. Estimating and Identifying the Study Population and Assessing Baseline Prognostic Characteristics
  7. Implementing and Monitoring the Delivery of an Intervention
  8. Assessing Outcomes
  9. The Research Question Drives the Data Requirements
  10. Additional Resources

Resources

Electronic Health Records-Based Phenotyping

A resource chapter describing mechanisms for identifying and evaluating phenotype definitions, with a particular focus on standardization efforts from the Collaboratory

Assessing Fitness-for-Use of Real-World Data Sources

This Living Textbook chapter describes several approaches that can be used to facilitate assessments to determine whether data are fit for their intended use or purpose prior to their use in research settings

Acquiring Real-World Data

This Living Textbook chapter outlines strategies for obtaining real-world data for use in research.

 

REFERENCES

back to top

U.S. Food and Drug Administration. Guidance Document: E6(R2) Good Clinical Practice: Integrated Addendum to ICH E6(R1). 2018. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/e6r2-good-clinical-practice-integrated-addendum-ich-e6r1. Assessed December 3, 2025.

back to top


Version History

December 3, 2025: Updated hyperlinks (changes made by G. Uhlenbrauck).

October 7, 2025: Updated text as part of annual review (changes made by K. Staman).

July 3, 2020: Minor corrections to layout and formatting (changes made by D. Seils).

February 11, 2020: Added links and a sentence regarding Building Partnerships to Ensure a Successful Trial (changes made by K. Staman).

Published August 25, 2017

current section :

Developing and Refining the Research Questions

  1. Introduction
  2. Interoperability
  3. Data as a Surrogate for Clinical Phenomena
  4. Developing and Refining the Research Questions
  5. Specific Uses for EHR Data in PCTs
  6. Estimating and Identifying the Study Population and Assessing Baseline Prognostic Characteristics
  7. Implementing and Monitoring the Delivery of an Intervention
  8. Assessing Outcomes
  9. The Research Question Drives the Data Requirements
  10. Additional Resources

Citation:

Richesson R, Platt R, Simon G, et al. Using Electronic Health Record Data in Pragmatic Clinical Trials: Developing and Refining the Research Questions. In: Rethinking Clinical Trials: A Living Textbook of Pragmatic Clinical Trials. Bethesda, MD: NIH Pragmatic Trials Collaboratory. Available at: https://rethinkingclinicaltrials.org/chapters/design/using-electronic-health-record-data-pragmatic-clinical-trials-top/developing-refining-research-questions/. Updated December 3, 2025. DOI: 10.28929/032.

Footer Menu

  • How to Use This Site
  • About NIH Collaboratory
  • Enrollment Reporting
  • Grand Rounds
  • Funding Statement
Link to Twitter Link to LinkedIn Link to Blue Sky Link to NIH Collaboratory email

Reference in this Web site to any specific commercial products, process, service, manufacturer, or company does not constitute its endorsement or recommendation by the U.S. Government or National Institutes of Health (NIH). NIH is not responsible for the contents of any “off-site” Web page referenced from this server.

Log in
Privacy Statement
WordPress is a content management system and should not be used to upload any PHI as it is not an environment for which we exercise oversight, meaning you the author are responsible for the content you post. Please use this system accordingly. Site Map