Skip to content

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
  • Home
  • About
    • NIH Collaboratory
      • Coordinating Center
      • NIH Collaboratory Trials
      • Core Working Groups
      • Steering Committee
      • Distributed Research Network
      • Our Impact
    • Living Textbook
      • Table of Contents
      • How to Use This Site
  • Resources
    • Data and Resource Sharing
    • Training Resources
    • Tools for Researchers
    • Publications
    • Knowledge Repository
  • Webinar
  • Podcast
  • News
    • News Feed
    • Calendar
    • Subscribe
return to home
Subscribe to Newsletter go to twitter feed go to linkedin go to blue sky feed
Search
NIH Collaboratory
Living Textbook of
Pragmatic Clinical Trials

COVID-19 Resources

Access the latest information on COVID-19 for clinical researchers
home button

Rethinking Clinical Trials

A Living Textbook of Pragmatic Clinical Trials

  • Design
    • What is a Pragmatic Clinical Trial?
    • Decentralized Pragmatic Clinical Trials
    • Developing a Compelling Grant Application
    • Experimental Designs and Randomization Schemes
    • Endpoints and Outcomes
    • Analysis Plan
    • Using Electronic Health Record Data
    • Building Partnerships and Teams to Ensure a Successful Trial
    • Intervention Delivery and Complexity
    • Patient Engagement
  • Data, Tools & Conduct
    • Assessing Feasibility
    • Acquiring Real-World Data
    • Assessing Fitness-for-Use of Real-World Data
    • Study Startup
    • Participant Recruitment
    • Monitoring Intervention Fidelity and Adaptations
    • Patient-Reported Outcomes
    • Clinical Decision Support
    • Mobile Health
    • Electronic Health Records–Based Phenotyping
    • Navigating the Unknown
  • Dissemination & Implementation
    • Data Sharing and Embedded Research
    • Dissemination Approaches for Different Audiences
    • Implementation
    • End-of-Trial Decision-Making
  • Ethics & Regulatory
    • Privacy Considerations
    • Identifying Those Engaged in Research
    • Collateral Findings
    • Consent, Disclosure, and Non-Disclosure
    • Data and Safety Monitoring
    • Ethical Considerations of Data Sharing in Pragmatic Clinical Trials
    • Ethics for AI and ML
    • IRB Responsibilities and Procedures

Introduction

CHAPTER SECTIONS

Data Sharing and Embedded Research


Section 1

Introduction

Expand Contributors
Gregory E. Simon, MD, MPH
Gloria Coronado, PhD
Lynn L. DeBar, PhD, MPH
Laura M. Dember, MD
Beverly Green, MD, MPH
Susan S. Huang, MD, MPH
Jeffrey G. Jarvik, MD, MPH
Vincent Mor, PhD
Joakim Ramsberg, PhD
Edward J. Septimus, MD
Miguel A. Vazquez, MD
William M. Vollmer, PhD
Douglas Zatzick, MD
Adrian F. Hernandez, MD, MHS
Richard Platt, MD, MS

Contributing Editor
Karen Staman, MS

The contributors to this chapter initially wrote an opinion piece for Annals of Internal Medicine (Simon et al. 2017) on data sharing. In this chapter, we expand on the ideas presented there and frame them using lessons learned from the Collaboratory.

Video originally published in Annals of Internal Medicine (Simon et al. 2017) and used with permission.

Emerging policies and procedures for sharing analyzable research datasets hold great promise for increasing transparency and reproducibility in medical research. Expanded use of the data can increase the knowledge base through secondary analyses, decrease selective reporting, and lead to improvements in clinical care (Institute of Medicine 2015; Krumholz et al. 2016; Warren 2016; NIH Data Sharing Policy 2015). Enabling the responsible sharing of data is a global priority and a number of solutions have been proposed, including by the National Academy of Medicine (2015), the International Committee of Medical Journal Editors (ICMJE; Taichman et al. 2016) and the National Institutes of Health. In Europe, access to data from industry sponsored trials has increased markedly, and there are encouraging programs in the U.S., such as the Yale University Open Data Access (YODA) partnership with Medtronic (Krumholz et al. 2013), the Academic Research Organization Consortium for Continuing Evaluation of Scientific Studies — Cardiovascular (ACCESS CV 2016) the Supporting Open Access to Researchers (SOAR) Initiative (Pencina et al. 2016), and the OptumLabs healthcare industry collaborative research and innovation center.

While we enthusiastically support data sharing, the conceptual framework for it is rooted in individually randomized controlled trials (RCTs) with participants’ explicit informed consent, which can include authorization for data sharing. Pragmatic research embedded in health systems is different from conventional trials: it often involves a waiver of patient consent, uses data from the electronic health record (EHR), and often includes information that could identify patients, health care providers, and healthcare facilities or organizations. In some cases, the primary data for enrolled patients includes every encounter, medication, and procedure. As we describe in the Annals article, even if study data would not allow identification of individual participants, the potential for disclosure of sensitive information regarding providers or health systems may still be substantial. These data have the capacity to do harm if taken out of context, used inappropriately or for comparative purposes, or to single out an individual, provider or institution. Healthcare systems voluntarily participate in embedded research and have raised these concerns about releasing information from electronic health records. Their specific concern is that health systems or facilities volunteering to participate in research might be penalized by release of detailed operational information that others are not required to make public. Participation in public-domain research is distinct from health systems’ participation in public quality reporting programs, where measures are standardized and public comparison of providers or facilities is either required or a clear expectation of multiple organizations.

Because of the unique concerns of clinicians and healthcare systems participating in embedded research, a requirement to share data using mechanisms designed for conventional, individually randomized trials will be challenging and might dissuade some healthcare systems from participating, thereby reducing opportunities to answer important scientific and healthcare questions using data acquired from clinical health care delivery.

Embedding clinical trials in healthcare systems as part of the delivery of care could improve the speed, quality, and cost of research, but it is not currently necessary (or required) for the systems to participate, and it often imposes opportunity costs that can distract from operational priorities. Although the healthcare systems sometimes derive direct benefit from participation in research (e.g., when there is congruence with prioritized quality improvement efforts), their principal motive is typically an altruistic one—to contribute to the knowledge base about the relative benefits, risks, and burdens of treatments. In this respect, health system participants in pragmatic research are similar to individual participants in conventional clinical trials. For individuals who participate in clinical research, researchers offer guarantees through the informed consent process that sensitive information will not be misused and ensure that individual protected health information is not exposed through trial activities or data sharing. In the same way, pragmatic research needs to consider the specific confidentiality concerns of participating health care providers or systems and to identify appropriate processes and technical structures for data sharing. We use experiences from the NIH Collaboratory, which supports embedded clinical trials that address major national priorities, to explore the most important concerns and potential solutions in this chapter.

Next Section

SECTIONS

CHAPTER SECTIONS

sections

  1. Introduction
  2. Data Sharing Concerns
  3. Data Sharing Solutions for Embedded Research
  4. Patient Perspectives on Data Sharing
  5. Data-sharing Policy at the NIH, Collaboratory, and HEAL
  6. Incentive Structure and Citations for Data Sets
  7. Preparing for Data Sharing
  8. Moving Forward
  9. Additional Resources
  10. FAQ

Resources

Data and Resource Sharing Informational Document

Onboarding Data and Resource Sharing Questionnaire

Closeout Data and Resource Sharing Checklist

NIH Collaboratory Data Sharing Policy

NIH Data Sharing Policy and Implementation Guidance

Institute of Medicine. Sharing Clinical Trial Data: Maximizing Benefits, Minimizing Risks

Data Sharing Statements for Clinical Trials — A Requirement of the International Committee of Medical Journal Editors

REFERENCES

back to top

Institute of Medicine. 2015. Sharing Clinical Trial Data: Maximizing Benefits, Minimizing Risk. Washington, D.C: National Academies Press. https://doi.org/10.17226/18998.

Krumholz HM, Ross JS, Gross CP, et al. 2013. A historic moment for open science: the Yale University Open Data Access project and medtronic. Ann Intern Med. 158:910–911. doi:10.7326/0003-4819-158-12-201306180-00009. PMID:23778908.

Krumholz HM, Terry SF, Waldstreicher J. 2016. Data acquisition, curation, and use for a continuously learning health system. JAMA. 316:1669–1670. doi:10.1001/jama.2016.12537. PMID:27668668.

Pencina MJ, Louzao DM, McCourt BJ, et al. 2016. Supporting open access to clinical trial data for researchers: The Duke Clinical Research Institute–Bristol-Myers Squibb Supporting Open Access to Researchers Initiative. Am Heart J. 172:64–69. doi:10.1016/j.ahj.2015.11.002. PMID:26856217.

back to top

Simon GE, Corondo G, DeBar LL, et al. 2017. Data Sharing and Embedded Research. Ann Intern Med. doi:10.7326/M17-0863. PMID:28973353.

Taichman DB, Backus J, Baethge C, et al. 2016. Sharing clinical trial data: a proposal from the International Committee of Medical Journal Editors. Ann Intern Med. 164:505. doi:10.7326/M15-2928. PMID:26792258.

The Academic Research Organization Consortium for Continuing Evaluation of Scientific Studies — Cardiovascular (ACCESS CV). 2016. Sharing data from cardiovascular clinical trials — a proposal. New Engl J Med. 375:407–409. doi:10.1056/NEJMp1605260. PMID:27518659.

Warren E. 2016. Strengthening research through data sharing. New Engl J Med. 375:401–403. doi:10.1056/NEJMp1607282. PMID:27518656.


Version History

February 25, 2025: Updated hyperlinks (change made by G. Uhlenbrauck).

March 22, 2023: Updated hyperlinks (change made by G. Uhlenbrauck).

Published August 25, 2017

current section :

Introduction

  1. Introduction
  2. Data Sharing Concerns
  3. Data Sharing Solutions for Embedded Research
  4. Patient Perspectives on Data Sharing
  5. Data-sharing Policy at the NIH, Collaboratory, and HEAL
  6. Incentive Structure and Citations for Data Sets
  7. Preparing for Data Sharing
  8. Moving Forward
  9. Additional Resources
  10. FAQ

Citation:

Simon G, Coronado G, DeBar L, et al. Data Sharing and Embedded Research: Introduction. In: Rethinking Clinical Trials: A Living Textbook of Pragmatic Clinical Trials. Bethesda, MD: NIH Pragmatic Trials Collaboratory. Available at: https://rethinkingclinicaltrials.org/chapters/dissemination/data-share-top/data-sharing-and-embedded-research-introduction/. Updated September 25, 2025. DOI: 10.28929/068.

Footer Menu

  • How to Use This Site
  • About NIH Collaboratory
  • Enrollment Reporting
  • Grand Rounds
  • Funding Statement
Link to Twitter Link to LinkedIn Link to Blue Sky Link to NIH Collaboratory email

Reference in this Web site to any specific commercial products, process, service, manufacturer, or company does not constitute its endorsement or recommendation by the U.S. Government or National Institutes of Health (NIH). NIH is not responsible for the contents of any “off-site” Web page referenced from this server.

Log in
Privacy Statement
WordPress is a content management system and should not be used to upload any PHI as it is not an environment for which we exercise oversight, meaning you the author are responsible for the content you post. Please use this system accordingly. Site Map