Grand Rounds February 13, 2026: The Making of the COMPARE-Pediatric IBD Study (Michael D. Kappelman, MD, MPH)

Speaker

Michael D. Kappelman, MD, MPH
Professor, Pediatric Gastroenterology
University of North Carolina at Chapel Hill

Keywords

PCORnet; PCORI; Inflammatory Bowel Disease; Pediatrics; Common Data Model; Study Design

Key Points

  • Inflammatory Bowel Disease (IBD) is a chronic gastrointestinal condition affecting roughly 100,000 youth in the United States. It has a profound impact on nutrition, growth, physical, and psychosocial development. Anti-TNF biologics are the only FDA-approved advanced therapies for children, and approximately 30% of patients experience treatment failure within 2 years. There’s an urgent need for comparative effectiveness research that can guide treatment decisions when anti-TNF fails.
  • COMPARE-Pediatrics IBD, a PCORnet® study, includes 2 parallel multi-center, prospective cohort studies and retrospective cohort studies. The former, developed with multi-stakeholder input, will compare the effectiveness of emerging therapies in children with IBD; the latter will characterize the safety of these treatments and explore the heterogeneity of treatment effects across subgroups.
  • The study is utilizing PCORnet’s® infrastructure, including Prep-to-Research Queries and the PCORnet® Common Data Model (CDM), to inform the study design; identify administrative efficiencies; support recruitment; ease site burden; assess representativeness of the study population; and otherwise bolster their research.

Discussion Themes

Planning a PCORnet® study is a lot of work (and takes time). Start the process early and know that benefits may be on the back-end.

The study team opted not to conduct a randomized pragmatic trial because they anticipated that desperate families would be reluctant “roll the dice” with randomization and because insurance coverage for expensive off-label medications often dictates which therapy a patient can receive.

While the CDM is effective for structured data (like labs and diagnoses), Dr. Kappelman noted it cannot yet capture nuanced interpretations, such as specific MRI findings, which require more advanced AI or manual review.

Grand Rounds November 21, 2025: TransCelerate and Tufts CSDD Uncover Opportunities to Rethink Data Collection and Optimize Protocol Design (Laura Galuchie, BS; Zachary Smith, MA)

Speakers

Laura Galuchie, BS
Senior Director, Global Clinical Development
Merck & Co, Inc.

Zachary Smith, MA
Assistant Director, Data Sciences & Analytics
Tufts Center for the Study of Drug Development
Tufts University School of Medicine

Keywords

Data; Optimization; Data Collection; Protocol Design

Key Points

  • The TransCelerate Initiative – comprising a group of pharmaceutical companies with research and development organizations – seeks to identify key considerations in protocol design to optimize procedures and their frequency, while providing tools and a value-based framework for internal evaluation.
  • Optimized data collection can improve patient and site experience, reduce complexity, enhance trial execution through better design decisions, and maintain (or potentially improve) quality.
  • A seasoned approach to data collection is timely, as the volume of data is rising (and increasingly exceeds that which is needed). Additionally, recent ICH and ethics updates emphasize fit-for-purpose data and eliminating unnecessary complexity in clinical trials.
  • The TransCelerate-Tufts Center for Study of Drug Development (CSDD) partnership was borne of the need for continued tangible, actionable evidence to demonstrate the opportunity to optimize data collection.
  • In 2024, they workshopped a data collection instrument and 14 companies collected and provided data. Tufts CSDD conducted data quality checks to ensure accuracy, validity, and completeness and conducted a comprehensive quality control process. Data analysis took place in early 2025. Endpoints were defined as “core” and “non-core” based on procedure type.
  • The study sought to quantify the collection and use of non-core and extraneous core protocol data; gather updated benchmarks on the amount, purpose, and impact of data collected in clinical trials; and identify ways to improve protocol design by reducing complexity and easing the burden on sites and participants.
  • The research team found that the mean number of datapoints collected has exploded in the last decade, up from 930,000 in 2012 to nearly 6 million in 2025. More than 1/3 of all data collected comes from non-core and non-essential procedures.
  • Non-core and other non-essential procedures contribute to 25-30% of total participants and site burden. Note that there may be other benefits to some non-essential procedures; for example, making sure patients are heard through site questionnaires.
  • The analysis provides empirical evidence encouraging protocol design discussion and a shift towards more intentional and fit-for-purpose data collection strategies. Planning frameworks and collection assessment tools can reduce unnecessary burden on patients, sites, regulators, and other stakeholders, as well as help sponsors critically assess what data are collected and why.

Discussion Themes

When looking at the factors that contributed to the overcollection of data, the study team found that no one function or department was responsible for the majority of the data points and procedures; the distribution of contributing factors was diffuse. It’s an equal-opportunity problem.

Factors driving non-core data collection included teams’ fear of being asked for data they hadn’t collected by regulators and a lack of on-site experience amongst functional areas. In the latter case, teams are focused on their objectives and lack perspective on how data collection translates into the patient experience, the site experience, or an impact on another function within the group.

In addition to financial costs, there are time costs associated with data collection. The study team found a direct correlation between the amount of data and complexity of a trial – and as complexity increased, the time it took to conduct the trial also increased.

Grand Rounds October 17, 2025: Making Effective Use of Data Infrastructure in PCORnet® (Charles Bailey, MD, PhD; Keith Marsolo, PhD)

Speakers

Charles Bailey, MD, PhD
Department of Pediatrics
Perelman School of Medicine
University of Pennsylvania
Biomedical and Health Informatics
Children’s Hospital of Philadelphia

Keith Marsolo, PhD
Associate Professor
Department of Population Health Sciences
Duke Clinical Research Institute
Duke University School of Medicine

Keywords

PCORnet®; Data; Clinical Research Network; Patient-Centered Research; Common Data Model

Key Points

  • PCORnet® is a clinical research network that connects communities (namely providers; researchers; patients, caregivers, and advocates) and data (EHR, claims, and patient-reported). It functions as a learning health system to help researchers generate answers that advance health outcomes.
  • The network is made up of healthcare institutions, from large academic health centers to local community clinics. As of August 2025, PCORnet® had collated data from healthcare encounters in all 50 states, representing over 47 million people. There had been 57 PCORnet® studies and 991 publications supported by PCORnet® resources.
  • To be useful, data have to be standardized across systems. Frequent data curation and a single language enabled by the PCORnet® Common Data Model (CDM) facilitates this. Data that are in the CDM and currently available for use in research include demographics, diagnoses, and vital signs. Data that may or may not be in the CDM and require additional work for research include immunizations, social determinants of health, and patient-reported outcomes.
  • Quarterly, the PCORnet® team executes a data curation process. This includes a range of checks looking at data completeness; plausibility; persistence; and conformance to the PCORnet® CDM. Over the last decade, PCORnet® network performance has improved in terms of data mapping and latency.
  • Researchers can approach the PCORnet® Front Door with both simple univariate and bivariate statistical questions – i.e. how often a particular medication is used within the PCORnet® population – and with prep-to-research queries, which may identify an eligible population and generate some information about how that population behaves.
  • Once a team is running a PCORnet® study, they can submit queries for study-specific data extracts. This involves identifying a cohort and extracting patient-level data.
  • In the near future, PCORnet® will include additional data visualization options to increase the ease of navigating complex results. The team is also working on a Query Tools repository that will show what other people have already asked about a given set of data.
  • Because each study operates on specific variables and general characteristics do not predict specific characteristics, study-focused assessment of data fitness is critical.
  • The presenters walked attendees through 5 different PCORnet® studies and how they utilized this data infrastructure in their projects.

Discussion Themes

There is no charge for Front Door queries; they are part of the research engagement process. However, prep-to-research queries are limited to those that can be turned around in a reasonable period of time; they don’t extend to statistical modeling or requests that involve asking sites to get new kinds of data. At the pilot level, researchers can execute custom queries that provide a deeper look at the data.

Linkage partners will depend on the needs of a study. For example, PCORnet® Studies have linked to claims data from Centers for Medicare & Medicaid Services, registries that collect lived experience information, and commercial vendors that perform specialty lab or image testing.

An advantage of using PCORnet® for pragmatic and prospective trials is the connection with the health system, local investigators, and data experts. These can serve as valuable resources during the design, recruitment, and analysis stages.