NERDS 2025 : Abstracts

Abstracts

Keynote Session 1

K1.1 Title: The Science of Possibility: Discovery and Development of CFTR modulators to treat Cystic Fibrosis

Speaker: Carmen Bozic

Abstract: Cystic fibrosis (CF) is a rare, life-shortening genetic disease affecting approximately 109,000 people worldwide. Vertex has spent more than 25 years discovering and developing the first and only approved CFTR modulators that address the underlying cause of CF. Along this research and development journey, Vertex has serially innovated to find better medicines and new approaches towards the ultimate goal of restoring normal levels of CFTR function in people living with CF, including treating people earlier in life and those with less common disease-causing mutations. Our collaborations with academia, research organizations and nonprofit organizations help drive scientific innovation and the development of new medicines. To date, Vertex has brought five transformational CFTR modulator medicines to people living with CF, including most recently ALYFTREK. These medicines have fundamentally changed the course of this disease. Our research is ongoing as we continue to discover and develop medicines for potentially all people with CF, including for the approximately 5,000 people with CF who cannot benefit from CFTR modulators.

K1.2 Title: The Molecular Bases of Inherited cardiomyopathies: From genes to therapies

Speaker: Jonathan Seidman

Abstract: The ultimate opportunity presented by discovering the genetic basis of human disease is accurate prediction and disease prevention. To enable this achievement, genetic insights must enable the identification of at-risk individuals prior to end-stage disease and strategies that delay or prevent clinical expression. Unexplained cardiomyopathies, both hypertrophic (HCM) and dilated cardiomyopathy (DCM) provide a paradigm for fulfilling these opportunities. HCM is characterized by abnormal cardiac morphology - increase ventricular wall thickness with myocyte hypertrophy, disarray and fibrosis- and perturbed function -hyperdynamic contraction and poor relaxation. These abnormalities increase risk for atrial and ventricular arrhythmias, heart failure, and sudden cardiac death. Most HCM is caused by dominant-acting rare variants that occur in genes encoding protein components of the sarcomere, the contractile unit in striated muscles. Pathogenic variants predominate in two genes: damaging missense variants in MYH7 encoding the cardiac myosin heavy chain and loss of function variants in MYBPC3, encoding myosin-binding protein C. Defining the biophysical and molecular consequences of these pathogenic variants provided new insights into sarcomere biology and allowed insights that enabled the development of a novel small molecule therapeutic- Camzyos. Despite the benefits of small molecule therapy for HCM, inherited DCM therapy will likely require genetic engineering approaches.

Keynote Session 2

K2.1 Title: Target Learning with Application in Rare Disease Trials

Speaker: Mark van der Laan, Professor of Biostatistics, UC Berkeley

K2.2 Title: Patient-Centric Drug Development: Perspectives from the National Organization for Rare Disorders (NORD)

Speaker: Edward Neilan, MD, PhD, Chief Medical and Scientific Officer, NORD

Abstract: Founded in 1983, the National Organization for Rare Diseases (NORD) is a leading and innovative patient advocacy organization. Our mission is to improve the well-being of all patients with rare diseases through programs in advocacy, research, care, and education. In this talk, I will first briefly review the origins of NORD, the creation of the Orphan Drug Act, and NORD’s ongoing partnerships with >350 more-disease-specific NORD Member Organizations. Then I will attempt to summarize patient perspectives on the urgency of the unmet medicals needs that remain for rare diseases, and I will describe some of the programs NORD has to help empower rare disease patients and advocates to help advance research for their own diseases – including the online NORD “IAMRARE” Registry and Natural History Study Platform and NORD’s new Rare Disease Centers of Excellence network. Finally, I will offer my own thoughts on some of the existing (or proposed) “pathways” for rare disease drug development.

Session 1: Innovative Design and Methods in Rare Diseases

S1.1 Title: Bayesian Approaches for Information Borrowing and Adaptive Decisions

Speaker: Yuan Ji

Abstract: I will introduce Bayesian methods tailored for information borrowing from external source and sample size estimation/re-estimation that empowers adaptive decision making during the course a clinical trial. For information sharing, the idea is to apply Bayesian nonparametric and parametric models to match the distributions of covariates and responses, achieving subgroup matching and accounting for unmeasured confounders. The particular models are related to the Bayesian addictive regression trees (BART) and meta-analytic predictive (MAP) priors. For sample size estimation/re-estimation, the adaptation is centered at balancing the observed and predicted population summary, confidence in decision making, and (re-)estimated sample size. I will provide some numerical examples and case studies illustrating these powerful approaches.

S1.2 Title: FDA case study of Xenpozyme: A Clinical Dose Escalation Strategy for a Rare Disease Drug Program

Speaker: Qi Zhang

Abstract: On March 13, 2025 issue of FDA’s Prevision Policy, FDA highlighted the Sanofi’s acid sphingomyelinase deficiency (ASMD) treatment Xenpozyme (olipudase alfa) as a model that demonstrates the ability to conduct robust dose optimization work in the context of a rare disease development program. ASMD is a rare autosomal recessive lysosomal disorder that results in liver and respiratory failure, which are the primary causes of death in patients with ASMD. In this presentation, the study design and dose optimization strategy from Animal model with ASM Knockout mouse in non-clinical study, first-in-human single ascending dose (SAD) and multiple ascending dose (MAD) phase 1 studies, as well as pivotal phase 2/3 adult and phase 1/2 pediatric study will be discussed. Relevant regulatory discussions, final trial results and conclusions will also be shared. In addition, statistical considerations beyond dose optimization strategy will also be highlighted, particularly regarding the selection of primary endpoints, as well as utilization of modeling and external natural history data to support pediatric indication.

S1.3 Title: Efficient Platform design for screening indications in rare disease

Speaker: Ziqian Geng, PhD and Yibo Wang, PhD, AbbVie

Abstract: Clinical trials in rare diseases often face significant challenges due to low prevalence, making it difficult to recruit sufficient number of patients for adequate statistical power. This constraint calls for innovative design strategies that can maximize the efficiency under limited total sample sizes. Platform trial designs offer a promising solution by evaluating multiple investigational drugs simultaneously against a shared reference arm, thereby reducing the overall sample size required. In this presentation, we investigate a platform trial framework under a fixed total sample size, reflecting practical constraints in rare disease research. A key practical consideration is the staggered availability of investigational drugs — not all investigational drugs may be ready to be evaluated concurrently, and some may be dropped early from the platform due to futility or external factors. To address these dynamics, we develop and evaluate design strategies that determine when to add or drop arms, and more critically, how to optimize patient allocation across treatment arms over time to maximize statistical power for treatment comparisons. Our findings provide practical guidance for designing more informative and resource-efficient platform trials under real-world operational limitations.

S1.4 Title: Enhancing clinical trials through prognostic score covariate adjustment – opportunities & challenges in rare disease

Speaker: Roland Brown

Abstract: Prognostic scores (PS) can improve statistical efficiency in clinical trials by reducing the variance of treatment effect estimates via covariate adjustment, leading to trial derisking and cost savings. This potential illustrated through an example in Alzheimer’s disease where PS adjustment reduced treatment effect variance by 20%, yielding a power increase from 80% to 87%, followed by a discussion of opportunities and challenges for developing and implementing prognostic scores in the rare disease space.

Session 2: Gene and Cell Therapy

S2.1 Title: Modifier-directed therapeutics in Cystic Fibrosis

Speaker: Lisa Strug

Abstract: Cystic Fibrosis (CF) is a life-limiting recessive disease that affects multiple organs including the respiratory and digestive systems. It is caused by severe variants in the CF Transmembrane Conductance Regulator (CFTR). Disease-causing variants result in dysregulated epithelial ion transport. There are highly effective modulator treatments (HEMT) targeting CFTR variant protein folding and gating deficiencies (including the most common F508del variant) that are now approved for individuals with CF (pwCF). The greatest unmet medical need remains for pwCF with two null CFTR alleles, for whom there are no approved HEMT and they represent ~10% of the CF population globally. A mutation-agnostic therapeutic approach that enhances non-CFTR-mediated anion secretion across multiple CF-affected organs could benefit all pwCF.

Individuals with the same CF-causing CFTR genotype have variable disease severity and co-morbidities, presumed to be due, in part, to modifier genes. Modifier genes refer to polymorphisms that do not cause a genetic disease but influence the disease expression and severity and we have been spear-heading a paradigm of modifier-directed therapeutics as an alternative paradigm. Modifier genes can be identified through human population studies and we have carried out several genome-wide association studies (GWAS) for CF disease severity across the affected organs and in response to HEMT. But GWAS identifies loci rather than the responsible gene(s), relevant tissue(s) of origin or mechanism of action for a studied trait. To understand the mechanism by which a locus contributes to disease requires functional investigation, and this needs to be hypothesis-driven, in a relevant cellular model of the contributing gene. To bridge this gap and guide the design of functional studies we developed data integration methods for multi-omic technologies. We applied these to identify Solute Carrier 26 family member A9 (SLC26A9) as a common contributing modifier gene that interacts with CFTR and can enhance its function. I will describe these methods and the follow-up studies by us and the international community to assess the human genetic support for the target that motivated the identification of SLC26A9-agonists through a high throughput screen. These SLC26A9 modulators are now being optimized through a design-make-test-analyze cycle approach, integrating synthetic and computational chemistry, towards refined tools for clinical application and for improved understanding of the SLC26A9 role in CF pathogenesis.

S2.2 Title: Statistics in CDx Development for Gene Therapies

Speaker: Shuguang Huang

Abstract: Recombinant adeno-associated virus (AAV) vectors are the leading delivery vehicle used for in vivo gene therapies. There is a high prevalence of antibodies against the AAV capsid due to prior exposure to wild-type AAVs. Neutralizing AAV Abs could significantly reduce the effectiveness of the treatment by neutralizing the vector before it reaches the target cells. Recent FDA guidance strongly recommends considering development of a CDx when pre-existing antibodies to AAV are to be used as an exclusion criterion. Relying anti-drug antibody (ADA) assays, the CDx for AAV gene therapy products represents a unique category in the CDx development landscape. It involves some special challenges in analytical validation, ADA cutpoint determination, and IDE submission. This presentation will discuss how statisticians can contribute to the CDx development for gene therapy products.

S2.3 Title: A single arm confirmatory trial design for gene therapies

Speaker: Yang Song

Abstract: In the development of cell and gene therapies, a small, uncontrolled, and single arm study has often been employed for registration purpose. Reasons include a highly targeted magnitude of treatment effect, often with cure intent, rarity and severity of the disease associated with unmet medical needs, and the practical challenge to randomize and blind. With the need to conduct frequent interim analyses, formal statistical inference becomes a challenge. In this talk, we will discuss a single arm, open label, confirmatory study design with a small sample size, a large targeted effect size, and multiple interim analyses that is amenable to development of a cell or gene therapy. Formal statistical inference is possible with exact tests and a prescriptive procedure to perform interim analyses.

S2.4 Title: Model Informed Drug Development for Advanced Therapeutics

Speaker: Avery McIntosh

Abstract: Model Informed Drug Development (MIDD) is the practice of developing quantitative models from preclinical and clinical data to inform decision making throughout the drug development process. It is a general framework being implemented across pharmaceutical sponsors that has been a joint effort from industry and academia with support from regulators. MIDD methods can improve clinical trial efficiency, increase the probability of trial and regulatory success, and optimize dosing. MIDD for advanced therapeutics such as cell and gene therapies offers an opportunity where a comprehensive approach for integrating relevant data into quantitative models could greatly aid in the evaluation of safety and efficacy of these drugs. Recent papers in this area highlight the needs and approaches of MIDD in this space. There is a shared outlook from sponsors and regulators that MIDD approaches can provide insight and evidence for product safety and efficacy, and this talk will focus on the workflow and opportunities for MIDD practice in the evolving space of cell and gene therapies and other novel modalities.

S2.5 Discussant

Discussant: Daniel Li

Session 3: AI and Machine Learnings

S3.1 Title: Deep Learning Foundation Models for Medical Image Analysis

Speaker: Jake Gagnon

Abstract: In this talk, we will provide an overview of the current state of deep learning models for medical image analysis focusing on foundation models. These models are trained on a huge amount of unlabeled data allowing the model to extract general visual features. We will begin by describing the core components of foundation models such as choosing a model architecture, performing self-supervised learning, and applying adaptation approaches which allows the model to adapt to downstream imaging tasks. Furthermore, we will discuss some current benefits and challenges when using foundation models for medical image analysis. We will then conclude by illustrating a recent application of foundation models to pathology specifically in the area of rare cancer detection.

S3.2 Title: Artificial Intelligence in Rare Disease Research: Accelerating Relief for the Underserved

Speaker: Tim Smith

Abstract: Rare diseases affect over 300 million people worldwide, yet 95% lack approved treatments due to small patient populations, limited research funding, and prolonged diagnostic journeys. This presentation looks at how artificial intelligence and machine learning stand to transform rare disease research across three critical domains: revolutionizing drug discovery, accelerating accurate diagnosis, and optimizing treatment strategies. Using several examples, we examine how AI-powered diagnostic tools are reducing the average 5–7-year diagnostic odyssey. Additionally, AI/ML continues to improve treatment protocols and, we will look at how computational approaches will speed the identification of novel therapeutic targets and repurposing of existing drugs. In addition to current successes, the presentation will highlight implementation challenges including data scarcity and regulatory hurdles faced in the way of the transformative potential of AI to change the lives of underserved patient communities.

S3.3 Title: AI Co-scientist in Protein-disease Reasoning

Speaker: Bingxin Zhao

Abstract: The rapid expansion of large-scale proteomic resources has enabled the discovery of thousands of protein-phenotype associations. Yet, the scientific bottleneck lies in translating these vast summary-level statistics into structured biological insight and actionable knowledge. This talk introduces an AI co-scientist framework designed to scale the human reasoning process of protein-disease interpretation. Drawing inspiration from the way human researchers integrate statistical findings with prior scientific knowledge, our agentic system performs a multi-stage workflow: planning report sections, querying curated knowledge bases, reasoning over pathway enrichment and literature evidence, and generating text-based summaries. The system combines domain-specific tools with large language models in a united paradigm to automate knowledge synthesis at scale. We demonstrate applications across various diseases, highlighting how AI co-scientists can accelerate biomedical discovery, enhance reproducibility, and support drug development through more systematic and scalable scientific writing.

S3.4 Title: Navigating the Scientific Landscape: A Multi-Agent Approach to Research Synthesis

Speaker: Christian Merrill

Abstract: To address the challenge of information overload in scientific research, we present how one can use AI agents to automate literature curation and augment the discovery process. The system's core is a dual-mode AI agent architecture. For deep exploration, a single agent performs sequential, iterative analysis. For comprehensive synthesis, a multi-agent framework deploys specialized agents such as for analysis, verification, and discovering alternative perspectives to deconstruct research questions and work in parallel. This architecture supports both local and cloud-based Large Language Models, ensuring flexibility and robust performance. The agent framework is complemented by analytical tools designed for strategic insight. Researchers can uncover latent bibliographic and semantic connections through interactive network visualizations, and identify emerging research topics using an automated trend analysis dashboard with temporal clustering and forecasting. By integrating a sophisticated multi-agent system with these powerful tools, this work demonstrates a shift from passive information retrieval to an active, AI-augmented discovery process, enabling researchers to efficiently synthesize knowledge and strategically navigate the evolving scientific landscape.

Session 4: Real-world-evidence and HTA

S4.1 Title: Population-adjusted indirect comparisons in rare disease: methods, challenges and considerations

Speaker: Yingyi Liu

Abstract: Population-adjusted indirect comparisons (PAICs) have emerged as essential techniques for assessing the comparative effectiveness and safety of interventions, especially in cases of rare diseases where direct head-to-head trials are either unavailable or impractical. The most widely used PAIC methods include matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC), both of which have gained traction in health technology assessment (HTA) submissions. However, the application of PAIC in rare disease contexts presents unique challenges, including reliance on unanchored comparisons from single-arm trials, small sample sizes, and heterogeneity in patient characteristics. Unanchored indirect treatment comparisons face significant hurdles due to the lack of a shared comparator arm and the strict assumptions necessary for valid comparisons. The limited patient populations exacerbate typical small-sample issues, such as poor overlap between trial populations, difficulties in fitting robust regression models for STC, and reduced capability to account for all prognostic factors and effect modifiers. This talk will highlight these challenges and explore practical considerations for conducting PAICs. Furthermore, it will discuss innovative methods like two-stage MAIC, weight truncation, and STC with G-Computation to enhance the robustness of PAIC findings.

S4.2 Title: Integrating natural history insights into clinical development planning for gene therapy in a rare neurodegenerative disorder

Speaker: Alex Sverdlov

Abstract: The robust integration of natural history (NH) data is essential for designing efficient clinical trials for one-time gene therapies in rare neurodegenerative disorders. We present a pharmacometrics-informed clinical scenario evaluation framework (CSE-PMx) that formally incorporates quantitative NH models directly into the prospective trial planning process. This presentation showcases an application of the framework to optimize a hypothetical gene therapy trial in Autosomal-Recessive Spastic Ataxia of Charlevoix-Saguenay (ARSACS). Through in-silico evaluation, we demonstrate how insights from the ARSACS disease progression model can be used to compare competing design and analysis strategies, ultimately identifying a nonlinear mixed-effects model (NLMEM) as a more powerful approach. This methodology provides a quantitative roadmap for translating NH insights into more efficient clinical development programs, helping to transform and accelerate the path forward for rare disease therapies.

S4.3 Title: Beyond MMRM: The crucial role of targeted machine learning in RCTs with small sample size and loss to follow-up

Speaker: Susan Gruber

Abstract: Little knowledge of natural history of disease, heterogeneity in disease presentation and response to treatment, difficulty in enrolling and retaining patients in randomized controlled trials pose challenges to evaluating treatment effects in patients with a rare disease. Targeted learning provides a roadmap for addressing these challenges, and sophisticated methodologies for estimating causal effects of treatment even when there is informative loss to follow-up. Simulation studies based on actual patient data will compare performance of the MMRM method often used with longitudinal data and targeted machine learning using targeted maximum likelihood estimation and super learning (TMLE+SL).

S4.4 Title: Propensity score-based unequal matching for rare disease clinical trials with external controls

Speaker: Yusuke Yamaguchi

Abstract: Randomized controlled trials (RCTs) are considered the gold standard to evaluate the efficacy and safety of a new therapy for regulatory approval, though in rare diseases there are special situations where maintaining a concurrent control arm is unethical, impractical, or infeasible and can lead to increased patient burden and threaten the completion of the RCTs. Externally controlled designs are gaining a greater attention for their potential to overcome these challenges by supplementing or replacing the concurrent control arm in the RCTs with external data. A major challenge in the externally controlled designs is its inability to eliminate systematic imbalances in measured confounding factors between the current study patients and the external controls, which may cause a serious bias in treatment effect estimation. In these contexts, propensity score matching is known to help reduce systematic imbalances in measured confounders; however, it causes a challenge of unequal matching where fewer external controls are to be matched to more current study patients, leading to the case with a matching ratio of less than 1. This talk presents several approaches for the unequal matching, including an assignment algorithm, a genetic algorithm, and methods with searching all possible combinations. For example, the assignment algorithm is a method finding an optimal set of external controls while minimizing the overall absolute difference of the propensity score. Through a simulation study, we compare the methods in terms of their performance in the propensity score balancing and the treatment effect estimation (e.g., bias, type I error rate, and power).