2026 Sequencing Strategies for Population and Cancer Epidemiology Studies (SeqSPACE) : Submission #6
Submission information
Submission Number: 6
Submission ID: 182394
Submission UUID: d89dabcb-d904-45e9-ab2b-5f0be5accf7e
Submission URI: /egrp/seqspaceabstracts
Submission View: /node/2144/webform/submissions/182394?token=vovR4hdGjAP2CsAmdFBA_NL-aL17yaUAwJckJNryBc4
Submission Update: /egrp/seqspaceabstracts?token=vovR4hdGjAP2CsAmdFBA_NL-aL17yaUAwJckJNryBc4
Created: Wed, 06/03/2026 - 13:04
Completed: Wed, 06/03/2026 - 13:04
Changed: Wed, 06/03/2026 - 13:04
Remote IP address: 10.208.28.77
Submitted by: Anonymous
Language: English
Is draft: No
Webform: seqspace (Abstracts)
| First Name | Taylor |
|---|---|
| Middle Initial | |
| Last Name | Head |
| Degree(s) | Ph.D., MSPH |
| Position/Title/Career Status | Postdoctoral Fellow |
| Organization | University of Texas MD Anderson Cancer Center |
| sthead@mdanderson.org | |
| Abstract Title | Leveraging long-read RNA-seq for isoform-level regulatory discovery and TWAS fine-mapping in breast cancer |
| Abstract Summary | Most expression quantitative trait locus (eQTL) and transcriptome-wide association study (TWAS) analyses rely on large, tissue-agnostic transcript annotations. These overlook tissue-specific isoform usage and can obscure the true regulatory mechanisms underlying disease-associated loci. Through long-read (LR) RNA-sequencing, we can directly observe full-length transcripts and define tissue-relevant isoforms. However, direct integration of de novo long-read annotations into genetic studies remains challenging due to limited sample sizes with incomplete transcript capture. To address this, we first evaluated how transcript annotation influences regulatory discovery in breast cancer. We quantified gene- and isoform-level expression in breast tumor (TCGA), healthy breast tissue, and cultured fibroblasts using standard GENCODE annotations, tissue-specific LR-derived annotations, and combined annotations. Across tissues, most eGenes were concordant between annotations, but approximately one-third of lead cis-eQTLs for shared eGenes differed. Isoform-level regulatory discovery was substantially more annotation-dependent: in healthy breast tissue, 46% of eIsoforms identified using LR-informed annotations were unique despite 93.7% being present in GENCODE. Although combined annotations expanded the transcript catalog by only 0.6–7.6%, 69% of significant isoform-trait associations were specific to a single annotation. These analyses uncovered candidate regulatory isoforms at established breast cancer risk loci that were missed using conventional transcriptome annotations. Motivated by this work, we next developed a Bayesian isoform fine-mapping framework using isoform-specific priors derived from LR evidence while accounting for structural similarity among transcripts. The model prioritizes tissue-relevant isoforms and reduces spurious prioritization of structurally redundant transcripts. In simulations, the approach improves power while maintaining appropriate type I error relative to existing TWAS fine-mapping methods. Together, these results demonstrate how LR data can be incorporated into regulatory mapping and TWAS fine-mapping to improve prioritization of candidate causal isoforms underlying disease-associated loci. |