Repurposing RNA Sequencing for Discovery of RNA Modifications in Clinical Cohorts. (Sci Adv, Aug 2021)

Kar-Tong Tan 1 2 3Ling-Wen Ding 4Chan-Shuo Wu 1Daniel G Tenen 5 6Henry Yang 5 7

Affiliations

1Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore.
2Biological and Biomedical Sciences Program, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA.
3Department of Computer Science, School of Computing, National University of Singapore, Singapore, Singapore.
4Department of Pathology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
5Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore. daniel.tenen@nus.edu.sg csiyangh@nus.edu.sg.
6Harvard Stem Cell Institute, Harvard Medical School, Boston, MA, USA.
7Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.

Abstract

The study of RNA modifications in large clinical cohorts can reveal relationships between the epitranscriptome and human diseases, although this is especially challenging. We developed ModTect (https://github.com/ktan8/ModTect), a statistical framework to identify RNA modifications de novo by standard RNA-sequencing with deletion and mis-incorporation signals. We show that ModTect can identify both known (N 1-methyladenosine) and previously unknown types of mRNA modifications (N 2,N 2-dimethylguanosine) at nucleotide-resolution. Applying ModTect to 11,371 patient samples and 934 cell lines across 33 cancer types, we show that the epitranscriptome was dysregulated in patients across multiple cancer types and was additionally associated with cancer progression and survival outcomes. Some types of RNA modification were also more disrupted than others in patients with cancer. Moreover, RNA modifications contribute to multiple types of RNA-DNA sequence differences, which unexpectedly escape detection by Sanger sequencing. ModTect can thus be used to discover associations between RNA modifications and clinical outcomes in patient cohorts.