Unsupervised detection of fragment length signatures of circulating tumor DNA using non-negative matrix factorization

Gabriel Renaud, Maibritt Nørgaard, Johan Lindberg, Henrik Grönberg, Bram De Laere, Jørgen Bjerggaard Jensen, Michael Borre, Claus Lindbjerg Andersen, Karina Dalsgaard Sørensen, Lasse Maretty, Søren Besenbacher

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

Abstract

Sequencing of cell-free DNA (cfDNA) is currently being used to detect cancer by searching both for mutational and non-mutational alterations. Recent work has shown that the length distribution of cfDNA fragments from a cancer patient can inform tumor load and type. Here, we propose non-negative matrix factorization (NMF) of fragment length distributions as a novel and completely unsupervised method for studying fragment length patterns in cfDNA. Using shallow whole-genome sequencing (sWGS) of cfDNA from a cohort of patients with metastatic castration-resistant prostate cancer (mCRPC), we demonstrate how NMF accurately infers the true tumor fragment length distribution as an NMF component - and that the sample weights of this component correlate with ctDNA levels (r=0.75). We further demonstrate how using several NMF components enables accurate cancer detection on data from various early stage cancers (AUC = 0.96). Finally, we show that NMF, when applied across genomic regions, can be used to discover fragment length signatures associated with open chromatin.

Original languageEnglish
Article numbere71569
JournaleLife
Volume11
Number of pages15
ISSN2050-084X
DOIs
Publication statusPublished - Jul 2022

Keywords

  • Biomarkers, Tumor/genetics
  • Cell-Free Nucleic Acids
  • Circulating Tumor DNA/genetics
  • Genomics/methods
  • Humans
  • Male
  • Mutation

Fingerprint

Dive into the research topics of 'Unsupervised detection of fragment length signatures of circulating tumor DNA using non-negative matrix factorization'. Together they form a unique fingerprint.

Cite this