Evaluating Bioinformatics Processing of Somatic Variant Detection in cfDNA Using Targeted Sequencing with UMIs

Yixin Lin, Mads Heilskov Rasmussen, Mikkel Hovden Christensen, Amanda Frydendahl, Lasse Maretty, Claus Lindbjerg Andersen, Søren Besenbacher*

*Corresponding author for this work

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

Abstract

Circulating tumor DNA (ctDNA) is a promising cancer biomarker, but accurately detecting tumor mutations in cell-free DNA (cfDNA) is challenging due to their low frequency and sequencing errors. Our study benchmarked Mutect2, VarScan2, shearwater, and DREAMS-vc using deep targeted sequencing of cfDNA with Unique Molecular Identifiers (UMIs) from 111 colorectal cancer patients. Performance was assessed at both the mutation level (distinguish tumor variants from errors) and the sample level (detect if an individual has cancer). Additionally, we investigated the effects of various UMI grouping and consensus strategies. The shearwater-AND variant calling method demonstrated the highest precision in detecting tumor-derived mutations from plasma, and reached the highest ROC-AUC of 0.984 for sample classification in tumor-informed cfDNA analyses. DREAMS-vc exhibited the highest ROC-AUC of 0.808 for sample classification in tumor-agnostic studies. We also found that sequencing depth differences in PBMCs could lead to false positives, particularly with VarScan2 and Mutect2, which was addressed by downsampling to equivalent mean depths. Additionally, network-based UMI grouping methods outperformed those using identical UMIs when all reads were retained. Our findings emphasize that the optimal variant caller depends on the study context—whether focused on mutation or sample classification, and whether conducted under tumor-informed or tumor-agnostic conditions.

Original languageEnglish
Article number11439
JournalInternational Journal of Molecular Sciences
Volume25
Issue21
ISSN1661-6596
DOIs
Publication statusPublished - Nov 2024

Keywords

  • benchmarking
  • cancer sample classification
  • cell-free DNA
  • low-frequency variant calling
  • UMI sequencing

Fingerprint

Dive into the research topics of 'Evaluating Bioinformatics Processing of Somatic Variant Detection in cfDNA Using Targeted Sequencing with UMIs'. Together they form a unique fingerprint.

Cite this