Frequently used bioinformatics tools overestimate the damaging effect of allelic variants

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

We selected two sets of naturally occurring human missense allelic variants within innate immune genes. The first set represented eleven non-synonymous variants in six different genes involved in interferon (IFN) induction, present in a cohort of patients suffering from herpes simplex encephalitis (HSE) and the second set represented sixteen allelic variants of the IFNLR1 gene. We recreated the variants in vitro and tested their effect on protein function in a HEK293T cell based assay. We then used an array of 14 available bioinformatics tools to predict the effect of these variants upon protein function. To our surprise two of the most commonly used tools, CADD and SIFT, produced a high rate of false positives, whereas SNPs&GO exhibited the lowest rate of false positives in our test. As the problem in our test in general was false positive variants, inclusion of mutation significance cutoff (MSC) did not improve accuracy.

Original languageEnglish
JournalGenes and Immunity
Volume20
Issue1
Pages (from-to)10-22
Number of pages13
ISSN1466-4879
DOIs
Publication statusPublished - Jan 2019

    Research areas

  • Child, Computational Biology/standards, Encephalitis, Herpes Simplex/genetics, False Positive Reactions, Female, Genetic Testing/standards, Genome-Wide Association Study/standards, HEK293 Cells, Humans, Male, Mutation, Missense, Polymorphism, Single Nucleotide, Receptors, Cytokine/genetics, Software/standards

See relations at Aarhus University Citationformats

ID: 120063416