TY - JOUR
T1 - Imputation of ancient human genomes
AU - Sousa da Mota, Bárbara
AU - Rubinacci, Simone
AU - Cruz Dávalos, Diana Ivette
AU - G. Amorim, Carlos Eduardo
AU - Sikora, Martin
AU - Johannsen, Niels Nørkjær
AU - Szmyt, Marzena
AU - Włodarczak, Piotr
AU - Szczepanek, Anita
AU - Przybyła, Marcin M.
AU - Schroeder, Hannes
AU - Allentoft, Morten E.
AU - Willerslev, Eske
AU - Malaspinas, Anna Sapfo
AU - Delaneau, Olivier
PY - 2023/12
Y1 - 2023/12
N2 - Due to postmortem DNA degradation and microbial colonization, most ancient genomes have low depth of coverage, hindering genotype calling. Genotype imputation can improve genotyping accuracy for low-coverage genomes. However, it is unknown how accurate ancient DNA imputation is and whether imputation introduces bias to downstream analyses. Here we re-sequence an ancient trio (mother, father, son) and downsample and impute a total of 43 ancient genomes, including 42 high-coverage (above 10x) genomes. We assess imputation accuracy across ancestries, time, depth of coverage, and sequencing technology. We find that ancient and modern DNA imputation accuracies are comparable. When downsampled at 1x, 36 of the 42 genomes are imputed with low error rates (below 5%) while African genomes have higher error rates. We validate imputation and phasing results using the ancient trio data and an orthogonal approach based on Mendel’s rules of inheritance. We further compare the downstream analysis results between imputed and high-coverage genomes, notably principal component analysis, genetic clustering, and runs of homozygosity, observing similar results starting from 0.5x coverage, except for the African genomes. These results suggest that, for most populations and depths of coverage as low as 0.5x, imputation is a reliable method that can improve ancient DNA studies.
AB - Due to postmortem DNA degradation and microbial colonization, most ancient genomes have low depth of coverage, hindering genotype calling. Genotype imputation can improve genotyping accuracy for low-coverage genomes. However, it is unknown how accurate ancient DNA imputation is and whether imputation introduces bias to downstream analyses. Here we re-sequence an ancient trio (mother, father, son) and downsample and impute a total of 43 ancient genomes, including 42 high-coverage (above 10x) genomes. We assess imputation accuracy across ancestries, time, depth of coverage, and sequencing technology. We find that ancient and modern DNA imputation accuracies are comparable. When downsampled at 1x, 36 of the 42 genomes are imputed with low error rates (below 5%) while African genomes have higher error rates. We validate imputation and phasing results using the ancient trio data and an orthogonal approach based on Mendel’s rules of inheritance. We further compare the downstream analysis results between imputed and high-coverage genomes, notably principal component analysis, genetic clustering, and runs of homozygosity, observing similar results starting from 0.5x coverage, except for the African genomes. These results suggest that, for most populations and depths of coverage as low as 0.5x, imputation is a reliable method that can improve ancient DNA studies.
KW - DNA, Ancient
KW - Genome, Human/genetics
KW - Genome-Wide Association Study/methods
KW - Genotype
KW - Genotyping Techniques/methods
KW - Humans
KW - Polymorphism, Single Nucleotide
UR - http://www.scopus.com/inward/record.url?scp=85162255706&partnerID=8YFLogxK
U2 - 10.1038/s41467-023-39202-0
DO - 10.1038/s41467-023-39202-0
M3 - Journal article
C2 - 37339987
SN - 2041-1723
VL - 14
SP - 1
EP - 17
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 3660
ER -