Aarhus University Seal

Zexi Cai

Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

Standard

Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle. / Cai, Zexi; Guldbrandtsen, Bernt; Lund, Mogens Sandø et al.
In: Genetics Selection Evolution, Vol. 51, No. 1, 20, 10.05.2019, p. 20.

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

Harvard

APA

CBE

MLA

Vancouver

Cai Z, Guldbrandtsen B, Lund MS, Sahana G. Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle. Genetics Selection Evolution. 2019 May 10;51(1):20. 20. doi: 10.1186/s12711-019-0463-9

Author

Bibtex

@article{068872503e7d4a30a25ab07794a640f7,
title = "Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle",
abstract = "BACKGROUND: Genome-wide association studies (GWAS) are widely used to identify regions of the genome that harbor genetic determinants of quantitative traits. However, the multiple-testing burden from scanning tens of millions of whole-genome sequence variants reduces the power to identify associated variants, especially if sample size is limited. In addition, factors such as inaccuracy of imputation, complex linkage disequilibrium structures, and multiple closely-located causal variants may result in an identified causative mutation not being the most significant single nucleotide polymorphism in a particular genomic region. Therefore, the use of information from different sources, particularly variant annotations, was proposed to enhance the fine-mapping of causal variants. Here, we tested whether applying significance thresholds based on variant annotation categories increases the power of GWAS compared with a flat Bonferroni multiple-testing correction.RESULTS: Whole-genome sequence variants in dairy cattle were categorized according to type and predicted impact. Then, GWAS between markers and 17 quantitative traits were analyzed for enrichment for association of each annotation category. By using annotation categories that were determined with the variants effect predictor software and datasets indicating regions of open chromatin, {"}low impact{"} variants were found to be highly enriched. Moreover, when the variants annotated as {"}modifier{"} and not located at open chromatin regions were further classified into different types of potential regulatory elements, the high impact variants, moderate impact variants, variants located in the 3' and 5' untranslated regions, and variants located in potential non-coding RNA regions exhibited relatively more enrichment. In contrast, a similar study on human GWAS data reported that enrichment of association signals was highest with high impact variants. We observed an increase in power when these variant category-based significance thresholds were applied for GWAS results on stature in Nordic Holstein cattle, as more candidate genes from previous large GWAS meta-analysis for cattle stature were confirmed.CONCLUSIONS: Use of variant category-based genome-wide significance thresholds can marginally increase the power to detect the candidate genes in cattle. With the continued improvements in annotation of the bovine genome, we anticipate that the growing usefulness of variant category-based significance thresholds will be demonstrated.",
keywords = "CANDIDATE GENES, DATABASE, EXPRESSION, GWAS, JOINT ANALYSIS, MASTITIS RESISTANCE, POSITIONAL DISTRIBUTION, TRAITS",
author = "Zexi Cai and Bernt Guldbrandtsen and Lund, {Mogens Sand{\o}} and Goutam Sahana",
year = "2019",
month = may,
day = "10",
doi = "10.1186/s12711-019-0463-9",
language = "English",
volume = "51",
pages = "20",
journal = "Genetics Selection Evolution",
issn = "0999-193X",
publisher = "BioMed Central Ltd.",
number = "1",

}

RIS

TY - JOUR

T1 - Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle

AU - Cai, Zexi

AU - Guldbrandtsen, Bernt

AU - Lund, Mogens Sandø

AU - Sahana, Goutam

PY - 2019/5/10

Y1 - 2019/5/10

N2 - BACKGROUND: Genome-wide association studies (GWAS) are widely used to identify regions of the genome that harbor genetic determinants of quantitative traits. However, the multiple-testing burden from scanning tens of millions of whole-genome sequence variants reduces the power to identify associated variants, especially if sample size is limited. In addition, factors such as inaccuracy of imputation, complex linkage disequilibrium structures, and multiple closely-located causal variants may result in an identified causative mutation not being the most significant single nucleotide polymorphism in a particular genomic region. Therefore, the use of information from different sources, particularly variant annotations, was proposed to enhance the fine-mapping of causal variants. Here, we tested whether applying significance thresholds based on variant annotation categories increases the power of GWAS compared with a flat Bonferroni multiple-testing correction.RESULTS: Whole-genome sequence variants in dairy cattle were categorized according to type and predicted impact. Then, GWAS between markers and 17 quantitative traits were analyzed for enrichment for association of each annotation category. By using annotation categories that were determined with the variants effect predictor software and datasets indicating regions of open chromatin, "low impact" variants were found to be highly enriched. Moreover, when the variants annotated as "modifier" and not located at open chromatin regions were further classified into different types of potential regulatory elements, the high impact variants, moderate impact variants, variants located in the 3' and 5' untranslated regions, and variants located in potential non-coding RNA regions exhibited relatively more enrichment. In contrast, a similar study on human GWAS data reported that enrichment of association signals was highest with high impact variants. We observed an increase in power when these variant category-based significance thresholds were applied for GWAS results on stature in Nordic Holstein cattle, as more candidate genes from previous large GWAS meta-analysis for cattle stature were confirmed.CONCLUSIONS: Use of variant category-based genome-wide significance thresholds can marginally increase the power to detect the candidate genes in cattle. With the continued improvements in annotation of the bovine genome, we anticipate that the growing usefulness of variant category-based significance thresholds will be demonstrated.

AB - BACKGROUND: Genome-wide association studies (GWAS) are widely used to identify regions of the genome that harbor genetic determinants of quantitative traits. However, the multiple-testing burden from scanning tens of millions of whole-genome sequence variants reduces the power to identify associated variants, especially if sample size is limited. In addition, factors such as inaccuracy of imputation, complex linkage disequilibrium structures, and multiple closely-located causal variants may result in an identified causative mutation not being the most significant single nucleotide polymorphism in a particular genomic region. Therefore, the use of information from different sources, particularly variant annotations, was proposed to enhance the fine-mapping of causal variants. Here, we tested whether applying significance thresholds based on variant annotation categories increases the power of GWAS compared with a flat Bonferroni multiple-testing correction.RESULTS: Whole-genome sequence variants in dairy cattle were categorized according to type and predicted impact. Then, GWAS between markers and 17 quantitative traits were analyzed for enrichment for association of each annotation category. By using annotation categories that were determined with the variants effect predictor software and datasets indicating regions of open chromatin, "low impact" variants were found to be highly enriched. Moreover, when the variants annotated as "modifier" and not located at open chromatin regions were further classified into different types of potential regulatory elements, the high impact variants, moderate impact variants, variants located in the 3' and 5' untranslated regions, and variants located in potential non-coding RNA regions exhibited relatively more enrichment. In contrast, a similar study on human GWAS data reported that enrichment of association signals was highest with high impact variants. We observed an increase in power when these variant category-based significance thresholds were applied for GWAS results on stature in Nordic Holstein cattle, as more candidate genes from previous large GWAS meta-analysis for cattle stature were confirmed.CONCLUSIONS: Use of variant category-based genome-wide significance thresholds can marginally increase the power to detect the candidate genes in cattle. With the continued improvements in annotation of the bovine genome, we anticipate that the growing usefulness of variant category-based significance thresholds will be demonstrated.

KW - CANDIDATE GENES

KW - DATABASE

KW - EXPRESSION

KW - GWAS

KW - JOINT ANALYSIS

KW - MASTITIS RESISTANCE

KW - POSITIONAL DISTRIBUTION

KW - TRAITS

U2 - 10.1186/s12711-019-0463-9

DO - 10.1186/s12711-019-0463-9

M3 - Journal article

C2 - 31077144

VL - 51

SP - 20

JO - Genetics Selection Evolution

JF - Genetics Selection Evolution

SN - 0999-193X

IS - 1

M1 - 20

ER -