Combining multi-population datasets for joint genome-wide association and meta-analyses: The case of bovine milk fat composition traits

Research output: Contribution to journal/Conference contribution in journal/Contribution to newspaperJournal articleResearchpeer-review

DOI

  • G Gebreyesus
  • A J Buitenhuis
  • N A Poulsen
  • M H P W Visker, Wageningen University and Research
  • ,
  • Q Zhang, College of Animal Science and Technology, China Agricultural University
  • ,
  • H J F van Valenberg, Wageningen University and Research
  • ,
  • D Sun, College of Animal Science and Technology, China Agricultural University
  • ,
  • H Bovenhuis, Wageningen University and Research

In genome-wide association studies (GWAS), sample size is the most important factor affecting statistical power that is under control of the investigator, posing a major challenge in understanding the genetics underlying difficult-to-measure traits. Combining data sets available from different populations for joint or meta-analysis is a promising alternative to increasing sample sizes available for GWAS. Simulation studies indicate statistical advantages from combining raw data or GWAS summaries in enhancing quantitative trait loci (QTL) detection power. However, the complexity of genetics underlying most quantitative traits, which itself is not fully understood, is difficult to fully capture in simulated data sets. In this study, population-specific and combined-population GWAS as well as a meta-analysis of the population-specific GWAS summaries were carried out with the objective of assessing the advantages and challenges of different data-combining strategies in enhancing detection power of GWAS using milk fatty acid (FA) traits as examples. Gas chromatography (GC) quantified milk FA samples and high-density (HD) genotypes were available from 1,566 Dutch, 614 Danish, and 700 Chinese Holstein Friesian cows. Using the joint GWAS, 28 additional genomic regions were detected, with significant associations to at least 1 FA, compared with the population-specific analyses. Some of these additional regions were also detected using the implemented meta-analysis. Furthermore, using the frequently reported variants of the diacylglycerol acyltransferase 1 (DGAT1) and stearoyl-CoA desaturase (SCD1) genes, we show that significant associations were established with more FA traits in the joint GWAS than the remaining scenarios. However, there were few regions detected in the population-specific analyses that were not detected using the joint GWAS or the meta-analyses. Our results show that combining multi-population data set can be a powerful tool to enhance detection power in GWAS for seldom-recorded traits. Detection of a higher number of regions using the meta-analysis, compared with any of the population-specific analyses also emphasizes the utility of these methods in the absence of raw multi-population data sets to undertake joint GWAS.

Original languageEnglish
JournalJournal of Dairy Science
Volume102
Issue12
Pages (from-to)11124-11141
ISSN0022-0302
DOIs
Publication statusPublished - Dec 2019

    Research areas

  • mega-analysis, meta-analysis, multi-population GWAS

See relations at Aarhus University Citationformats

ID: 167412193