Institut for Biomedicin

Lars Bolund

SMAP: a streamlined methylation analysis pipeline for bisulfite sequencing

Publikation: Bidrag til tidsskrift/Konferencebidrag i tidsskrift /Bidrag til avisTidsskriftartikelForskningpeer review

DOI

  • Shengjie Gao
  • ,
  • Dan Zou, Danmark
  • Likai Mao
  • ,
  • Quan Zhou, Danmark
  • Wenlong Jia, Danmark
  • Yi Huang, Danmark
  • Shancen Zhao
  • ,
  • Gang Chen
  • ,
  • Song Wu, Danmark
  • Dongdong Li, Danmark
  • Fei Xia, Danmark
  • Huafeng Chen, Danmark
  • Maoshan Chen, Danmark
  • Torben F Ørntoft
  • Lars Bolund
  • Karina D Sørensen

BACKGROUND: DNA methylation has important roles in the regulation of gene expression and cellular specification. Reduced representation bisulfite sequencing (RRBS) has prevailed in methylation studies due to its cost-effectiveness and single-base resolution. The rapid accumulation of RRBS data demands well designed analytical tools.

FINDINGS: To streamline the data processing of DNA methylation from multiple RRBS samples, we present a flexible pipeline named SMAP, whose features include: (i) handling of single-and/or paired-end diverse bisulfite sequencing data with reduced false-positive rates in differentially methylated regions; (ii) detection of allele-specific methylation events with improved algorithms; (iii) a built-in pipeline for detection of novel single nucleotide polymorphisms (SNPs); (iv) support of multiple user-defined restriction enzymes; (v) conduction of all methylation analyses in a single-step operation when well configured.

CONCLUSIONS: Simulation and experimental data validated the high accuracy of SMAP for SNP detection and methylation identification. Most analyses required in methylation studies (such as estimation of methylation levels, differentially methylated cytosine groups, and allele-specific methylation regions) can be executed readily with SMAP. All raw data from diverse samples could be processed in parallel and 'packetized' streams. A simple user guide to the methylation applications is also provided.

OriginalsprogEngelsk
TidsskriftGigaScience
Vol/bind4
Sider (fra-til)29
ISSN2047-217X
DOI
StatusUdgivet - 2015

Se relationer på Aarhus Universitet Citationsformater

ID: 90139258