TY - JOUR
T1 - Non-coding cancer driver candidates identified with a sample- and position-specific model of the somatic mutation rate
AU - Rasmussen, Malene Juul
AU - Bertl, Johanna
AU - Guo, Qianyun
AU - Nielsen, Morten Muhlig
AU - Świtnicki, Michał
AU - Hornshøj, Henrik
AU - Madsen, Tobias
AU - Hobolth, Asger
AU - Pedersen, Jakob Skou
PY - 2017/3/31
Y1 - 2017/3/31
N2 - Non-coding mutations may drive cancer development. Statistical detection of non- coding driver regions is challenged by a varying mutation rate and uncertainty of functional impact. Here, we develop a statistically founded non-coding driver-detection method, ncdDetect, which includes sample-specific mutational signatures, long-range mutation rate variation, and position-specific impact measures. Using ncdDetect, we screened non-coding regulatory regions of protein-coding genes across a pan-cancer set of whole-genomes (n = 505), which top-ranked known drivers and identified new candidates. For individual candidates, presence of non-coding mutations associates with altered expression or decreased patient survival across an independent pan-cancer sample set (n = 5454). This includes an antigen-presenting gene (CD1A), where 5’UTR mutations correlate significantly with decreased survival in melanoma. Additionally, mutations in a base-excision-repair gene (SMUG1) correlate with a C-to-T mutational-signature. Overall, we find that a rich model of mutational heterogeneity facilitates non-coding driver identification and integrative analysis points to candidates of potential clinical relevance.
AB - Non-coding mutations may drive cancer development. Statistical detection of non- coding driver regions is challenged by a varying mutation rate and uncertainty of functional impact. Here, we develop a statistically founded non-coding driver-detection method, ncdDetect, which includes sample-specific mutational signatures, long-range mutation rate variation, and position-specific impact measures. Using ncdDetect, we screened non-coding regulatory regions of protein-coding genes across a pan-cancer set of whole-genomes (n = 505), which top-ranked known drivers and identified new candidates. For individual candidates, presence of non-coding mutations associates with altered expression or decreased patient survival across an independent pan-cancer sample set (n = 5454). This includes an antigen-presenting gene (CD1A), where 5’UTR mutations correlate significantly with decreased survival in melanoma. Additionally, mutations in a base-excision-repair gene (SMUG1) correlate with a C-to-T mutational-signature. Overall, we find that a rich model of mutational heterogeneity facilitates non-coding driver identification and integrative analysis points to candidates of potential clinical relevance.
KW - Journal Article
UR - http://www.scopus.com/inward/record.url?scp=85019709734&partnerID=8YFLogxK
U2 - 10.7554/eLife.21778
DO - 10.7554/eLife.21778
M3 - Journal article
C2 - 28362259
SN - 2050-084X
VL - 6
JO - eLife
JF - eLife
M1 - e21778
ER -