TY - JOUR
T1 - Cancer LncRNA Census reveals evidence for deep functional conservation of long noncoding RNAs in tumorigenesis
AU - Carlevaro-Fita, Joana
AU - Lanzós, Andrés
AU - Feuerbach, Lars
AU - Hong, Chen
AU - Mas-Ponte, David
AU - Pedersen, Jakob Skou
AU - PCAWG Drivers and Functional Interpretation Group
AU - Bertl, Johanna
AU - Guo, Qianyun
AU - Hobolth, Asger
AU - Hornshøj, Henrik
AU - Juul, Randi Istrup
AU - Madsen, Tobias
AU - Nielsen, Morten Muhlig
AU - Pedersen, Jakob Skou
AU - Zhang, Yan
AU - Johnson, Rory
AU - PCAWG Consortium
PY - 2020/12/1
Y1 - 2020/12/1
N2 - Long non-coding RNAs (lncRNAs) are a growing focus of cancer genomics studies, creating the need for a resource of lncRNAs with validated cancer roles. Furthermore, it remains debated whether mutated lncRNAs can drive tumorigenesis, and whether such functions could be conserved during evolution. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we introduce the Cancer LncRNA Census (CLC), a compilation of 122 GENCODE lncRNAs with causal roles in cancer phenotypes. In contrast to existing databases, CLC requires strong functional or genetic evidence. CLC genes are enriched amongst driver genes predicted from somatic mutations, and display characteristic genomic features. Strikingly, CLC genes are enriched for driver mutations from unbiased, genome-wide transposon-mutagenesis screens in mice. We identified 10 tumour-causing mutations in orthologues of 8 lncRNAs, including LINC-PINT and NEAT1, but not MALAT1. Thus CLC represents a dataset of high-confidence cancer lncRNAs. Mutagenesis maps are a novel means for identifying deeply-conserved roles of lncRNAs in tumorigenesis.
AB - Long non-coding RNAs (lncRNAs) are a growing focus of cancer genomics studies, creating the need for a resource of lncRNAs with validated cancer roles. Furthermore, it remains debated whether mutated lncRNAs can drive tumorigenesis, and whether such functions could be conserved during evolution. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we introduce the Cancer LncRNA Census (CLC), a compilation of 122 GENCODE lncRNAs with causal roles in cancer phenotypes. In contrast to existing databases, CLC requires strong functional or genetic evidence. CLC genes are enriched amongst driver genes predicted from somatic mutations, and display characteristic genomic features. Strikingly, CLC genes are enriched for driver mutations from unbiased, genome-wide transposon-mutagenesis screens in mice. We identified 10 tumour-causing mutations in orthologues of 8 lncRNAs, including LINC-PINT and NEAT1, but not MALAT1. Thus CLC represents a dataset of high-confidence cancer lncRNAs. Mutagenesis maps are a novel means for identifying deeply-conserved roles of lncRNAs in tumorigenesis.
KW - ANNOTATION
KW - DATABASE
KW - EVOLUTION
KW - GENES
KW - GENOME BROWSER
KW - LANDSCAPE
KW - MALAT1
KW - PRINCIPLES
KW - REGULATOR
KW - TRANSCRIPTION
UR - http://www.scopus.com/inward/record.url?scp=85079071080&partnerID=8YFLogxK
U2 - 10.1038/s42003-019-0741-7
DO - 10.1038/s42003-019-0741-7
M3 - Journal article
C2 - 32024996
AN - SCOPUS:85079071080
SN - 2399-3642
VL - 3
JO - Communications Biology
JF - Communications Biology
IS - 1
M1 - 56
ER -