Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer

Cell. 2018 Apr 5;173(2):291-304.e6. doi: 10.1016/j.cell.2018.03.022.

Abstract

We conducted comprehensive integrative molecular analyses of the complete set of tumors in The Cancer Genome Atlas (TCGA), consisting of approximately 10,000 specimens and representing 33 types of cancer. We performed molecular clustering using data on chromosome-arm-level aneuploidy, DNA hypermethylation, mRNA and miRNA expression levels, and reverse-phase protein arrays; all of these except aneuploidy revealed clustering primarily organized by histology, tissue type, or anatomic origin. The influence of cell type was evident in DNA-methylation-based clustering, even after excluding sites with known preexisting tissue-type-specific methylation. Integrative clustering further emphasized the dominant role of cell-of-origin patterns. Molecular similarities among histologically or anatomically related cancer types provide a basis for focused pan-cancer analyses, such as pan-gastrointestinal, pan-gynecological, pan-kidney, and pan-squamous cancers, and those related by stemness features, which in turn may inform strategies for future therapeutic development.

Keywords: TCGA; cancer; cell-of-origin; genome; methylome; organs; proteome; subtypes; tissues; transcriptome.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Aneuploidy
  • Chromosomes / genetics
  • Cluster Analysis
  • CpG Islands
  • DNA Methylation
  • Databases, Factual
  • Humans
  • MicroRNAs / metabolism
  • Mutation
  • Neoplasm Proteins / genetics
  • Neoplasm Proteins / metabolism
  • Neoplasms / genetics
  • Neoplasms / pathology*
  • RNA, Messenger / metabolism

Substances

  • MicroRNAs
  • Neoplasm Proteins
  • RNA, Messenger