Skip to main navigation Skip to search Skip to main content

A deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns

  • PCAWG Tumor Subtypes and Clinical Translation Working Group
  • , PCAWG Consortium
  • Ontario Institute for Cancer Research
  • University of Toronto
  • Vector Institute
  • Broad Institute
  • Harvard University
  • Icahn School of Medicine at Mount Sinai
  • Massachusetts General Hospital
  • University of Zagreb
  • Hartwig Medical Foundation
  • Utrecht University
  • Spanish National Cancer Research Centre (CNIO)
  • University of Glasgow
  • Glasgow Royal Infirmary
  • University of New South Wales
  • University of California at Los Angeles
  • Cambridge University Hospitals NHS Foundation Trust
  • University of Cambridge
  • Wellcome Trust Genome Campus
  • Cornell University
  • Dana-Farber Cancer Institute
  • University of Melbourne
  • University of North Carolina at Chapel Hill
  • University of Edinburgh
  • National Cancer Center Japan
  • University of Texas MD Anderson Cancer Center
  • Oregon Health and Science University
  • Sage Bionetworks
  • University of California at San Francisco
  • University of Bern
  • The University of Tokyo
  • Kiel University
  • Ulm University
  • Barcelona Institute of Science and Technology (BIST)
  • Pompeu Fabra University
  • European Molecular Biology Laboratory

Research output: Contribution to journalArticlepeer-review

170 Citations (Scopus)

Abstract

In cancer, the primary tumour’s organ of origin and histopathology are the strongest determinants of its clinical behaviour, but in 3% of cases a patient presents with a metastatic tumour and no obvious primary. Here,as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we train a deep learning classifier to predict cancer type based on patterns of somatic passenger mutations detected in whole genome sequencing (WGS) of 2606 tumours representing 24 common cancer types produced by the PCAWG Consortium. Our classifier achieves an accuracy of 91% on held-out tumor samples and 88% and 83% respectively on independent primary and metastatic samples, roughly double the accuracy of trained pathologists when presented with a metastatic tumour without knowledge of the primary. Surprisingly, adding information on driver mutations reduced accuracy. Our results have clinical applicability, underscore how patterns of somatic passenger mutations encode the state of the cell of origin, and can inform future strategies to detect the source of circulating tumour DNA.

Original languageEnglish
Article number728
JournalNature Communications
Volume11
Issue number1
DOIs
Publication statusPublished - 1 Dec 2020

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'A deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns'. Together they form a unique fingerprint.

Cite this