Sources

Provenance-first source cards

Every record keeps its source databases, citation identifiers, links, and generation date visible. Seed catalog freshness: Feb 4, 2026.

7,383 records

cBioPortal

Somatic mutation observations from public cancer genomics studies.

Used for cohort frequencies, sample counts, study links, and gene-level mutation context.

Study-level ascertainment, tumor purity, panel design, and cohort composition can shift apparent recurrence.
Source site
7,383 records

TCGA Pan-Cancer Atlas

Public harmonized cancer cohort context represented through cBioPortal-derived records.

Used as a high-level public cohort backbone for cancer-type observations.

The atlas presents research cohorts, not prospective clinical testing populations.
Source site
3,499 records

Cancer Hotspots

Recurrent amino-acid positions observed across cancer cohorts.

Used as hotspot evidence and a prioritization signal for variant pages.

Hotspot status is biological evidence, not a therapy recommendation by itself.
Source site
88,250 records

ClinVar

Variant-level clinical significance assertions and identifier context.

Used for clinical-significance labels and variation links where present.

ClinVar assertions can change and may include germline-oriented evidence that needs context.
Source site
0 records

OncoTree

Cancer type vocabulary and code mapping.

Used to make cancer-type labels and route groupings consistent.

Some source cohort labels include study suffixes and are intentionally preserved for provenance.
Source site
0 records

IntOGen

Driver gene knowledgebase referenced by the seed build metadata.

Kept as a provenance hook for driver-gene enrichment.

The current atlas does not redistribute a full IntOGen driver table.
Source site
0 records

CIViC

Open variant interpretation evidence model.

Supported as a future enrichment target and modeled in schema-compatible fields.

CIViC evidence is not bulk-loaded in this v1 static build unless present in seed records.
Source site
0 records

ClinicalTrials.gov

Trial identifier surface for therapy and variant context.

Supported through NCT fields and links where available.

No trial eligibility inference is performed.
Source site
0 records

FDA Oncology Notices

Public regulatory approval context.

Supported as an indication field and future data adapter.

The current drug labels are not approval claims unless the indication field is populated.
Source site