🏆 Finalist — NIH Data Sharing Index (“S-Index”) Challenge
Demo corpus. Scores are computed on a select set of biomedical paper/datasets and may be inaccurate for papers outside this corpus — DataRank relies on network effects that improve with scale. We aim to expand this into a fully open resource pending additional funding.

Multiclass cancer diagnosis using tumor gene expression signatures

Proceedings of the National Academy of Sciences(2001)10.1073/pnas.211566398Source: DataRank Database

Multiclass cancer diagnosis using tumor gene expression signatures is a research paper published in Proceedings of the National Academy of Sciences (2001). On theSindex it has a DataRank of 1.1. It has been cited 2,044 times.

N/A
1.1DataRank · unranked
1.1
Open Access2044 citations · base score 7.6
Cite:
datarank_citation_only_1hop_v6· scope data_onlyMethodology

Abstract

The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.

Data sources & pipeline
Pipeline:MetadataData-paper checkEnrichmentCitation networkScoring
Enrichment:Pending

FAIR Checklist

Context only (not used in score)
Findable (1/2)
  • Has DOI
Accessible (1/2)
  • Open Access
Interoperable (0/2)
    Reusable (0/3)

      FAIR checklist signals are shown for context only and do not affect DataRank scoring.

      DataRank Breakdown

      Base Score 100%Citation Network 0%

      Base Score Contribution

      1.1

      From this paper's citation signal

      Citation Network Contribution

      0

      Citation network not refreshed for this result

      This paper's DataRank is currently driven only by its base citation score. Citation network data was not refreshed for this result.

      Learn more about DataRank methodology →
      Why this DataRank?

      DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 100% comes from its base citations and 0% from the citation network.

      Base score B(p)
      log1p(citation_count) — grows sub-linearly, so a paper with 1,000 citations is not 10× a paper with 100.
      Network N(p)
      Σ over citers of log1p(Cq) ÷ max(outdegreeq, 1). Being cited by a highly-cited paper with few references counts most.
      Damping factor d = 0.85
      DataRank = (1−d)·B(p) + d·N(p) — the two cards above are each already multiplied by their share.
      Self-citations excluded
      Citers sharing any OpenAlex author ID with this paper are filtered out before the network sum.

      Citers are pulled from OpenAlex sorted by cited_by_count:descand capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.

      Read the full methodology →

      Authors (17)

      Pablo TamayoORCID,Ryan Rifkin,Sayan MukherjeeORCID,Chen-Hsiang Yeang,Michael Angelo

      Related Papers (10)

      Proceedings of the National Academy of Sciences(2001)
      co-citedsame journal
      10.1073/pnas.191502998
      Proceedings of the National Academy of Sciences(2001)
      co-citedsame journal
      10.1073/pnas.091062498
      Journal of Computational Biology(2000)
      co-cited
      10.1089/106652700750050961
      Proceedings of the National Academy of Sciences(2000)
      co-citedsame journal
      10.1073/pnas.220392197
      The use and analysis of microarray data
      N/A
      0.927DataRank · unranked
      Nature Reviews Drug Discovery(2002)
      co-cited
      10.1038/nrd961