Demo corpus. Scores are computed on a select set of biomedical paper/datasets and may be inaccurate for papers outside this corpus — DataRank relies on network effects that improve with scale. We aim to expand this into a fully open resource pending additional funding.

Finishing the euchromatic sequence of the human genome

Nature(2004)10.1038/nature03001Source: DataRank Database

Finishing the euchromatic sequence of the human genome is a dataset published in Nature (2004). On theSindex it has a DataRank of 20.0, placing it in the top 5.2% of the data-sharing corpus. It has been cited 4,903 times, with 200 citing works in its 1-hop citation network. Its calibrated FAIR score is 65/100.

Top 5%percentile

20.0DataRank

20.0Top 5%

Dataset4903 citations · base score 8.5

Cite:

datarank_citation_only_1hop_v6· scope data_onlyMethodology

›Data sources & pipeline

Pipeline:MetadataData-paper checkEnrichmentCitation networkScoring

Enrichment:Pending

FAIR Checklist

Context only (not used in score)

Findable (1/2)

Has DOI

Accessible (0/2)

Interoperable (0/2)

Reusable (1/3)

Dataset classification

FAIR checklist signals are shown for context only and do not affect DataRank scoring.

65FAIR score

F Findable

100

A Accessible

I Interoperable

R Reusable

Top 5% by FAIRdeterministic✓ full text read

Calibrated FAIR score — a parallel quality metric, independent of the DataRank citation score. See the full evaluation →

DataRank Breakdown

Base Score 6%Citation Network 94%

Base Score Contribution

1.3

From this paper's citation signal

Citation Network Contribution

18.7

From 200 citing papers with measurable signal

Learn more about DataRank methodology →

Top 5 citers driving the network score

Ranked by citation count — the same ordering the engine uses when summing log1p(C_q) over citers.

Canu: scalable and accurate long-read assembly via adaptive <i>k</i> -mer weighting and repeat separation
Genome Research20178,078 citationsDataRank 1.3
A map of human genome variation from population-scale sequencing
Nature20108,067 citationsDataRank 32.4Top 1%
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project
Nature20075,248 citationsDataRank 15.6Top 12%
GENCODE: The reference human genome annotation for The ENCODE Project
Genome Research20125,021 citationsDataRank 17.2Top 9%
The complete sequence of a human genome
Science20223,274 citationsDataRank 8.5Top 24%

Why this DataRank?

DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 6% comes from its base citations and 94% from the citation network (200 citing papers contributed measurable signal).

Base score B(p): log1p(citation_count) — grows sub-linearly, so a paper with 1,000 citations is not 10× a paper with 100.
Network N(p): Σ over citers of log1p(C_q) ÷ max(outdegree_q, 1). Being cited by a highly-cited paper with few references counts most.
Damping factor d = 0.85: DataRank = (1−d)·B(p) + d·N(p) — the two cards above are each already multiplied by their share.
Self-citations excluded: Citers sharing any OpenAlex author ID with this paper are filtered out before the network sum.

Citers are pulled from OpenAlex sorted by cited_by_count:descand capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.

Read the full methodology →

Click a node to highlight its connections. Use scroll to zoom. Drag to pan.

Node colors:CenterData PaperData + Open AccessNon-dataSelected & links| Node size = percentile rank