Demo corpus. Scores are computed on a select set of biomedical paper/datasets and may be inaccurate for papers outside this corpus — DataRank relies on network effects that improve with scale. We aim to expand this into a fully open resource pending additional funding.

Enrichr: a comprehensive gene set enrichment analysis web server 2016 update

Nucleic Acids Research(2016)10.1093/nar/gkw377Source: DataRank Database

Enrichr: a comprehensive gene set enrichment analysis web server 2016 update is a dataset published in Nucleic Acids Research (2016). On theSindex it has a DataRank of 15.9, placing it in the top 11.1% of the data-sharing corpus. It has been cited 11,578 times, with 191 citing works in its 1-hop citation network. Its calibrated FAIR score is 39/100.

Top 11%percentile

15.9DataRank

15.9Top 11%

Dataset Open Access11578 citations · base score 9.3

Cite:

datarank_citation_only_1hop_v6· scope data_onlyMethodology

Abstract

Enrichment analysis is a popular method for analyzing gene sets generated by genome-wide experiments. Here we present a significant update to one of the tools in this domain called Enrichr. Enrichr currently contains a large collection of diverse gene set libraries available for analysis and download. In total, Enrichr currently contains 180 184 annotated gene sets from 102 gene set libraries. New features have been added to Enrichr including the ability to submit fuzzy sets, upload BED files, improved application programming interface and visualization of the results as clustergrams. Overall, Enrichr is a comprehensive resource for curated gene sets and a search engine that accumulates biological knowledge for further biological discoveries. Enrichr is freely available at: http://amp.pharm.mssm.edu/Enrichr.

›Data sources & pipeline

Pipeline:MetadataData-paper checkEnrichmentCitation networkScoring

Enrichment:Pending

FAIR Checklist

Context only (not used in score)

Findable (1/2)

Has DOI

Accessible (1/2)

Open Access

Interoperable (0/2)

Reusable (1/3)

Dataset classification

FAIR checklist signals are shown for context only and do not affect DataRank scoring.

39FAIR score

F Findable

A Accessible

I Interoperable

R Reusable

Top 80% by FAIRLLM-assessed✓ full text read

Calibrated FAIR score — a parallel quality metric, independent of the DataRank citation score. See the full evaluation →

DataRank Breakdown

Base Score 9%Citation Network 91%

Base Score Contribution

1.4

From this paper's citation signal

Citation Network Contribution

14.5

From 191 citing papers with measurable signal

Learn more about DataRank methodology →

Top 5 citers driving the network score

Ranked by citation count — the same ordering the engine uses when summing log1p(C_q) over citers.

Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
Proceedings of the National Academy of Sciences200555,906 citationsDataRank 1.6
KEGG: Kyoto Encyclopedia of Genes and Genomes
Nucleic Acids Research200038,911 citationsDataRank 25.4Top 2%
Metascape provides a biologist-oriented resource for the analysis of systems-level datasets
Nature Communications201915,474 citationsDataRank 14.7Top 13%
Integrating single-cell transcriptomic data across different conditions, technologies, and species
Nature Biotechnology201814,465 citationsDataRank 1.4
NCBI GEO: archive for functional genomics data sets—update
Nucleic Acids Research201210,702 citationsDataRank 20.2Top 5%

Why this DataRank?

DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 9% comes from its base citations and 91% from the citation network (191 citing papers contributed measurable signal).

Base score B(p): log1p(citation_count) — grows sub-linearly, so a paper with 1,000 citations is not 10× a paper with 100.
Network N(p): Σ over citers of log1p(C_q) ÷ max(outdegree_q, 1). Being cited by a highly-cited paper with few references counts most.
Damping factor d = 0.85: DataRank = (1−d)·B(p) + d·N(p) — the two cards above are each already multiplied by their share.
Self-citations excluded: Citers sharing any OpenAlex author ID with this paper are filtered out before the network sum.

Citers are pulled from OpenAlex sorted by cited_by_count:descand capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.

Read the full methodology →

Click a node to highlight its connections. Use scroll to zoom. Drag to pan.

Node colors:CenterData PaperData + Open AccessNon-dataSelected & links| Node size = percentile rank