🏆 Finalist — NIH Data Sharing Index (“S-Index”) Challenge
Demo corpus. Scores are computed on a select set of biomedical paper/datasets and may be inaccurate for papers outside this corpus — DataRank relies on network effects that improve with scale. We aim to expand this into a fully open resource pending additional funding.

Growth rates of modern science: a latent piecewise growth curve approach to model publication numbers from established and new literature databases

Humanities and Social Sciences Communications(2021)10.1057/s41599-021-00903-wSource: DataRank Database

Growth rates of modern science: a latent piecewise growth curve approach to model publication numbers from established and new literature databases is a dataset published in Humanities and Social Sciences Communications (2021). On theSindex it has a DataRank of 1.5, placing it in the top 38.1% of the data-sharing corpus. It has been cited 334 times, with 39 citing works in its 1-hop citation network. Its calibrated FAIR score is 30/100.

Top 38%percentile
1.5DataRank
1.5Top 38%
Dataset Open Access334 citations · base score 3.7
Cite:
datarank_citation_only_1hop_v6· scope data_onlyMethodology

Abstract

AbstractGrowth of science is a prevalent issue in science of science studies. In recent years, two new bibliographic databases have been introduced, which can be used to study growth processes in science from centuries back: Dimensions from Digital Science and Microsoft Academic. In this study, we used publication data from these new databases and added publication data from two established databases (Web of Science from Clarivate Analytics and Scopus from Elsevier) to investigate scientific growth processes from the beginning of the modern science system until today. We estimated regression models that included simultaneously the publication counts from the four databases. The results of the unrestricted growth of science calculations show that the overall growth rate amounts to 4.10% with a doubling time of 17.3 years. As the comparison of various segmented regression models in the current study revealed, models with four or five segments fit the publication data best. We demonstrated that these segments with different growth rates can be interpreted very well, since they are related to either phases of economic (e.g., industrialization) and/or political developments (e.g., Second World War). In this study, we additionally analyzed scientific growth in two broad fields (Physical and Technical Sciences as well as Life Sciences) and the relationship of scientific and economic growth in UK. The comparison between the two fields revealed only slight differences. The comparison of the British economic and scientific growth rates showed that the economic growth rate is slightly lower than the scientific growth rate.

Data sources & pipeline
Pipeline:MetadataData-paper checkEnrichmentCitation networkScoring
Enrichment:Pending

FAIR Checklist

Context only (not used in score)
Findable (1/2)
  • Has DOI
Accessible (1/2)
  • Open Access
Interoperable (0/2)
    Reusable (1/3)
    • Dataset classification

    FAIR checklist signals are shown for context only and do not affect DataRank scoring.

    30FAIR score
    F Findable
    20
    A Accessible
    55
    I Interoperable
    13
    R Reusable
    33
    Top 88% by FAIRLLM-assessed✓ full text read

    Calibrated FAIR score — a parallel quality metric, independent of the DataRank citation score. See the full evaluation →

    DataRank Breakdown

    Base Score 38%Citation Network 62%

    Base Score Contribution

    0.557

    From this paper's citation signal

    Citation Network Contribution

    0.908

    From 26 citing papers with measurable signal

    Learn more about DataRank methodology →

    Top 5 citers driving the network score

    Ranked by citation count — the same ordering the engine uses when summing log1p(Cq) over citers.

    Why this DataRank?

    DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 38% comes from its base citations and 62% from the citation network (26 citing papers contributed measurable signal).

    Base score B(p)
    log1p(citation_count) — grows sub-linearly, so a paper with 1,000 citations is not 10× a paper with 100.
    Network N(p)
    Σ over citers of log1p(Cq) ÷ max(outdegreeq, 1). Being cited by a highly-cited paper with few references counts most.
    Damping factor d = 0.85
    DataRank = (1−d)·B(p) + d·N(p) — the two cards above are each already multiplied by their share.
    Self-citations excluded
    Citers sharing any OpenAlex author ID with this paper are filtered out before the network sum.

    Citers are pulled from OpenAlex sorted by cited_by_count:descand capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.

    Read the full methodology →

    Click a node to highlight its connections. Use scroll to zoom. Drag to pan.

    Node colors:CenterData PaperData + Open AccessNon-dataSelected & links| Node size = percentile rank

    Authors (4)

    Robin HaunschildORCID,Rüdiger MutzORCID,Ruediger Mutz,Lutz BornmannORCID

    Related Papers (10)

    Journal of the Association for Information Science and Technology(2015)
    co-cited
    10.1002/asi.23329
    Scientometrics(2019)
    co-cited
    10.1007/s11192-019-03140-w
    Learned Publishing(2023)
    co-cited
    10.1002/leap.1544
    International Journal of Selection and Assessment(2005)
    co-cited
    10.1111/j.1468-2389.2005.00326.x
    Journal of Applied Physics(1961)
    co-cited
    10.1063/1.1736034