Demo corpus. Scores are computed on a select set of biomedical paper/datasets and may be inaccurate for papers outside this corpus — DataRank relies on network effects that improve with scale. We aim to expand this into a fully open resource pending additional funding.

p-Curve and p-Hacking in Observational Research

PLOS ONE(2016)10.1371/journal.pone.0149144Source: DataRank Database

p-Curve and p-Hacking in Observational Research is a research paper published in PLOS ONE (2016). On theSindex it has a DataRank of 8.3. It has been cited 218 times, with 198 citing works in its 1-hop citation network.

N/A

8.3DataRank · unranked

8.3

Open Access218 citations · base score 5.4

Cite:

datarank_citation_only_1hop_v6· scope data_onlyMethodology

Abstract

The p-curve, the distribution of statistically significant p-values of published studies, has been used to make inferences on the proportion of true effects and on the presence of p-hacking in the published literature. We analyze the p-curve for observational research in the presence of p-hacking. We show by means of simulations that even with minimal omitted-variable bias (e.g., unaccounted confounding) p-curves based on true effects and p-curves based on null-effects with p-hacking cannot be reliably distinguished. We also demonstrate this problem using as practical example the evaluation of the effect of malaria prevalence on economic growth between 1960 and 1996. These findings call recent studies into question that use the p-curve to infer that most published research findings are based on true effects in the medical literature and in a wide range of disciplines. p-values in observational research may need to be empirically calibrated to be interpretable with respect to the commonly used significance threshold of 0.05. Violations of randomization in experimental studies may also result in situations where the use of p-curves is similarly unreliable.

›Data sources & pipeline

Pipeline:MetadataData-paper checkEnrichmentCitation networkScoring

Enrichment:Pending

FAIR Checklist

Context only (not used in score)

Findable (1/2)

Has DOI

Accessible (1/2)

Open Access

Interoperable (0/2)

Reusable (0/3)

FAIR checklist signals are shown for context only and do not affect DataRank scoring.

Run a calibrated FAIR evaluation for this paper →

DataRank Breakdown

Base Score 10%Citation Network 90%

Base Score Contribution

0.808

From this paper's citation signal

Citation Network Contribution

7.5

From 158 citing papers with measurable signal

Learn more about DataRank methodology →

Top 5 citers driving the network score

Ranked by citation count — the same ordering the engine uses when summing log1p(C_q) over citers.

Why Most Published Research Findings Are False
PLoS Medicine200510,409 citationsDataRank 1.4
Estimating the reproducibility of psychological science
Science20158,592 citationsDataRank 1.4
Why Most Discovered True Associations Are Inflated
Epidemiology20081,529 citationsDataRank 1.1
Contradicted and Initially Stronger Effects in Highly Cited Clinical Research
JAMA20051,467 citationsDataRank 1.1
Influence of Reported Study Design Characteristics on Intervention Effect Estimates From Randomized, Controlled Trials
Annals of Internal Medicine20121,055 citationsDataRank 1.0

Why this DataRank?

DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 10% comes from its base citations and 90% from the citation network (158 citing papers contributed measurable signal).

Base score B(p): log1p(citation_count) — grows sub-linearly, so a paper with 1,000 citations is not 10× a paper with 100.
Network N(p): Σ over citers of log1p(C_q) ÷ max(outdegree_q, 1). Being cited by a highly-cited paper with few references counts most.
Damping factor d = 0.85: DataRank = (1−d)·B(p) + d·N(p) — the two cards above are each already multiplied by their share.
Self-citations excluded: Citers sharing any OpenAlex author ID with this paper are filtered out before the network sum.

Citers are pulled from OpenAlex sorted by cited_by_count:descand capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.

Read the full methodology →

Click a node to highlight its connections. Use scroll to zoom. Drag to pan.

Node colors:CenterData PaperData + Open AccessNon-dataSelected & links| Node size = percentile rank