Subsampling methods for genomic inference is a research paper published in The Annals of Applied Statistics (2010). On theSindex it has a DataRank of 4.5. It has been cited 68 times, with 64 citing works in its 1-hop citation network. Its calibrated FAIR score is 61/100.
Large-scale statistical analysis of data sets associated with\ngenome sequences plays an important role in modern biology. A\nkey component of such statistical analyses is the computation of\np-values and confidence bounds for statistics\ndefined on the genome. Currently such computation is commonly\nachieved through ad hoc simulation measures. The method of\nrandomization, which is at the heart of these simulation\nprocedures, can significantly affect the resulting statistical\nconclusions. Most simulation schemes introduce a variety of\nhidden assumptions regarding the nature of the randomness in the\ndata, resulting in a failure to capture biologically meaningful\nrelationships. To address the need for a method of assessing the\nsignificance of observations within large scale genomic studies,\nwhere there often exists a complex dependency structure between\nobservations, we propose a unified solution built upon a data\nsubsampling approach. We propose a piecewise stationary model\nfor genome sequences and show that the subsampling approach\ngives correct answers under this model. We illustrate the method\non three simulation studies and two real data examples.
FAIR checklist signals are shown for context only and do not affect DataRank scoring.
Calibrated FAIR score — a parallel quality metric, independent of the DataRank citation score. See the full evaluation →
Base Score Contribution
0.635
From this paper's citation signal
Citation Network Contribution
3.9
From 62 citing papers with measurable signal
Ranked by citation count — the same ordering the engine uses when summing log1p(Cq) over citers.
DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 14% comes from its base citations and 86% from the citation network (62 citing papers contributed measurable signal).
Citers are pulled from OpenAlex sorted by cited_by_count:descand capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.
Click a node to highlight its connections. Use scroll to zoom. Drag to pan.