🏆 Finalist — NIH Data Sharing Index (“S-Index”) Challenge
Demo corpus. Scores are computed on a select set of biomedical papers and datasets and may be inaccurate for papers outside this corpus, because DataRank relies on network effects that improve with scale. We aim to expand this into a fully open resource pending additional funding.

LSTM: A Search Space Odyssey

IEEE Transactions on Neural Networks and Learning Systems (2017) · 10.1109/tnnls.2016.2582924 · Source: DataRank Database

LSTM: A Search Space Odyssey is a research paper published in IEEE Transactions on Neural Networks and Learning Systems (2017). On the S-Index it has a DataRank of 1.3. It has been cited 6,718 times.

DataRank 1.3 · unranked
Open Access · 6,718 citations · base score 8.8
datarank_citation_only_1hop_v6 · scope data_only · Methodology

Abstract

Several variants of the long short-term memory (LSTM) architecture for recurrent neural networks have been proposed since its inception in 1995. In recent years, these networks have become the state-of-the-art models for a variety of machine learning problems. This has led to a renewed interest in understanding the role and utility of various computational components of typical LSTM variants. In this paper, we present the first large-scale analysis of eight LSTM variants on three representative tasks: speech recognition, handwriting recognition, and polyphonic music modeling. The hyperparameters of all LSTM variants for each task were optimized separately using random search, and their importance was assessed using the powerful functional ANalysis Of VAriance framework. In total, we summarize the results of 5400 experimental runs (≈ 15 years of CPU time), which makes our study the largest of its kind on LSTM networks. Our results show that none of the variants can improve upon the standard LSTM architecture significantly, and demonstrate the forget gate and the output activation function to be its most critical components. We further observe that the studied hyperparameters are virtually independent and derive guidelines for their efficient adjustment.

Data sources & pipeline
Pipeline: Metadata → Data-paper check → Enrichment → Citation network → Scoring
Enrichment: Pending

FAIR Checklist

Context only (not used in score)
Findable (1/2)
  • Has DOI
Accessible (1/2)
  • Open Access
Interoperable (0/2)
Reusable (0/3)

      FAIR checklist signals are shown for context only and do not affect DataRank scoring.

      DataRank Breakdown

      Base Score 100% · Citation Network 0%

      Base Score Contribution

      1.3

      From this paper's citation signal

      Citation Network Contribution

      0

      Citation network not refreshed for this result

      This paper's DataRank is currently driven only by its base citation score. Citation network data was not refreshed for this result.

      Why this DataRank?

      DataRank blends this paper's own citation count with the influence of the papers that cite it. Here, roughly 100% comes from its base citations and 0% from the citation network.
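The percentage split quoted above can be reproduced from the two contribution values. A minimal sketch, assuming each share is simply a contribution divided by the sum of both; the function name `contribution_shares` is hypothetical and not part of DataRank:

```python
def contribution_shares(base_contribution, network_contribution):
    """Return (base %, network %) of the total DataRank.

    Assumption: both inputs are already weighted, i.e. (1 - d) * B(p)
    and d * N(p), as the breakdown cards on this page are.
    """
    total = base_contribution + network_contribution
    if total == 0:
        # No signal at all: report zero for both rather than divide by zero.
        return 0.0, 0.0
    return (base_contribution / total * 100,
            network_contribution / total * 100)


# This page's values: base contribution 1.3, network contribution 0.
base_pct, network_pct = contribution_shares(1.3, 0)
```

For this paper the network contribution is 0, so the whole score is attributed to the base citations.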

      Base score B(p)
      log1p(citation_count) — grows sub-linearly, so a paper with 1,000 citations is not 10× a paper with 100.
      Network N(p)
      Σ over citers q of log1p(C_q) ÷ max(outdegree_q, 1). Being cited by a highly cited paper with few references counts most.
      Damping factor d = 0.85
      DataRank = (1−d)·B(p) + d·N(p) — the two cards above are each already multiplied by their share.
      Self-citations excluded
      Citers sharing any OpenAlex author ID with this paper are filtered out before the network sum.
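The definitions above can be sketched in Python. This is an illustrative reading of the formula as stated on this page, not DataRank's actual code: the function names, the `(citation_count, outdegree)` pair layout for citers, and the reading of C_q as the citer's own citation count are all assumptions.

```python
import math

DAMPING = 0.85  # d, as stated on this page


def base_score(citation_count):
    """B(p) = log1p(citation_count), which grows sub-linearly."""
    return math.log1p(citation_count)


def network_score(citers):
    """N(p) = sum over citers q of log1p(C_q) / max(outdegree_q, 1).

    `citers` is an iterable of (citation_count, outdegree) pairs.
    This pair layout is an assumption made for the sketch.
    """
    return sum(math.log1p(c_q) / max(outdegree_q, 1)
               for c_q, outdegree_q in citers)


def datarank(citation_count, citers=(), d=DAMPING):
    """DataRank = (1 - d) * B(p) + d * N(p)."""
    return (1 - d) * base_score(citation_count) + d * network_score(citers)


# This paper: 6,718 citations, citation network not refreshed (no citers).
paper_base = base_score(6718)
paper_rank = datarank(6718)
```

With 6,718 citations and no citers supplied, the sketch gives a base score of about 8.8 and a DataRank of about 1.3, which agrees with the values shown on this page. It also illustrates the sub-linear growth: `base_score(1000)` is roughly 1.5 times `base_score(100)`, not 10 times.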

      Citers are pulled from OpenAlex sorted by cited_by_count:desc and capped per paper, so when the cap binds we keep the highest-signal references and the score is reproducible across reruns.
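The selection step can be sketched offline, without calling OpenAlex. This is a hedged illustration only: the `Citer` record, the `select_citers` name, the tie-break on work ID, and the order of operations (cap first, then drop self-citations) are assumptions, since the page does not specify them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Citer:
    work_id: str            # hypothetical identifier for the citing work
    cited_by_count: int     # citation count of the citing work
    author_ids: frozenset   # author IDs of the citing work


def select_citers(citers, paper_author_ids, cap):
    """Pick the citers that would feed the network sum.

    1. Sort by cited_by_count descending. Ties are broken by work_id so
       the result is deterministic (the tie-break is an assumption).
    2. Keep at most `cap` citers.
    3. Drop citers sharing any author ID with the paper (self-citations).
    """
    ranked = sorted(citers, key=lambda c: (-c.cited_by_count, c.work_id))
    kept = ranked[:cap]
    paper_authors = frozenset(paper_author_ids)
    return [c for c in kept if not (c.author_ids & paper_authors)]


# Made-up example data, not taken from the DataRank corpus.
example = [
    Citer("W1", 10, frozenset({"A1"})),
    Citer("W2", 500, frozenset({"A9"})),
    Citer("W3", 50, frozenset({"A2"})),
    Citer("W4", 5, frozenset({"A3"})),
]
selected = select_citers(example, {"A9"}, cap=3)
```

Sorting before capping is what makes the result stable across reruns: the same inputs always yield the same citers, regardless of the order in which they were fetched.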


      Authors (5)

      Klaus Greff, Rupesh K. Srivastava, Jan Koutník, Bas R. Steunebrink, Jürgen Schmidhuber

      Related Papers (10)

      New England Journal of Medicine (2020) · co-cited · 10.1056/nejmoa2027906
      IEEE Transactions on Pattern Analysis and Machine Intelligence (2018) · co-cited · 10.1109/tpami.2017.2699184
      New England Journal of Medicine (2020) · co-cited · 10.1056/nejmoa2034577
      Nature Reviews Genetics (2015) · co-cited · 10.1038/nrg3920
      IEEE Transactions on Pattern Analysis and Machine Intelligence (2013) · co-cited · 10.1109/tpami.2013.50
      Neural Computation (2006) · co-cited · 10.1162/neco.2006.18.7.1527
      Random Forests · Machine Learning (2001) · co-cited · 10.1023/a:1010933404324 · DataRank 1.8 · unranked
      Journal of Educational Psychology (1933) · co-cited · 10.1037/h0071325
      Proceedings of the IEEE (1998) · co-cited · 10.1109/5.726791
      New England Journal of Medicine (2021) · co-cited · 10.1056/nejmoa2035389