Dataset decay and the problem of sequential analyses on open datasets

eLife7.70
Volume: 9
Published: May 19, 2020
Abstract
Open data allows researchers to explore pre-existing datasets in new ways. However, if many researchers reuse the same dataset, multiple statistical testing may increase false positives. Here we demonstrate that sequential hypothesis testing on the same dataset by multiple researchers can inflate error rates. We go on to discuss a number of correction procedures that can reduce the number of false positives, and the challenges associated with...
Paper Details
Title
Dataset decay and the problem of sequential analyses on open datasets
Published Date
May 19, 2020
Journal
Volume
9
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.