Data Dredging

Data dredging (also known as p-hacking, fishing expedition, or post-hoc data mining) refers to the misuse of statistical analysis by performing multiple comparisons or subgroup analyses until a statistically significant result is found — often by chance alone — without a pre-specified hypothesis.

🧠 In scientific critique, data dredging implies: Searching for significance rather than testing a meaningful, pre-defined question.

Multiple subgroup analyses without correction for multiple testing.

Highlighting spurious associations as if they were genuine discoveries.

Retrospective storytelling: interpreting random patterns as clinically or biologically relevant.

🧨 Why it’s problematic: Increases the false positive rate.

Produces results that cannot be replicated.

Undermines the credibility of the research.

Often used to inflate the perceived value of weak or null findings.

📉 Example (in a critical review): “The authors engage in data dredging, slicing the dataset until statistical significance emerges — not to test a hypothesis, but to justify one retroactively.”

  • data_dredging.txt
  • Last modified: 2025/06/18 19:28
  • by administrador