Data dredging (also known as p-hacking, fishing expedition, or post-hoc data mining) refers to the misuse of statistical analysis by performing multiple comparisons or subgroup analyses until a statistically significant result is found β often by chance alone β without a pre-specified hypothesis.
π§ In scientific critique, data dredging implies: Searching for significance rather than testing a meaningful, pre-defined question.
Multiple subgroup analyses without correction for multiple testing.
Highlighting spurious associations as if they were genuine discoveries.
Retrospective storytelling: interpreting random patterns as clinically or biologically relevant.
𧨠Why itβs problematic: Increases the false positive rate.
Produces results that cannot be replicated.
Undermines the credibility of the research.
Often used to inflate the perceived value of weak or null findings.
π Example (in a critical review): βThe authors engage in data dredging, slicing the dataset until statistical significance emerges β not to test a hypothesis, but to justify one retroactively.β