Member-only story

Abuses of Power: Persistent Problems of Post-Hoc Power

A post-hoc power calculation offers no new information.

Anthony B. Masters
3 min readJan 25, 2025

Statistical power is a concept in null hypothesis significance testing. It is the probability a test rejects the null hypothesis, making the ‘correct’ decision. In that situation, this is when the test gets a ‘significant’ result.

There is a table showing the ‘correct’ and ‘incorrect’ decisions under null hypothesis significance testing. The calculation of power is shown on a distribution, assuming the alternate hypothesis holds.
(Image: Alison Yuhan Yao/Towards Data Science. The bottom-right box should say: “False Negative”.)

Statistical power concerns study design. Clinical trials need to be big enough for researchers to find there is a treatment effect if one exists. Such a study designed to have “80% power” will reject the null hypothesis in four of five trials. Sample size calculations form an important part of planning a study or experiment.

One practice is the calculation of “post-hoc power”. This is also called “achieved”, “retrospective”, or “observed” power. This calculation claims to estimate the test’s power, given an observed effect size. A 2019 paper published in the Journal of Surgical Research focuses on post-hoc power. The paper suggested adding this calculation to statistical guidelines. Other journals support this approach when publishing analyses. Common software tools like SPSS also feature “post-hoc power”.

Statisticians have documented many times post-hoc power calculations are not useful. The problem is the p-value determines “observed” power. Post-hoc…

--

--

Anthony B. Masters
Anthony B. Masters

Written by Anthony B. Masters

This blog looks at the use of statistics in Britain and beyond. It is written by RSS Statistical Ambassador and Chartered Statistician @anthonybmasters.

No responses yet