P-value

A test of significance

Authors
Affiliations

Doctor of Physical Therapy

B.S. in Kinesiology

Doctor of Physical Therapy

B.A. in Neuroscience

Definition

  • p-value is the tail probability calculated using a test statistic1

Calculation

To define it formally, let us use an example. A psychologist was interested in estimating the average fluid intelligence (Gf) in a specific age group. Suppose Gf follows a normal distribution, and we denote Xi as the Gf score for an individual i ̨ f1; 2; :::g, then

  • i.i.d: Indicates independent and identically distributed

Significance level (type I error), type II error, and power. The significance level (type I error or α) is a predetermined value (say 0.05), which quantifies the probability of observing extreme values given that the null hypothesis is true (red shades). The type II error (or b) quantifies the probability of failing to reject the null hypothesis given that the alternative hypothesis is true (blue shades). The power (or 1-\beta) quantifies the probability of rejecting the null hypothesis given that the alternative hypothesis is true (dashed shades). The value (1-\alpha) quantifies the probability of failing to reject the null hypothesis when it is true (represented by the white area, not completely shown, under the null hypothesis curve)1

Significance level (type I error), type II error, and power. The significance level (type I error or α) is a predetermined value (say 0.05), which quantifies the probability of observing extreme values given that the null hypothesis is true (red shades). The type II error (or b) quantifies the probability of failing to reject the null hypothesis given that the alternative hypothesis is true (blue shades). The power (or \(1-\beta\)) quantifies the probability of rejecting the null hypothesis given that the alternative hypothesis is true (dashed shades). The value (\(1-\alpha\)) quantifies the probability of failing to reject the null hypothesis when it is true (represented by the white area, not completely shown, under the null hypothesis curve)1

p hacking

“We define p hacking as taking inappropriate steps, whether consciously or innocently, to obtain significant p value(s) in science. Compared with misuses discussed below, which are not completely unfounded but not ideal, p hacking is, in general, inappropriate statistical practice, and one should strive to avoid it.”1

References

1.
Chén OY, Bodelet JS, Saraiva RG, et al. The roles, challenges, and merits of the p value. Patterns (New York, NY). 2023;4(12):100878. doi:10.1016/j.patter.2023.100878

Citation

For attribution, please cite this work as: