2021 May 19. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2021 Jan–.
In research, statistical significance is a measure of the probability of the null hypothesis being true compared to the acceptable level of uncertainty regarding the true answer. If we break apart a study design, we can better understand statistical significance.
When creating a study, the researcher has to start with a hypothesis; that is, they must have some idea of what they think the outcome may be. We will use the example of a new medication to lower blood pressure. The researcher hypothesizes that the new medication lowers systolic blood pressure by at least 10 mmHg compared to not taking the new medication. The hypothesis can be then stated, "Taking the new medication will lower systolic blood pressure by at least 10 mmHg compared to not taking the medication." In science, researchers can never prove any statement as there are infinite alternatives as to why the outcome may have occurred. They can only try to disprove a specific hypothesis. The researcher must then formulate a question they can disprove while coming to their conclusion that the new medication lowers systolic blood pressure. The hypothesis, to be disproven, is the null hypothesis and typically the inverse statement of the hypothesis. Thus, the null hypothesis for our researcher would be, "Taking the new medication will not lower systolic blood pressure by at least 10 mmHg compared to not taking the new medication." The researcher now has the null hypothesis for the research and must next specify the significance level or level of acceptable uncertainty.
Even when disproving a hypothesis, the researcher will not be 100% certain of the outcome. The researcher must then settle for some level of confidence or the significance level for which they do want to be correct. The significance level is given the Greek letter alpha and specified as the probability the researcher is willing to be incorrect. Our researcher wants to be correct about their outcome 95% of the time, or the researcher is willing to be incorrect 5% of the time. Probabilities are stated as decimals, with 1.0 being completely positive (100%) and 0 being completely negative (0%). Thus, the researcher who wants to be 95% sure about the outcome of their study is willing to be wrong 5% of the time about the study result. The alpha is the decimal expression of how much they are willing to be wrong. For the current example, the alpha is 0.05. We now have the level of uncertainty the researcher is willing to accept (alpha or significance level) of 0.05 or 5% chance they are not correct about the outcome of the study.
Now the researcher can perform their research. In the example, the researcher would give some individuals the new medication and other individuals no medication. The researcher then looks to see how the blood pressure changes after receiving the new medication and performs a statistical analysis of the results to obtain a p-value (probability value). There are numerous different tests used in research that can provide a p-value. It is imperative to use the correct statistical analysis tool when calculating the p-value. If the researchers use the wrong test, the p-value will not be accurate, and this result can mislead the researcher. The p-value is best described as the probability that the null hypothesis is true given the researcher's current set of data. In our example, the researcher found blood pressure did tend to decrease after taking the new medication. The researcher then used the help of their statistician to perform the correct analysis and arrives at a p-value of 0.02 for the decrease in blood pressure for those taking the new medication versus those not taking the new medication. Our researcher now has the three required pieces of information to look at statistical significance: the null hypothesis, the significance level, and the p-value.
The researcher can finally assess the statistical significance regarding the new medication. A study result is statistically significant if the p-value of the data analysis is less than the prespecified alpha (significance level). In our example, the p-value is 0.02, which is less than the pre-specified alpha of 0.05, so the researcher concludes there is statistical significance for the study.
What does this mean? The p-value assumes the null hypothesis is true and provides the probability of results in excess as the ones observed IF the null hypothesis is true. The p-value is the probability based on the data assuming the null hypothesis is true. The p-value is not the probability of the null hypothesis itself. Remember, the null hypothesis states that there is no significant change in blood pressure if the patient is or is not taking the new medication. This is not the same as “a 2% chance” that the null hypothesis is correct. The researcher pre-specified an alpha of 0.05, implying they wanted the chance of the null hypothesis to be less than 5% before rejecting the null hypothesis. As the p-value is 0.02 and less than the alpha of 0.05, the researcher rejects the null hypothesis because there is statistical significance. By rejecting the null hypothesis, the researcher accepts the alternative hypothesis. The researcher rejects the idea that there is no difference in systolic blood pressure with the new medication and accepts the alternative that there is a difference of at least 10 mmHg in systolic blood pressure when taking the new medication.
If the researcher had prespecified an alpha of 0.01, implying they wanted to be 99% sure the new medication lowered the blood pressure by at least 10 mmHg, then the p-value of 0.02 would be greater than the prespecified alpha of 0.01. The researcher would conclude the study did not reach statistical significance as the p-value is equal to or greater than the pre-specified alpha. The research would then not be able to reject the null hypothesis.