What Is a Two-Tailed Test?
In statistics, a two-tailed test is a method in which the critical area of a distribution is two-sided and tests whether a sample is greater than or less than a certain range of values. It is used in null-hypothesis testing and testing for statistical significance. If the sample being tested falls into either of the critical areas, the alternative hypothesis is accepted instead of the null hypothesis. The two-tailed test gets its name from testing the area under both tails of a normal distribution, although the test can be used in other non-normal distributions.
- In statistics, a two-tailed test is a method in which the critical area of a distribution is two-sided and tests whether a sample is greater than or less than a certain range of values.
- It is used in null-hypothesis testing and testing for statistical significance.
- If the sample being tested falls into either of the critical areas, the alternative hypothesis is accepted instead of the null hypothesis.
- By convention two-tailed tests are used to determine significance at the 5% level, meaning each side of the distribution is cut at 2.5%.
Be careful to note if a statistical test is one- or two-tailed as this will greatly influence a model's interpretation.
How a Two-Tailed Test Works
A basic concept of inferential statistics is the hypothesis testing, which is run to determine whether a claim is true or not, given a population parameter. A testing that is programmed to show whether the mean of a sample is significantly greater than and significantly less than the mean of a population is referred to as a two-tailed test.
A two-tailed test is designed to examine both sides of a specified data range as designated by the probability distribution involved. The probability distribution should represent the likelihood of a specified outcome based on predetermined standards. This requires the setting of a limit designating the highest (or upper) and lowest (or lower) accepted variable values included within the range. Any data point that exists above the upper limit or below the lower limit is considered out of the acceptance range and in an area referred to as the rejection range.
There is no inherent standard with regard to the number of data points that must exist within the acceptance range. In instances where precision is required, such as in the creation of pharmaceutical drugs, a rejection rate of 0.001% or less may be instituted. In instances where precision is less critical, such as the number of food items in a product bag, a rejection rate of 5% may be appropriate.
An Example of a Two-Tailed Test
As a hypothetical example, imagine that a new stockbroker (XYZ) claims that his brokerage fees are lower than that of your current stock broker's (ABC). Data available from an independent research firm indicates that the mean and standard deviation of all ABC broker clients are $18 and $6, respectively.
A sample of 100 clients of ABC is taken and brokerage charges are calculated with the new rates of XYZ broker. If the mean of the sample is $18.75 and the sample standard deviation is $6, can any inference be made about the difference in the average brokerage bill between ABC and XYZ broker?
- H0: Null Hypothesis: mean = 18
- H1: Alternative Hypothesis: mean <> 18 (This is what we want to prove.)
- Rejection region: Z <= - Z2.5 and Z>=Z2.5 (assuming 5% significance level, split 2.5 each on either side).
- Z = (sample mean – mean) / (std-dev / sqrt (no. of samples)) = (18.75 – 18) / (6/(sqrt(100)) = 1.25
This calculated Z value falls between the two limits defined by: - Z2.5 = -1.96 and Z2.5 = 1.96.
This concludes that there is insufficient evidence to infer that there is any difference between the rates of your existing broker and the new broker. Alternatively, the p-value = P(Z< -1.25)+P(Z >1.25) = 2 * 0.1056 = 0.2112 = 21.12%, which is greater than 0.05 or 5%, leads to the same conclusion.
Special Considerations: Random Sampling
A two-tailed test can also be used practically during certain production activities in a firm, such as with the production and packaging of candy at a particular facility. If the production facility designates 50 candies per bag as its goal, with an acceptable distribution of 45 to 55 candies, any bag found with an amount below 45 or above 55 is considered within the rejection range
To confirm the packaging mechanisms are properly calibrated to meet the expected output, a random sampling may be taken to confirm accuracy. For the packaging mechanisms to be considered accurate, an average of 50 candies per bag with an appropriate distribution is desired. Additionally, the number of bags that fall within the rejection range need to fall within the probability distribution limit considered acceptable as an error rate.
If an unacceptable rejection rate is discovered, or an average deviating too far from the desired mean, adjustments to the facility or associated equipment may be required to correct the error. Regular use of two-tailed testing methods can help ensure production stays within limits over the long term.
Two-Tailed Versus One-Tailed Test
When a hypothesis test is set up to show that the sample mean would be higher or lower than the population mean, this is referred to as a one-tailed test. The one-tailed test gets its name from testing the area under one of the tails (sides) of a normal distribution. When using a one-tailed test, an analyst is testing for the possibility of the relationship in one direction of interest, and completely disregarding the possibility of a relationship in another direction.
If the sample being tested falls into the one-sided critical area, the alternative hypothesis will be accepted instead of the null hypothesis. A one-tailed test is also known as a directional hypothesis or directional test.