Two Tail t-Test Comparison
The statistical technique for comparing the means of two samples was developed by William Gosset, a chemist for the Guinness Brewery in Ireland in the late 1800s.  Gosset wanted to be able to compare different chemical properties of small samples of barley, one of the main ingredients in beer.  Simply subtracting the sample means to find the difference was inadequate. It did not take into account any differences in the variability of the data.  As you saw looking at the box and whisker plots in the NFL data, the variation in measurements taken for two different years could be quite different and that difference needs to be taken into account in any statistic that measures the degree of difference in the two sets of data. 

These differences are accounted for in the formula for Gosset's Student-t statistic:

where the difference in the sample means is in the numerator and the standard deviation of the difference in sample means in the denominator. Think about what happens in the formula depending on the size of the different values. For example, if the difference in the sample means is bigger, t is bigger because you are dividing into a larger number. 

Student was the pen name that Gosset used.  The formula is part of the statistics calculator below.  The calculator is set up to compare the average weight of prospective NFL quarterbacks in 1987 and 2019.  The resulting t = 5.51 tells us that the approximately 19 pound difference in the mean quarterback weights for the two years is 5.51 standard deviations above the average difference you would expect if the two groups were randomly broken into two groups and compared many, many times - like you did in the simulation.


Measure:

Position:




mean
s.d.
n

t-Test Results



A t value of 5.51 is extremely unlikely.  In fact, if you open the table indicating the 95% confidence level, you can see that any t value more than 2.145 standard deviations away from the mean difference would occur less than 5% of the time. Alternatively, t values less than or equal to 2.145 would occur 95% of the time. If the original hypothesis was that there was no difference in the weights, you can now say with 95% confidence that the hypothesis was wrong, that the average weights are significantly different.

Notice in using the table that the results depend on df, or the number of degrees of freedom in the comparison. The  number of degrees of freedom indicates how many of the values are free to vary.  In the case of a comparison of two groups like the one here the number of degrees of freedom is approximately one less than the smallest group size.


(Data can be changed in cells 3 and 4. q1 contains the list of earlier values; q2, the list of 2019 values.
The t value is found in cell 13.)

To Start You Thinking

1) Compare running back weight for 1987 and 2019. If we originally assume that there is no difference in the two groups, what does the t value suggest about this hypothesis?

2) What if the sample sizes, the values of N, in the Student-t formula are bigger? Or, what if the variability, the values of S, in each sample are bigger or smaller? How is the value of t affected in these situations?

3) Are there significant differences between players at other positions from 1987 to 2019.

Notes

NFL combine statistics from NFL Combine Results. Downloaded August 29, 2019.

Desmos Graphing Calculator. (2015). Desmos Graphing Calculator. Accessed September 5, 2019].

Last modified in August, 2024 by Rick Thomas