|
Test Assumptions
All parametric statistics have a set of assumptions that must be met in order to properly use the statistics to test hypotheses. The assumptions of the t test for independent means are listed below.
- Random sampling from a defined population
- Interval or ratio scale of measurement
- Scores in the populations are normally distributed
- The population variances are equal
When reading the psychological literature, we can find many studies in which many of these assumptions are violated. Random sampling is required for all statistical inference because it is based on probability. Random samples are difficult to find, however, and psychologists and researchers in other fields will use inferential statistics on nonrandom samples and discuss the sampling limitations in the article.
We learned in our scale of measurement tutorial that psychologists will apply parametric statistics such as the t test for independent means on approximately interval scales even though the tests require interval or ratio data. This is an accepted practice in psychology and one that we use when we analyze our class data. Finally, the assumption that the difference scores are normally distributed in the population is considered "robust". This means that the the statistic has been shown to yield useful results even when the assumption is violated.
The central limit theorem tells us that even if the population distribution is unknown, we know that the sampling distribution of the mean will be approximately normally distributed if the sample size is large. This also applies to the means of difference scores and helps to contribute to the t test being robust for violations of normal distribution.
The t test for independent means is not robust for violations of the assumption of equal variance. The shape of the sampling distribution of the mean is a function of the variance/standard deviation in the population and the size of the sample drawn from the population. If the populations have different variances, then we do not know what the shape of the sampling distribution of the difference should be. This is a serious problem for our statistic. Computer programs offer a solution by testing whether the variances are equal and providing a mathematical correction when they are not. If we are conducting a one-tailed test, the data are highly skewed, and the variances are unequal, we should consider a different test.
|