FAQ/semsd - CBU statistics Wiki

Revision 24 as of 2013-04-10 14:21:58

Clear message
location: FAQ / semsd

# How do I obtain the standard deviation from the standard error of the mean (s.e.m.) and how does this and the mean vary with sample size?

$$\frac{\mbox{The standard deviation}}{\sqrt{\mbox{sample size}}}$$= standard error of the mean

i.e.

standard deviation = $$\sqrt{\mbox{sample size}} \mbox{multiplied by the standard error of the mean}$$

It follows from this (e.g. p.218 of Babbie) that the standard error of the mean decreases with sample size, N.

This follows since: The variance of the mean = 1/N x variance of the response = 1/N x The variance of N randomly sampled responses from a parent population = 1/ $$N2 x (Nsigma2) = sigma2/N$$ where $$sigma^2$$ is the (unobserved true) variance of the (parent population of the) response. Since this involves a term 1/N the variance (or its square root = standard error) of the mean decreases with sample size. The sample variance is used as an estimate for $$sigma^2$$.

The mean (= to the midpoint or median in a Normal distribution) on the other hand is not proportional to sample size so is uneffected by N as it converges to the true mean of the ( assumed) underlying normal distribution as N increases. One can see this easily by considering an example: suppose we have a sample of size 3 of a responses = 1 2 3 then the mean is 2. Suppose I take a sample of size 7 say of the same response and get values of 1 1 2 2 2 3 3 then the mean = 2 there since it is symmetric about the (hypothesised true) mean of 2.

Putting these together the ratio mean / s.e.(mean) is akin to a one-sample t-test of the null hypothesis that the true mean equals zero. This ratio will increase with increasing sample size because the mean will stay approximately the same as its s.e. decreases. This also follows from the fact that as the sample size increases a t random variable converges to a value having a standard Normal distribution (z-value). If any z-value is squared it becomes a value having a chi-square distribution (with one degree of freedom) and chi-squares increase with sample size (see e.g. Howell p.157) so the square of the mean divided by its standard error goes up with N. The main practical problem resulting from the dependency of the standard error of the mean on N is that for large N the t and chi-square tests will be large and lead to type I error (incorrectly rejecting the null hypothesis) for trivially small effect sizes.

The main point is that you already have a fixed difference (mean-0) and so the t statistic just tells you how "big that difference is" measured in (estimated) units of variability of the observed mean. In more general terms, when the units become smaller in absolute numbers, the same absolute difference d emp (= the difference between the observed sample mean and the mean assumed under the null hypothesis) will "be worth more units" and will thus count as "more surprisingly high" (= less likely to occur) if the true mean is equal to the mean assumed under the null hypothesis (see here for further discussion).

References

Babbie, E. (2008). The Basics of Social Research. Fourth Edition. Thomson Wadsworth: Belmont.CA.

Howell, D.C. (1979) Statistical Methods for Psychologists. Fourth Edition. Wadsworth:Belmont,CA.