CSL 9B Activity
Sampling Distributions and the Central Limit Theorem
In Cyberstats Units B-11 and B-12, you learned that statistics, which are random variables also, have a sampling distribution. For example, the range of heights in samples of size 4 among adult women have a sampling distribution. You also learned that the sample mean has a sampling distribution, with very special properties:
a. The mean of the sampling distribution of averages in samples of size N is equal to the mean ofm x = E(X) = m
b. The standard deviation of the distribution of the sample mean is equal
to the standard deviation
of the population from which samples are drawn
divided by the square root of N, that is
s x
= s / o N
c. As the size of the samples increases the sampling
distribution of xbar becomes more nearly a
normal distribution.
Today's activity is intended to illustrate properties a, b, and c.
The disk given to you contains data on X = 'Longest relationship' 209 students in Stat 200 said they had had. Specifically, the question asked of them was 'How long (in weeks) did your longest relationship last?'
Column C1 contains the 209 (population) values in the Minitab
Worksheet.
Column C2 contains the averages of 1000 samples each of size N=4 from this
population.
Column C3 contains the averages of 1000 samples each of size N=16 from this
population
Column C4 contains the averages of 1000 samples each of size N=64 from this
population.
Histograms of the data in the four columns are given on the last page of this activity.
a. Draw one random sample of size N=4 for each member of your group from the population of 209 values and compute the average for each of your samples:
Click Calc, Random Data, Sample From Columns. Type in '4', 'C6', and click Sample with Replacement.Averages for each group member: ______, ______, ______, ______, ______, ______.
This 'simulates' what you see in Column 2 (only there are 1000 averages in C2!).
Sample size Mean SD Min Q1 Median Q3 Max
(Population) N=1 ____ _____ _____ _____ _____ _____ _____
N=4 ____ _____ _____ _____ _____ _____ _____
N=16 ____ _____ _____ _____ _____ _____ _____
N=64 ____ _____ _____ _____ _____ _____ _____
d. Does the mean change very much? Yes ___ No ___ What should the values be?
e. Describe the behavior of the SD. The value for N=1 is s
= _____; what are the theoretical
values for N=4: _______; N=16: _______; and
N=64: ______?
In words, how is the SD changing as N increases from 1 to 4 to 16 to 64?
__________________________________________________________________
__________________________________________________________________
f.
Is the median 'constant'? Look at the value of the median for N=1 (the
population median) and
then compare it with the values for N=4, 16, and 64
(the medians of the sampling distribution of
the mean).
__________________________________________________________________
__________________________________________________________________
__________________________________________________________________
__________________________________________________________________
Measures of 'Center/Location' ? Yes ___ No ___; Measures of spread? Yes ___ No ___
Mild Outliers? : Yes ___ No ___ Extreme Outliers? Yes ___ No ___
If yes, what are they? ___________ If yes, what are they? _____________
j. Does the distribution of X = mean of the distribution of averages for samples of size N appear to be2.5th percentile = _____. 97.5th percentile = _____.



