|
|
One can't emphasize enough how applications give rise to formulas and not the other way around.
Adolph Quetelet (1796-1874) was a practical man lending himself to taking actual data.
With large data collections the "Laws of Probability" should be further realized.
Quetelet was in charge of some measurements of chest-sizes of
5738 Scottish soldiers. 
|
|
When does Normal Distribution apply?
There certainly is no one answer to this question. Many measurements are normally distributed and the standard deviation is considered as a help in error analysis.
Normal distribution seems to apply to most of the following measurements:
1) The heights of males in North America
2) The IQ-scores of a random individuals from a uniform population
3) The lengths of pregnancies
4) The body temperatures of humans ( or other animals)
5) The weights of newborns
6) The measures of cholesterol levels
7) The weights of manufactured products
There are certainly many more.
|
|
Test-scores at Cambridge University
|
Marks Obtained
|
Number of Candidates
|
|
0000-0500
|
24
|
|
0500-1000
|
74
|
|
1000-1500
|
38
|
|
1500-2000
|
21
|
|
2000-2500
|
11
|
|
2500-3000
|
8
|
|
3000-3500
|
11
|
|
3500-4000
|
5
|
|
4000-4500
|
2
|
|
4500-5000
|
1
|
|
5000-5500
|
3
|
|
5500-6000
|
2
|
|
Mid-points
|
Freq.Xmid
|
Freq.X(mid.-mean)^2
|
|
250
|
6 000
|
33 843 750.00
|
|
750
|
55 500
|
34 976 562.50
|
|
1 250
|
47 500
|
1 335 937.50
|
|
1 750
|
36 750
|
2 050 781.25
|
|
2 250
|
24 750
|
7 261 718.75
|
|
2 750
|
22 000
|
13 781 250.00
|
|
3 250
|
35 750
|
36 136 718.80
|
|
3 750
|
18 750
|
26 738 281.30
|
|
4 250
|
8 500
|
15 820 312.50
|
|
4 750
|
4 750
|
10 972 656.30
|
|
5 250
|
15 750
|
43 605 468.80
|
|
5 750
|
11 500
|
37 195 312.50
|
|
287 500
|
263 718 750.00
|
1) The mean is not at the center of the test scores.
2) The distribution of the test scores are not symmetric on both sides of the
mean.
3) A histogram
would show that the vertical bars do not have a "bell-shape"
curve.
|
|

US Army recruits - 1895
|
Heights (in.)
|
Frequency
|
|
61
|
2
|
|
62
|
4
|
|
63
|
7
|
|
64
|
11
|
|
65
|
13
|
|
66
|
15
|
|
67
|
14
|
|
68
|
12
|
|
69
|
8
|
|
70
|
5
|
|
71
|
2
|
|
72
|
1
|
|
Freq.Xheight
|
Freq(Height-mean)^2
|
|
122
|
53.4578
|
|
248
|
69.5556
|
|
441
|
70.3423
|
|
704
|
51.7979
|
|
845
|
17.7957
|
|
990
|
0.4335
|
|
938
|
9.6446
|
|
816
|
40.1868
|
|
552
|
64.0712
|
|
350
|
73.3445
|
|
142
|
46.6578
|
|
72
|
33.9889
|
|
6 220
|
531.276
|
1) The mean is at the center of the height measurements.
2) Heights are equally balanced on either side of the mean.
|
|
The data set, as presented above, is using a discrete random variable. The "Bell-curve" or normal curve uses a continuous random variable.
Even though it seems that the theory is difficult to work with at first it pays large dividends to engineers and scientists every day.
To show you how good this large collection of measurements is we look at this example:
|
|
From the grouped data we can find the mean chest size of 39.9 inches and a standard deviation of 2.1 inches.
Using these parameters (and the Standard Normal tables)
Part A)
We find z-values for 34.5 and 38.5.
They are z = - 2.52 and z = -0.62. (Both left of the mean.)
Standard normal tables gives us a probability of
0.4910 - 0.2324 = 0.2586 = 25.9%
Part B)
We find z-values for X = 39.5. z-value is -0.14 (left of the mean).
Standard normal tables gives us a probability of
0.5000 + 0.0557 = 0.5557 = 55.6% (Theory)
| X = chest measurement in inches |
|
|
X
|
Frequency
|
Rel. Freq.
|
|
33
|
3
|
0.1%
|
|
34
|
18
|
0.3%
|
|
35
|
81
|
1.4%
|
|
36
|
185
|
3.2%
|
|
37
|
420
|
7.3%
|
|
38
|
749
|
13.1%
|
|
39
|
1 073
|
18.7%
|
|
40
|
1 079
|
18.8%
|
|
41
|
934
|
16.3%
|
|
42
|
658
|
11.5%
|
|
43
|
370
|
6.4%
|
|
44
|
92
|
1.6%
|
|
45
|
50
|
0.9%
|
|
46
|
21
|
0.4%
|
|
47
|
4
|
0.1%
|
|
48
|
1
|
0.0%
|
|
Totals
|
5 738
|
100%
|

This application shows how many measurements behave according to a theoretical "Bell-curve".
If you were to look at a histogram the 16 vertical bars would form a normal distribution.