Continuous Random Variables and the Normal Distribution Dr Tom Ilvento
Department of Food and Resource Economics
•
Most intro stat class would have a section on probability - we don’t
•
But it is important to get exposure to the normal distribution
•
We will use this distribution, and the related tdistribution, when we shift to inferences
• •
First we need to understand the normal distribution And feel comfortable with the Standard Normal Table 2
Probability
•
• •
Random Variables
Probability is a numerical measure of the likelihood that Event A will occur
• •
P(A)
•
Random Variables – variables that assume numerical values associated with random outcomes from an experiment
•
Random variables can be:
Prob(A)
The basic definition is:
•
It is a proportion which goes from 0 to 1
3
• •
Discrete Continuous
For random variables there is
• •
A probability distribution Expectation and variance 4
The probability of the number of males in three live births
• •
This discrete distribution shows the probability of 0, 1, 2, or 3 males in three births The mean and variance are:
• • •
Mean = 1.5 Variance = .75 Std Dev = .866
What are Continuous Random Variables?
Probability Distribution p(X) 0.4 0.3 0.2 0.1 0 0
1
2
3
•
Unlike Discrete Random Variables, Continuous Random Variables take on any point in the interval
• •
Thus the probability distribution is continuous It is referred to as a Probability Density Function
• •
PDF f(x)
5
6
You might want to get a copy of the Standard Normal Distribution handout
When dealing with a pdf...
• •
•
• •
It is not particularly useful to think of a probability when a continuous random variable takes on a particular value
•
P(x=a) = 0
But, we can think of areas under the curve as reflecting a probability
• • •
P(a 1.28 z > 1.645 z > 2.33
Two Tailed z < -1.645 or z > 1.645 z < -1.96 or z > 1.96 z < -2.575 or z > 2.575
8
Infinate Number of Normal Curves
The Normal Distribution
•
One bell shaped, symmetrical distribution is the normal distribution It is defined by two parameters
• •
µ the Mean ! The Standard Deviation
f ( x) =
2 1 e #(1/ 2 )[( x # µ ) / ! ] ! 2"
•
For every distribution with a mean (µ) and a standard deviation (!) there is a different normal curve
• •
Thus, there are an infinite number of normal curves If x is a random variable distributed as a normal variable then it is designated as: x ~ N(mean, std dev)
9
10
Properties of the Normal Distribution
Standard Normal Distribution
•
The area under the curve = 1
• •
Mean = Median = Mode
•
It has an infinite range
The IQR is 1.33 Std Deviations wide (.677 below or .677 above)
z x F(x)
0.025 0.02
0 100 50.00%
50
1.1 1 0.9 0.8 0.7
0.015
0.6 0.5
50.00%
0.01
0.4 0.3
0.005
0.2
Cumulative Probability
Defined by the mean and standard deviation
Cumulative Probability Graph
Probability Density
Symmetrical, Bell-shaped curve
•
0.1 0 0
50
100
150
0 200
• 11
Since its properties are defined by a formula, we can a priori define probabilities associated with the normal curve, but each combination of a mean and std deviation results in a different normal curve If we convert our normally distributed variable to z-scores, we make it possible to use one table of probabilities for all normal pdf
Cumulative Probability Graph z x F(x)
0 0 50.00%
0.4550
1.1 1
0.4
0.9
0.35
0.8
0.3
0.7
0.25
0.6
0.2
0.5
50.00%
0.4
0.15
0.3
0.1
0.2
0.05
0.1
0 -6
-4
-2
0 0
2
4
6
This is called the Standard Normal Distribution
• •
mean = 0 std dev = 1
12
Cumulative Probability
•
• • •
For every variable distributed normally with a mean (µ) and a standard deviation (!) there is a different normal curve
Probability Density
• •
Finding Areas under the Normal curve
•
Look at the Standard Normal Table
Basic Steps 1. Draw the curve and the area we are interested in 2. Convert the values to zscores
• •
There are several types of tables
•
Since the distribution is symmetrical,
We will work with a table where only ! of the curve is presented
• •
3. Read the proportions in the table, and do any additional calculations that may be necessary
•
Both halves are identical, and each half represents p = .5
So our table will only calculate probabilities for the right hand side of the distribution
•
Moving from the center, µ = 0, toward the right tail
13
14
Standard Normal Table - a partial view of the table •
The table allows for two decimal places of a zscore
•
Vertical axis is the ones and first decimal place
•
Horizontal axis is the second decimal place
•
To find the probability associated with a zscore of 1.08
•
Z
This value represents the probability from µ = 0 up to 1.08 standard deviations above the mean - .3599