Continuous Random Variables
Probability distribution of a continuous random variable
Continuous random variables, unlike discrete random variables, can take any value in a real interval. Thus the range of a continuous random variables is infinite and uncountable.
Such a density of values makes impossible to compute the probability for each one of them, and therefore, it’s not possible to define a probabilistic model trough a probability function like with discrete random variables.
Besides, usually the measurement of continuous random variable is limited by the precision of the measuring instrument. For instance, when somebody says that is 1.68 meters tall, his or her true height is no exactly 1.68 meters, because the precision of the measuring instrument is only cm (two decimal places). This means that the true height of that person is between 1.675 y 1.685 meters.
Hence, for continuous variables, it makes no sense to calculate the probability of an isolated value, and we will calculate probabilities for intervals.
Probability density function
To model the probability distribution of a continuous random variable we use a probability density function.
Definition - Probability density function. The probability density function of a continuous random variable
-
It is non-negative:
, -
The area bounded by the curve of the density function and the x-axis is equal to 1, that is,
-
The probability that
assumes a value between and is equal to the area bounded by the density function and the x-axis from to , that is,
Distribution function
The same way that for discrete random variables, for continuous random variables it makes sense to calculate cumulative probabilities.
Definition - Distribution function. The distribution function of a continuous random variable
Probabilities as areas
To calculate probabilities with a continuous random variable we measure the area bounded by the probability density function and the x-axis in an interval.
This area can be calculated integrating the density function or subtracting the distribution function that is easier,
Example. Given the following function
let’s check that is a density function.
As this function is clearly non-negative, we have to check that total area bounded by the curve and the x-axis is 1.
Now, let’s calculate the probability of
Population statistics
The calculation of the population statistics is similar to the case of discrete variables, but using the density function instead of the probability function, and extending the discrete sum to the integral.
The most important are:
Definition - Continuous random variable mean The mean or the expectec value of a continuous random variable
Definition - Continuous random variable variance and standard deviation The variance of a continuous random variable
The standard deviation of a random variable
Example. Let
The mean is
and the variance is
Continuous probability distribution models
According to the type of experiment where the random variable is measured, there are different probability distributions models. The most common are
- Continuous uniform.
- Normal.
- Student’s T.
- Chi-square.
- Fisher-Snedecor’s F.
Continuous uniform distribution
When all the values of a random variable
Definition – Continuous uniform distribution
Observe that
The mean and the variance are
Example. The generation of a random number between 0 and 1 is follows a continuous uniform distribution
As the density function is constant, the distribution function has a linear growth.
Example. A bus has a frequency of 15 minutes. Assuming that a person can arrive to the bus station in any time, what is the probability of waiting for the bus between 5 and 10 minutes?
In this case, the variable
Then, the probability of waiting between 5 and 10 minutes is
And the expected waiting (the mean) time is
Normal distribution
The normal distribution model is, without a doubt, the most important continuous distribution model as it is the most common in Nature.
Definition - Normal distribution
The two parameters
The plot of the probability density function of a normal distribution
The bell shape depends on the mean
- The mean
sets the center of the bell. - The standard deviation sets
the width of the bell.
The plot of the distribution function of a normal distribution is S shaped.
Normal distribution properties
- It is symmetric with respect to the mean, and therefore, the coefficient of skewness is zero,
. - It is mesokurtic, as the density function is bell shaped, and so, the coefficient of kurtosis is zero,
. - The mean, median and mode are the same
- It asymptotically approaches 0 when
tends to .
Example. It is known that the cholesterol level in females of age between 40 and 50 follows a normal distribution with mean 210 mg/dl and standard deviation 20 mg/dl.
According to the Gauss bell properties, this means that
- The 68% of females have a cholesterol level between
mg/dl, i.e., between 190 and 230 mg/dl. - The 95% of females have a cholesterol level between
mg/dl, i.e., between 170 and 250 mg/dl. - The 99% of females have a cholesterol level between
mg/dl, i.e., between 150 and 270 mg/dl.
Example of blood analysis. In blood analysis it is common to use the interval
Thus, when a women between 40 and 50 years of age has a cholesterol level out of this interval, it’s common to think about some pathology. However this person could be healthy, although the likelihood of that happening is only 5%.

The central limit theorem
This behavior is common in many physical and biological variables in Nature.
If you think about the distribution of the height, for instance, you can check that most people in the population have a height around the mean, but as the heights move away from the mean, both below and above the mean, there are few and few people with such a heights.
The explanation for this behavior is the , that we will see in the next chapter; it states that a continuous random variable whose values depends on a huge number of independent factors adding their effects, always follows a normal distribution.
The standard normal distribution
The most important normal distribution has mean zero,
Calculation of probabilities with the normal distribution
To avoid integrating the normal density function to compute probabilities it’s common to use the distribution function, that is given in a tabular format like the one below. For instance, to calculate
0.00 | 0.01 | 0.02 | … | |
---|---|---|---|---|
0.0 | 0.5000 | 0.5040 | 0.5080 | … |
0.1 | 0.5398 | 0.5438 | 0.5478 | … |
0.2 | 0.5793 | 0.5832 | 0.5871 | … |
0.3 | 0.6179 | 0.6217 | 0.6255 | … |
0.4 | 0.6554 | 0.6591 | 0.6628 | … |
0.5 | 0.6915 | 0.6950 | 0.6985 | … |
⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
To compute cumulative probabilities to the right of a value, we can apply the rule for the complement event. For instance,
Standardization
We have seen how to use the table of the standard normal distribution function to compute probabilities, but, what to do when the normal distribution is not the standard one?
In that case we can use standardization to transform any normal distribution in the standard normal distribution.
Theorem - Standardization. If
Thus, to compute probabilities with a non-standard normal distribution first we have to standardize the variable before using the table of the standard normal distribution function.
Example. Assume that the grade of an exam
As
Then we can use the table of the standard normal distribution function,
Therefore,
Chi-square distribution
Definition - Chi-square distribution
follows a chi-square probability distribution with
Its range is
Example. Below are plotted the density functions of some chi-square distribution models.
Chi-square distribution properties
- The range is non-negative.
- If
and , then - It asymptotically approaches to a normal distribution as the degrees of freedom increase.
As we will see in the next chapter, the chi-square distribution plays an important role in the estimation of the population variance and in the study of relations between qualitative variables.
Student’s t distribution
Definition - Student’s t distribution
follows a Student’s t probability distribution model with
Its range is
Example. Below are plotted the density functions of some student’s t distribution models.
Student’s t distribution properties
- The mean, the median and the mode are the same,
. - It is symmetric,
. - It asymptotically approaches to the standard normal distribution as the degrees of freedom increase. In practice for
both distributions are approximately the same.
As we will see in the next chapter, the Student’s t distribution plays an important role in the estimation of the population mean.
Fisher-Snedecor’s F distribution
Definition - Fisher-Snedecor’s F distribution
follows a Fisher-Snedecor’s F probability distribution model with
Its range is
Example. Below are plotted the density functions of some Fisher-Snedecor’s F distribution models.
Fisher-Snedecor’s F distribution properties
- The range is non-negative.
- It satisfies
Thus, if we name the value that satisfies , then which is helpful in order to compute probabilities from the table of the distribution function.
As we will see in the next chapter, the Fisher-Snedecor’s F distribution plays an important role in the comparison of population variances and in the analysis of variance test (ANOVA).