Problems of Linear Regression
Exercise 1
Give some examples of:
- Non related variables.
- Variables that are increasingly related.
- Variables that are decreasingly related.
- The daily averge temperature and the daily number of births in a city.
- The hours preparing an exam and the score.
- The weight of a person and the time require to run 100 meters.
Exercise 2
In a study about the effect of different doses of a medicament, 2 patients got 2 mg and took 5 days to cure, 4 patients got 2 mg and took 6 days to cure, 2 patients got 3 mg ant took 3 days to cure, 4 patients got 3 mg and took 5 days to cure, 1 patient got 3 mg and took 6 days to cure, 5 patients got 4 mg and took 3 days to cure and 2 patients got 4 mg and took 5 days to cure.
- Construct the joint frequency table.
- Get the marginal frequency distributions and compute the main statistics for each variable.
- Compute the covariance and interpret it.
Dose:
Days:
3.
Exercise 3
The table below shows the two-dimensional frequency distribution of a sample of 80 persons in a study about the relation between the blood cholesterol (
- Complete the table.
- Construct the linear regression model of cholesterol on pressure.
- Use the linear model to calculate the expected cholesterol for a person with pressure 160 mmHg.
- According to the linear model, what is the expected pressure for a person with cholesterol 270 mg/dl?
Use the following sums:
Regression line of cholesterol on blood pressure:
3.
4.
Regression line of blood pressure on cholesterol:
Exercise 4
A research study has been conducted to determine the loss of activity of a drug. The table below shows the results of the experiment.
- Construct the linear regression model of activity on time.
- According to the linear model, when will the activity be 80%? When will the drug have lost all activity?
years, years .
%, % .
years %.
Regression line of activity on time: .
Regression line of time on activity:
Exercise 5
A basketball team is testing a new stretching program to reduce the injuries during the league. The data below show the daily number of minutes doing stretching exercises and the number of injuries along the league.
- Construct the regression line of the number of injuries on the time of stretching.
- How much is the reduction of injuries for every minute of stretching?
- How many minutes of stretching are require for having no injuries? Is reliable this prediction?
Use the following sums (
min, min .
injuries, injuries .
min injuries.
Regression line of injuries on time of stetching: . injuries/min.
Regression line of time of stretching on injuries:
Exercise 6
For two variables
- The regression line of
on is . - The regression line of
on is .
Calculate:
- The means
and . - The correlation coefficient.
and . .
Exercise 7
The means of two variables
- Predict the value of
for . - Predict the value of
for . - Plot both regression lines.
. .
Exercise 8
A study to determine the relation between the age and the physical strength gave the scatter plot below.
- Calculate the linear coefficient of determination for the whole sample.
- Calculate the linear coefficient of determination for the sample of people younger than 25 years old.
- Calculate the linear coefficient of determination for the sample of people older than 25 years old.
- For which age group the relation between age and strength is stronger?
Use the following sums (
-
Whole sample:
years, Kg, years , Kg and years Kg. -
Young people:
years, Kg, years , Kg and years Kg. -
Old people:
years, Kg, years , Kg and years Kg.
years, years .
kg, kg .
years kg.
. years, years .
kg, kg .
years kg.
. years, years .
kg, kg .
years kg.
.- The linear relation between the age and the physical strength is a little bit stronger in the group of young people.