Exercise 1

Give some examples of:

  1. Non related variables.
  2. Variables that are increasingly related.
  3. Variables that are decreasingly related.

Exercise 2

In a study about the effect of different doses of a medicament, 2 patients got 2 mg and took 5 days to cure, 4 patients got 2 mg and took 6 days to cure, 2 patients got 3 mg ant took 3 days to cure, 4 patients got 3 mg and took 5 days to cure, 1 patient got 3 mg and took 6 days to cure, 5 patients got 4 mg and took 3 days to cure and 2 patients got 4 mg and took 5 days to cure.

  1. Construct the joint frequency table.
  2. Get the marginal frequency distributions and compute the main statistics for each variable.
  3. Compute the covariance and interpret it.

Exercise 3

The table below shows the two-dimensional frequency distribution of a sample of 80 persons in a study about the relation between the blood cholesterol ($X$) in mg/dl and the high blood pressure ($Y$).

  1. Complete the table.
  2. Construct the linear regression model of cholesterol on pressure.
  3. Use the linear model to calculate the expected cholesterol for a person with pressure 160 mmHg.
  4. According to the linear model, what is the expected pressure for a person with cholesterol 270 mg/dl?

Use the following sums: $\sum x_i=16960$ mg/dl, $\sum y_j=11160$ mmHg, $\sum x_i^2=3627200$ (mg/dl)$^2$, $\sum y_j^2=1576800$ mmHg$^2$ y $\sum x_iy_j=2378800$ mg/dl$\cdot$mmHg.

Exercise 4

A research study has been conducted to determine the loss of activity of a drug. The table below shows the results of the experiment.

  1. Construct the linear regression model of activity on time.
  2. According to the linear model, when will the activity be 80%? When will the drug have lost all activity?

Exercise 5

A basketball team is testing a new stretching program to reduce the injuries during the league. The data below show the daily number of minutes doing stretching exercises and the number of injuries along the league.

  1. Construct the regression line of the number of injuries on the time of stretching.
  2. How much is the reduction of injuries for every minute of stretching?
  3. How many minutes of stretching are require for having no injuries? Is reliable this prediction?

Use the following sums ($X$=Number of minutes stretching, and $Y$=Number of injuries): $\sum x_i =160$ min, $\sum y_j=14$ injuries, $\sum x_i^2=4700$ min$^2$, $\sum y_j^2=36$ injuries$^2$ and $\sum x_iy_j=160$ min$\cdot$injuries.

Exercise 6

For two variables $X$ and $Y$ we have

  • The regression line of $Y$ on $X$ is $y-x-2=0$.
  • The regression line of $X$ on $Y$ is $y-4x+22=0$.

Calculate:

  1. The means $\bar x$ and $\bar y$.
  2. The correlation coefficient.

Exercise 7

The means of two variables $X$ and $Y$ are $\bar x=2$ and $\bar y=1$, and the correlation coefficient is 0.

  1. Predict the value of $Y$ for $x=10$.
  2. Predict the value of $X$ for $y=5$.
  3. Plot both regression lines.

Exercise 8

A study to determine the relation between the age and the physical strength gave the scatter plot below. plot of chunk age_physical_strength_scatterplot

  1. Calculate the linear coefficient of determination for the whole sample.
  2. Calculate the linear coefficient of determination for the sample of people younger than 25 years old.
  3. Calculate the linear coefficient of determination for the sample of people older than 25 years old.
  4. For which age group the relation between age and strength is stronger?

Use the following sums ($X$=Age and $Y=$Weight lifted).

  • Whole sample: $\sum x_i=431$ years, $\sum y_j=769$ Kg, $\sum x_i^2=13173$ years$^2$, $\sum y_j^2=39675$ Kg$^2$ and $\sum x_iy_j=21792$ years$\cdot$Kg.

  • Young people: $\sum x_i=123$ years, $\sum y_j=294$ Kg, $\sum x_i^2=2339$ years$^2$, $\sum y_j^2=14418$ Kg$^2$ and $\sum x_iy_j=5766$ years$\cdot$Kg.

  • Old people: $\sum x_i=308$ years, $\sum y_j=475$ Kg, $\sum x_i^2=10834$ years$^2$, $\sum y_j^2=25257$ Kg$^2$ and $\sum x_iy_j=16026$ years$\cdot$Kg.