Degrees: Pharmacy, Biotechnology
Date: December 17, 2018

## Question 1

The chart below represents the cumulative distribution of the number of daily defective drugs produced by a machine in a sample of 40 days.

1. Construct the frequency table of the number of defective drugs.
2. Draw the box and whiskers plot of the number of defective drugs.
3. Study the symmetry of the distribution of the number of defective drugs.
4. If the number of defective drugs produced by a second machine follows the equation $y=3x+2$, where $x$ and $y$ are the number of defective drugs with the first and the second machines respectively, in which machine is more representative the mean of the number of defective drugs?
5. Which number of defective drugs is relatively smaller, 3 drugs in the first machine or 9 in the second one?

## Question 2

A pharmaceutical laboratory produces two models of blood pressure monitor, one for the arm and the other for the wrist. To compare the accuracy of both blood pressure monitors, a quality control has been conducted with a sample of 20 patients, getting the following results:

$\sum x_i=265.4$ mmHg, $\sum y_i=262.5$ mmHg , $\sum z_i=262.4$ mmHg,
$\sum x_i^2=3701.14$ mmHg$^2$, $\sum y_i^2=3629.41$ mmHg$^2$, $\sum z_i^2=3615.38$ mmHg$^2$,
$\sum x_iy_j=3658.28$ mmHg$^2$, $\sum x_iz_j=3655.95$ mmHg$^2$, $\sum y_jz_j=3613.97$ mmHg$^2$.

Where $X$ is the blood pressure with the arm monitor, $Y$ with the wrist monitor and $Z$ the real blood pressure.

1. Which blood pressure monitor predicts better the real blood pressure with a linear regression model?
2. If a patient has a real blood pressure of $13.5$ mmHg, what is the expected blood pressure given by the arm blood pressure monitor?

## Question 3

The regression line of $Y$ on $X$ is $y=1.2x-0.6$.

1. Which of the following lines can not be the regression line of $X$ on $Y$. Justify the answer.
1. $x=0.9y-0.6$
2. $x=-0.7y+0.4$
3. $x=0.8y-0.7$
4. $x=-0.6y-0.5$
5. $x=0.4y-0.6$
6. $x=-0.5y+0.9$
2. Considering only the ones that can be the regression line of $X$ on $Y$, which one will give better predictions? Justify the answer.

## Question 4

In an epidemiological study a sample of 400 persons with breast cancer was drawn and another sample of 1200 persons without breast cancer. In the sample of persons with breast cancer there was 180 smokers, while in the sample of persons without breast cancer there was 1140 non-smokers.

1. Compute the relative risk of developing cancer smoking and interpret it.
2. Compute the odds ratio of developing cancer smoking and interpret it.

## Question 5

We want to develop a diagnostic test to rule out a disease when the outcome of the test is negative (negative predictive value) with a probability 90% at least. It is known that the prevalence of the disease in the population is 15% and the sensitivity of the test is set to 80%.

1. What must be the minimum specificity of the test?
2. Using the previous specificity, compute the probability of a correct diagnostic.
3. If we apply the same test two times to the same patient with negative outcomes, what is the probability of ruling out the disease?

## Question 6

It is known that in a city one out of 20 persons, in average, has blood type $AB$.

1. If we draw randomly 200 blood donors, what is the probability of having at least 5 with blood type $AB$?
2. If we draw randomly 10 blood donors, what is the probability of having more than 8 with blood type different of $AB$?

## Question 7

In a course there are 150 females and 80 males. It is known that the distribution of scores of females and males are normal with the same standard deviation. It is also known that there are 120 females and 56 males with a score greater than 5, and 36 males with a score between 5 and 7.

1. Compute the means and standard deviations of the distributions of scores of females and males.
2. How many females will have a score between 4.5 and 8?
3. Above what score will be 10% of females?