Pharmacy exam 2021-01-18

Degrees: Pharmacy, Biotechnology
Date: January 18, 2021

Question 1

The table below contains the differences between the grades in the final school exam and the entrance exam in a sample of public high schools (X) and private high schools (Y):

Public schools1.20.70.40.91.60.50.21.80.8Private schools2.10.50.71.90.22.81

  1. Which of the following box plots corresponds to each variable? Compare the central dispersion of the two variables according to the box plots. In which variable is smaller the median?

image

  1. In which type of schools is more representative the mean of grades?

  2. In which type of schools is more symmetric the distribution of grades?

  3. In which type of schools is more peaked the distribution of grades?

  4. Which difference is relatively smaller, 0.5 points in a public high school or 1 points in a private high school?

Use the following sums for the computations:
Public: xi=5.1, xi2=9.63, (xix¯)3=0.95 and (xix¯)4=8.76.
Private: yi=8.8, yi2=17.64, (yiy¯)3=0.82 and (yiy¯)4=11.28.

  1. The box plot 1 corresponds to private schools and the box plot 2 to public schools. The central dispersion is pretty similar in both variables. The median is smaller in private schools.

  2. Public schools: x¯=0.5667 , s2=0.7489 , s=0.8654 and cv=1.5271.
    Private schools: y¯=1.2571 , s2=0.9396 , s=0.9693 and cv=0.7711.
    Thus, the mean of the grade is more representative in private schools.

  3. g1x=0.1626 and g1y=0.1285. Thus, the distribution of grades in private schools is more symmetric as the coefficient of skewness is closer to 0.

  4. g2x=1.2651 and g2y=1.1748. Thus, the distribution of grades in private schools is more peaked.

  5. Public schools: z(0.5)=0.077.
    Private schools: z(1)=0.2653.
    Thus, a difference of grades -0.5 in a public schools is relatively smaller than a difference of -1 in a private school.

Question 2

An auditor is studying the relationship between the salary and the number of absences of a hospital warden. The following table shows the salary in thousands of euros (X) and the annual average of absences with that salary (Y).

Salary20.022.52527.530.032.535.037.540.0Absences2.32.021.82.21.51.21.30.6

  1. Compute the regression line that best explains the absences as a function of the salary.

  2. What is the expected number of absences that will have a warden with a salary of 29000€? Is this prediction reliable?

  3. How much will the number of absences increase or decrease for every increment of 1000€ in the salary?

Use the following sums for the computations:
xi=270 103€, yi=14.9 absences,
xi2=8475 (103€)2, yi2=27.11 absences2,
xiyj=420 103€ absences.

  1. x¯=30 103€, sx2=41.6667 (103€)2,
    y¯=1.6556 absences, sy2=0.2714 absences2,
    sxy=3 103€ absences.
    Regression line of absences on salary: y=3.81560.072x.

  2. y(29)=1.7276 absences.
    r2=0.796, thus the model fits well as the coefficient of determination is not far from 1, but the sample size is too small to be reliable the prediction.

  3. The number of absences will decrease 0.072 for every increment of 1000€ in the salary.

Question 3

In a regression study it is known that the regression line of Y on X is y+2x10=0 and the regression line of X on Y is y+3x14=0.

  1. Compute the means of X and Y.

  2. Compute the linear correlation coefficient and interpret it.

  1. x¯=4 and y¯=2.

  2. r=0.8165. The linear correlation coefficient is near -1 so there is a strong inverse relation between X and Y.

Question 4

A test to detect prostate cancer produces 1% of false positives and 0.2% false negatives. It is known that 1 in 400 males suffer this type of cancer.

  1. Compute the sensitivity and the specificity of the test.

  2. If a male got a positive outcome in the test, what is the chance of developing cancer?

  3. Compute and interpret the negative predictive value.

  4. Is this test better to predict or to rule out the cancer?

  5. To study whether there is an association between the practice of sports and this type of cancer, a sample of 1000 males was drawn, of which 700 practised sports, and it was observed that there were 2 males with cancer in the group of males who practised sports, and there were 3 males with cancer in the group of males who did not practice sports. Compute the relative risk and the odds ratio and interpret them.

Let D the event corresponding to suffering prostate cancer and + and the events corresponding to get a positive and a negative outcome respectively.

  1. The sensitivity is P(+|D)=0.2 and specificity P(|D)=0.99.

  2. Positive predictive value P(D|+)=0.0476.

  3. Negative predictive value P(D|)=0.998.

  4. As the positive predictive value is smaller than the negative predictive value, this test is better to rule out the disease. In fact, we can not use this test to detect the prostate cancer because the positive predictive value is less than 0.5.

  5. RR(D)=0.2857 and OR(D)=0.2837. Thus, there is an association between the practice of sports and the prostate cancer and the risks and the odds of developing cancer is almost one fourth smaller if the male practice sports.

Question 5

The probability that a child of a mother with the color-blind gene and a father without the color-blind gene is a color-blind male is 0.25. It is also known that in a population there is one color-blind male for every 5000 males.

  1. If this couple has 5 children, what is the probability that at most 2 of them are color-blind males?

  2. If this couple has 5 children, and the gender of the children is equiprobable, what is the probability that 3 or more are females?

  3. In a random sample of 10000 males of this population, what is the probability that more than 3 are color-blind males?

  1. Let X be the number of color-blind sons in a sample of 5 children, then XB(5,0.25) and P(X2)=0.8965.

  2. Let Y be the number of girls in a sample of 5 children, then YB(5,0.5) and P(Y3)=0.5.

  3. Let Z be the number of color-blind males in a sample of 10000 males, then ZB(10000,0.0002)P(2) and P(Z>3)=0.1429.

Question 6

The primate cranial capacity follows a normal distribution with mean 1200 cm3 and standard deviation 140 cm3.

  1. Compute the probability that the cranial capacity of a primate is greater than 1400 cm3.

  2. Compute the probability that the cranial capacity of a primate is exactly than 1400 cm3.

  3. Above what cranial capacity will 20% of primates be?

  4. Compute the interquartile range of the cranial capacity of primates and interpret it.

Let X be the primate cranial capacity. Then XN(1200,140).

  1. P(X>1400)=0.0766.

  2. P(X=1400)=0.

  3. P80=1317.827 cm3.

  4. Q1=1105.5714 cm3, Q3=1294.4286 cm3 and IQR=188.8571 cm3. Thus the 50% of central data will be concentranted in an interval of width 188.8571 cm3, that is a small spread.

Previous
Next