Pharmacy exam 2018-10-29 Degrees: Pharmacy, Biotechnology Date: October 29, 2018 Question 1 A study about obesity in a city has measured the body mass index (BMI) in a sample. The collected data is shown in the table below. $$ \begin{array}{lr} \mbox{BMI} & \mbox{Persons} \newline \hline 15-18 & 5 \newline 18-21 & 62 \newline 21-24 & 72 \newline 24-27 & 45 \newline 27-30 & 12 \newline 30-33 & 2 \newline 33-36 & 1 \newline 36-39 & 1 \newline \hline \end{array} $$ Compute the percentage of people with a BMI between 19 and 25. Which is the BMI with a 20% of persons above it? Are there outliers in the sample? Give the outliers if there are some. Solution Non interpolating: $F(19)\approx 0.335$ and $F(25)\approx 0.920$, so the percentage of people between 19 and 25 is 58.5% approximately. $P_{80}\approx 25.5$. $Q_1\approx 19.5$, $Q_3\approx 25.5$, $IQR\approx 6$, $f_1\approx 10.5$ and $f_2\approx 34.5$. Thus there is at leats one outlier in the interval (36-39). Interpolating: $F(19)=0.1283$ and $F(25)=0.77$, so the percentage of people between 19 and 25 is 64.17% $P_{80}=25.4$. $Q_1=20.1774$, $Q_3=24.7333$, $IQR=4.5559$, $f_1=13.3435$ and $f_2=31.5671$. Thus there are at leats two outliers in the intervals (33-36) and (36-39). Question 2 A gene of a rat species has been modified to help the metabolization of cholesterol in blood. To check the effectiveness of this genetic modification two samples of 20 rats were drawn, ones with the gene modified and the others not, and they were fed with the same diet with different concentrations of palm oil during one month. The following sums summarize the results: Palm oil quantity in gr (the same in both samples) $\sum x_i=640.6467$ gr, $\sum x_i^2=23508.6387$ gr², $\sum(x_i-\bar x)^3=-5527.08$ gr³, $\sum(x_i-\bar x)^4=792910$ gr⁴ Cholesterol level in blood in mg/dl of non genetically modified rats $\sum y_j=2945.8545$ mg/dl, $\sum y_j^2=439517.5975$ (mg/dl)², $\sum(y_j-\bar y)^3=604.08$ (mg/dl)³, $\sum(y_j-\bar y)^4=3717331.07$ (mg/dl)⁴ $\sum x_iy_j=98156.0658$ gr$\cdot$mg/dl. Cholesterol level in blood in mg/dl of genetically modified rats $\sum y_j=2126.5899$ mg/dl, $\sum y_j^2=226824.5373$ (mg/dl)², $\sum(y_j-\bar y)^3=-629.4$ (mg/dl)³, $\sum(y_j-\bar y)^4=48248.29$ (mg/dl)⁴ $\sum x_iy_j=69517.3648$ gr$\cdot$mg/dl. In which sample the cholesterol has a more representative mean, genetically modified or non modified rats? In which sample the distribution of cholesterol is more skew? In which sample the kurtosis of the distribution of cholesterol is less normal? Which rat has a cholesterol level relatively bigger, a genetically modified rat with a cholesterol level of 130 mg/dl, or a non genetically modified rat with a cholesterol level of 145 mg/dl? In which sample the regression line of cholesterol on the palm oil quantity fits better? According to the regression line, what level of cholesterol is expected for a genetically modified rat with a diet of 25 gr of palm oil? And for a non genetically modified rat? What amount of palm oil must be supplied to a non genetically modified rat to have a cholesterol level smaller than 150 mg/dl? Is this prediction reliable? Solution Non genetically modified rats: $\bar y=147.2927$ mg/dl, $s^2_y=280.7332$ (mg/dl)², $s=16.7551$ mg/dl and $cv_y=0.1138$. Genetically modified rats: $\bar y=106.3295$ mg/dl, $s^2_y=35.265$ (mg/dl)², $s=5.9384$ mg/dl and $cv_y=0.0558$. Thus, the mean of genetically modified rats is more representative since the coef. of variation is smaller. Non genetically modified rats: $g_1=0.0064$. Genetically modified rats: $g_1-0.1503$ Thus, the distribution of genetically modified rats is more skew since the coef. of skewness is further from 0. Non genetically modified rats: $g_2=-0.6416$. Genetically modified rats: $g_2-1.0602$ Thus, the kurtosis of the distribution of genetically modified rats is less normal since the coef. of kurtosis is further from 0. Non genetically modified rats: $z(145)=-0.1368$. Genetically modified rats: $z(130)=3.986$. Thus, a cholesterol level of 130 mg/dl in genetically modified rats is relatively greater than 145 mg/dl in non genetically modidied rats. $\bar x=32.0323$ gr, $s^2_x=149.3614$ gr². Non genetically modified rats: $s_{xy}=189.6733$ gr$\cdot$mg/dl and $r^2=0.858$. Genetically modified rats: $s_{xy}=69.8861$ gr$\cdot$mg/dl and $r^2=0.9273$. Thus, the regression line fits better in genetically modified rats since the coef. of determination is greater. Regression line of $Y$ on $X$ in non genetically modified rats: $y=106.615+1.2699x$. Prediction: $y(25)=138.3624$ Regression line of $Y$ on $X$ in genetically modified rats: $y=91.3416+0.4679x$. Prediction: $y(25)=103.0391$ Regression line of $X$ on $Y$ in non genetically modified rats: $x=-67.4838+0.6756y$. Prediction: $x(150)=33.8615$. The prediction is very reliable since the coef. of determination is close to 1. Question 3 It is known that the regression line of $Y$ on $X$ has equation $3x+2y-4=0$ and it explains half of the variability of $Y$. According to the linear regression model, how much will $X$ change for each unit that increases $Y$? Solution $r^2=0.5$ and $b_{xy}=-\frac{1}{3}$, so $X$ decreases 1/3 of the increase of $Y$. Exam Statistics Biostatistics Previous Pharmacy exam 2018-11-19 Next Pharmacy exam 2018-01-19