Problems of Descriptive Statistics
Exercise 1
The number of injuries suffered by each member of a soccer team in a league was:
0 1 2 1 3 0 1 0 1 2 0 1 1 1 2 0 1 3 2 1 2 1 0 1
Calculate the following statistics and interpret them.
- Mean.
- Median.
- Mode.
- Quartiles.
- 32nd percentile.
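$\bar x = 1.125$ injuries. $Me = 1$ injury. $Mo = 1$ injury. $Q_1 = 0.5$ injuries (the midpoint of the 6th and 7th ordered values), $Q_2 = 1$ injury and $Q_3 = 2$ injuries. $P_{32} = 1$ injury.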
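The statistics requested in this exercise can be checked with a short Python sketch that uses only the standard library. The `quantile` helper is a hypothetical name and implements just one textbook convention: read the quantile off the ordered sample and average two neighbouring values when the position $n p$ is a whole number. Other conventions, such as the interpolation defaults of statistical packages, may give a different first quartile for this sample.

```python
from statistics import mean, median, mode

injuries = [0, 1, 2, 1, 3, 0, 1, 0, 1, 2, 0, 1,
            1, 1, 2, 0, 1, 3, 2, 1, 2, 1, 0, 1]

def quantile(data, p):
    """Quantile of order p under one convention (an assumption, see above)."""
    xs = sorted(data)
    k = len(xs) * p                    # position of the quantile in the ordered sample
    if k.is_integer() and 0 < k < len(xs):
        i = int(k)
        return (xs[i - 1] + xs[i]) / 2  # whole position: average the two neighbours
    return xs[int(k)]                   # otherwise take the next ordered value

print("mean:", mean(injuries))
print("median:", median(injuries))
print("mode:", mode(injuries))
print("quartiles:", [quantile(injuries, p) for p in (0.25, 0.5, 0.75)])
print("32nd percentile:", quantile(injuries, 0.32))
```

Because the data are small counts with many ties, the choice of quantile convention matters more here than it would for a continuous variable.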
Exercise 2
The chart below shows the cumulative distribution of the time (in min) required by 66 students to do an exam.
- At what time have half of the students finished? And 90% of students?
- What percentage of students have finished after 100 minutes?
- What value best represents the time required by the students in the sample to finish the exam? Is this value representative or not?
Exercise 3
In a study about children’s growth, two samples were drawn, one of newborn babies and the other of one-year-old infants. The heights in cm of the children in each sample were:
Newborn children: 51 50 51 53 49 50 53 50 47 50
One year old children: 62 65 69 71 65 66 68 69
In which group is the mean more representative? Justify your answer.
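Newborn children: $\bar x = 50.4$ cm, $s = 1.685$ cm and $cv = 0.033$.
One year old children: $\bar x = 66.875$ cm, $s = 2.713$ cm and $cv = 0.041$.
The standard deviations are computed with divisor $n$. The coefficient of variation is lower for the newborn children, so their mean is more representative.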
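Representativeness of the mean is usually judged with the coefficient of variation, $cv = s / \bar x$. The sketch below compares the two samples. It assumes the population standard deviation (divisor $n$), which is what `statistics.pstdev` computes; `coef_variation` is a hypothetical helper name. Using the sample standard deviation (divisor $n-1$) changes the figures slightly but not the ordering of the two groups.

```python
from statistics import fmean, pstdev

newborn = [51, 50, 51, 53, 49, 50, 53, 50, 47, 50]
one_year = [62, 65, 69, 71, 65, 66, 68, 69]

def coef_variation(data):
    """Population standard deviation (divisor n) divided by the mean."""
    return pstdev(data) / fmean(data)

for label, sample in (("Newborn", newborn), ("One year old", one_year)):
    print(f"{label}: mean={fmean(sample):.3f} cm, "
          f"s={pstdev(sample):.3f} cm, cv={coef_variation(sample):.3f}")
```

The group with the smaller coefficient of variation is the one whose mean represents its data better.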
Exercise 4
To determine the accuracy of a method for measuring hematocrit in blood, the measurement was repeated 8 times on the same blood sample. The results of hematocrit in plasma, in percentage, were
42.2 42.1 41.9 41.8 42 42.1 41.9 42
What do you think about the accuracy of the method?
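One way to approach this question is to measure how tightly the eight readings cluster. The sketch below computes the mean, the standard deviation and the coefficient of variation, assuming the population formula (divisor $n$). Repeated readings of a single sample describe the repeatability (precision) of the method; whether the readings are centred on the true hematocrit cannot be judged without a reference value, which the exercise does not give.

```python
from statistics import fmean, pstdev

hematocrit = [42.2, 42.1, 41.9, 41.8, 42, 42.1, 41.9, 42]

m = fmean(hematocrit)                       # centre of the repeated readings
s = pstdev(hematocrit)                      # spread, with divisor n
cv = s / m                                  # spread relative to the mean
spread = max(hematocrit) - min(hematocrit)  # range of the readings

print(f"mean={m:.2f} %, s={s:.4f} %, cv={cv:.4f}, range={spread:.1f} %")
```

A coefficient of variation well below 1% indicates that the readings barely vary from one repetition to the next.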
Exercise 5
The histogram below shows the frequency distribution of the body mass index (BMI) of a group of people by gender.
- Draw a pie chart of gender.
- In which group is the mean BMI more representative?
- Calculate the mean for the whole sample.
Exercise 6
The following table represents the frequency distribution of ages at which a group of people suffered a heart attack.
age | persons |
---|---|
[40,50) | 6 |
[50,60) | 12 |
[60,70) | 23 |
[70,80) | 19 |
[80,90) | 5 |
Could we assume that the sample comes from a normal population?
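Use the following sums, where $x_i$ are the class marks and $n_i$ the frequencies: $\sum x_i n_i = 4275$ years, $\sum x_i^2 n_i = 288625$ years$^2$, $\sum (x_i-\bar x)^3 n_i = -18248.52$ years$^3$ and $\sum (x_i-\bar x)^4 n_i = 2099635.87$ years$^4$.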
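A common descriptive check for normality looks at the coefficient of skewness $g_1$ and the coefficient of kurtosis $g_2$ (excess kurtosis), both of which are 0 for a normal distribution. The sketch below computes them from the table, assuming each interval is represented by its class mark and that moments use divisor $n$. The rule of thumb of accepting normality when both coefficients lie between $-2$ and $2$ is an assumption here; the exercise does not state its criterion.

```python
from math import sqrt

# (lower limit, upper limit, persons) copied from the table
classes = [(40, 50, 6), (50, 60, 12), (60, 70, 23), (70, 80, 19), (80, 90, 5)]

marks = [(lo + hi) / 2 for lo, hi, _ in classes]  # class marks 45, 55, ..., 85
freqs = [f for _, _, f in classes]
n = sum(freqs)

mean = sum(x * f for x, f in zip(marks, freqs)) / n

def central_moment(k):
    """k-th central moment of the grouped data (divisor n)."""
    return sum(f * (x - mean) ** k for x, f in zip(marks, freqs)) / n

s = sqrt(central_moment(2))
g1 = central_moment(3) / s ** 3      # skewness
g2 = central_moment(4) / s ** 4 - 3  # excess kurtosis

print(f"mean={mean:.3f}, s={s:.3f}, g1={g1:.4f}, g2={g2:.4f}")
print("within (-2, 2):", -2 < g1 < 2 and -2 < g2 < 2)
```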
Exercise 7
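To compare two rehabilitation treatments, A and B, the table below gives the number of patients on each treatment by the number of days of rehabilitation.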
Days | A | B |
---|---|---|
20-40 | 5 | 8 |
40-60 | 20 | 15 |
60-80 | 18 | 20 |
80-100 | 7 | 7 |
- In which treatment is the mean more representative?
- In which treatment is the distribution of days more skewed?
- In which treatment is the distribution more peaked?
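Use the following sums, where $x_i$ are the class marks:
$A$: $\sum x_i n_i = 3040$ days, $\sum x_i^2 n_i = 199400$ days$^2$, $\sum (x_i-\bar x)^3 n_i = 17011.2$ days$^3$ and $\sum (x_i-\bar x)^4 n_i = 9989602.56$ days$^4$.
$B$: $\sum x_i n_i = 3020$ days, $\sum x_i^2 n_i = 199400$ days$^2$, $\sum (x_i-\bar x)^3 n_i = -42393.6$ days$^3$ and $\sum (x_i-\bar x)^4 n_i = 12551516.16$ days$^4$.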
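- $A$: $\bar x = 60.8$ days, $s = 17.069$ days and $cv = 0.281$.
  $B$: $\bar x = 60.4$ days, $s = 18.435$ days and $cv = 0.305$.
  The mean is more representative in treatment $A$, as its coefficient of variation is lower.
- $g_{1A} = 0.068$ and $g_{1B} = -0.135$, so the distribution of treatment $B$ is more skewed than the one of treatment $A$ as $|g_{1B}| > |g_{1A}|$.
- $g_{2A} = -0.646$ and $g_{2B} = -0.826$, so the distribution of treatment $A$ is more peaked than the one of treatment $B$ as $g_{2A} > g_{2B}$.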
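The three comparisons rest on the coefficient of variation, the skewness $g_1$ and the excess kurtosis $g_2$ of each treatment. The sketch below computes them from the frequency table, assuming each interval is represented by its class mark (30, 50, 70, 90) and that moments use divisor $n$. `grouped_stats` is a hypothetical helper name.

```python
from math import sqrt

marks = [30, 50, 70, 90]  # midpoints of 20-40, 40-60, 60-80, 80-100
freqs = {"A": [5, 20, 18, 7], "B": [8, 15, 20, 7]}

def grouped_stats(xs, fs):
    """Mean, sd, cv, skewness and excess kurtosis of grouped data (divisor n)."""
    n = sum(fs)
    mean = sum(x * f for x, f in zip(xs, fs)) / n
    m2, m3, m4 = (sum(f * (x - mean) ** k for x, f in zip(xs, fs)) / n
                  for k in (2, 3, 4))
    s = sqrt(m2)
    return {"mean": mean, "s": s, "cv": s / mean,
            "g1": m3 / s ** 3, "g2": m4 / s ** 4 - 3}

stats = {name: grouped_stats(marks, fs) for name, fs in freqs.items()}
for name, st in stats.items():
    print(name, {key: round(value, 3) for key, value in st.items()})
```

The smaller `cv` marks the more representative mean, the larger `|g1|` the more skewed distribution, and the larger `g2` the more peaked one.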
Exercise 8
The systolic blood pressures (in mmHg) of a sample of people were:
135 128 137 110 154 142 121 127 114 103
- Calculate the central tendency statistics.
- How large is the relative dispersion with respect to the mean?
- How skewed is the sample distribution?
- What is the kurtosis of the sample distribution?
- If we know that the method used for measuring the blood pressure is biased, and, in order to get the right values, we have to apply the linear transformation
, what are the statistics values of parts (a) to (d) for the new, corrected distribution?
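Use the following sums: $\sum x_i = 1271$ mmHg, $\sum x_i^2 = 163733$ mmHg$^2$, $\sum (x_i-\bar x)^3 = 2764.32$ mmHg$^3$ and $\sum (x_i-\bar x)^4 = 1040079.94$ mmHg$^4$.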
$\bar x = 127.1$ mmHg, $Me = 127.5$ mmHg, $Mo$ = all the values. $s = 14.795$ mmHg and $cv = 0.116$. $g_1 = 0.085$. $g_2 = -0.829$.
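The statistics for the uncorrected readings can be checked with the sketch below, which assumes moments with divisor $n$. It covers only the original data; the linear transformation of the last part is not reproduced here. `multimode` returns every value that ties for the highest frequency, so for a sample with no repeated value it returns all of them.

```python
from math import sqrt
from statistics import fmean, median, multimode

pressure = [135, 128, 137, 110, 154, 142, 121, 127, 114, 103]
n = len(pressure)

mean = fmean(pressure)
med = median(pressure)
modes = multimode(pressure)  # all values tie when none is repeated

def central_moment(k):
    """k-th central moment of the sample (divisor n)."""
    return sum((x - mean) ** k for x in pressure) / n

s = sqrt(central_moment(2))
cv = s / mean
g1 = central_moment(3) / s ** 3      # skewness
g2 = central_moment(4) / s ** 4 - 3  # excess kurtosis

print(f"mean={mean} mmHg, median={med} mmHg, number of modes={len(modes)}")
print(f"s={s:.3f} mmHg, cv={cv:.3f}, g1={g1:.3f}, g2={g2:.3f}")
```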
Exercise 9
The table below contains the frequency of pregnancies, abortions and births of a sample of 999 women in a city.
Num | Pregnancies | Abortions | Births |
---|---|---|---|
0 | 61 | 751 | 67 |
1 | 64 | 183 | 80 |
2 | 328 | 51 | 400 |
3 | 301 | 10 | 300 |
4 | 122 | 2 | 90 |
5 | 81 | 2 | 62 |
6 | 29 | 0 | 0 |
7 | 11 | 0 | 0 |
8 | 2 | 0 | 0 |
- How many birth outliers are in the sample?
- Which variable has the lowest spread relative to its mean?
- Which value is relatively higher, 7 pregnancies or 4 abortions? Justify your answer.
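Use the following sums:
Pregnancies: $\sum x_i n_i = 2783$ and $\sum x_i^2 n_i = 9773$.
Abortions: $\sum x_i n_i = 333$ and $\sum x_i^2 n_i = 559$.
Births: $\sum x_i n_i = 2450$ and $\sum x_i^2 n_i = 7370$.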
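- For births, $Q_1 = 2$ and $Q_3 = 3$, so the fences are $0.5$ and $4.5$, and the women with 0 or 5 births are outliers: $67 + 62 = 129$ outliers.
- Pregnancies: $\bar x = 2.786$, $s = 1.422$ and $cv = 0.510$.
  Abortions: $\bar x = 0.333$, $s = 0.670$ and $cv = 2.009$.
  Births: $\bar x = 2.452$, $s = 1.167$ and $cv = 0.476$.
  Births has the lowest spread relative to its mean.
- Standard score of 7 pregnancies is $2.96$, and standard score of 4 abortions is $5.48$, so 4 abortions is the relatively higher value.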
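The three questions can be checked directly from the frequency table with the sketch below. It assumes outliers are defined by the usual fences $Q_1 - 1.5\,IQR$ and $Q_3 + 1.5\,IQR$, that a quantile is the smallest value whose cumulative frequency reaches $p \cdot n$, and that the standard deviation uses divisor $n$. The helper names `mean_sd`, `quantile` and `z_score` are hypothetical.

```python
from math import sqrt

counts = range(9)  # the "Num" column: 0 to 8
freq = {
    "pregnancies": [61, 64, 328, 301, 122, 81, 29, 11, 2],
    "abortions":   [751, 183, 51, 10, 2, 2, 0, 0, 0],
    "births":      [67, 80, 400, 300, 90, 62, 0, 0, 0],
}

def mean_sd(fs):
    """Mean and standard deviation (divisor n) of a frequency distribution."""
    n = sum(fs)
    mean = sum(x * f for x, f in zip(counts, fs)) / n
    var = sum(f * (x - mean) ** 2 for x, f in zip(counts, fs)) / n
    return mean, sqrt(var)

def quantile(fs, p):
    """Smallest value whose cumulative frequency reaches p * n."""
    target, cum = p * sum(fs), 0
    for x, f in zip(counts, fs):
        cum += f
        if cum >= target:
            return x

# Birth outliers: values outside the fences built from the quartiles
births = freq["births"]
q1, q3 = quantile(births, 0.25), quantile(births, 0.75)
low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
n_outliers = sum(f for x, f in zip(counts, births) if x < low or x > high)

# Relative spread of each variable
stats = {name: mean_sd(fs) for name, fs in freq.items()}
cv = {name: sd / mean for name, (mean, sd) in stats.items()}

def z_score(name, x):
    """Standard score of the value x within the named variable."""
    mean, sd = stats[name]
    return (x - mean) / sd

z_preg = z_score("pregnancies", 7)
z_abort = z_score("abortions", 4)

print("birth outliers:", n_outliers)
print({name: round(value, 3) for name, value in cv.items()})
print(f"z(7 pregnancies)={z_preg:.2f}, z(4 abortions)={z_abort:.2f}")
```

Standard scores put both values on a common scale, which is what makes a count of pregnancies comparable with a count of abortions.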