Chapter 9 Process and Presentation

9.1 Raw Data

Key Skills:

  • s13 Be able to select suitable techniques for processing raw data.
  • s14 Be able to clean data including dealing with missing data and outliers

9.2 Graphs, Charts and Summary Measures

Key Skills:

  • s15 Be able to select suitable data displays and summary measures to show the main features of raw data.
  • s16 Be able to use data displays to check whether distributions being used are realistic.

9.3 Parameters and Inputs; Calculations

Key Skills:

  • s17 Use standard statistical notation for samples.
  • s18 Be able to use sample data to estimate the parameters of a distribution or the inputs for a procedure or model.
  • s19 Be able to use the statistical functions of a calculator to find the mean and standard deviation.
  • s20 Understand the use of a datum level as a base for measurement or calculation.
  • s21 Know how the mean and standard deviation are affected by linear transformations.
  • s22 Be able to substitute input values into a model or procedure.

9.4 The Normal Distribution in Depth

Key Skills:

  • u1 Be able to use the Normal distribution as a model and recognise when it is likely to be appropriate to do so.
  • u2 Be able to standardise a value from a Normal distribution with a given mean and standard deviation.
  • u3 Use the Normal distribution to estimate population proportions in the context of a problem.

9.5 The Chi Squared Test

Key Skills:

  • h3 Be able to apply the χ² hypothesis test to data in a contingency table.
  • h4 Be able to interpret the χ² results of a test.

9.6 Bivariate Data

Key Skills:

  • b1 Know the vocabulary associated with bivariate data.

9.6.1 Relationships between variables

Real life data often comes in pairs. Scratch that, it often comes in lists! When we gather data about a group of people, say, we will gain a whole list of information about each individual. Bivariate data analysis (literally two variables) is the first step on a long road of statistical techniques that enables you to seek out and find the relationships between that long list of variables.

Lucky for you we will only be looking at data that comes in pairs!

For example even though in the country data below you have access to many different variables at this level you will only analyse a pair of variables suc as the relationship between GDP per capita and population or life expectancy and birth rate.

Table 9.1: Country Data.
Country Sub.region population population.ranking life.expectancy life.expectancy.at.birth.ranking birth.rate.per.1000 birth.rate.ranking GDP.per.capita…. GDP.per.capita.ranking
Algeria Africa (Saharan) 38813722 33 76.39 79 23.99 63 7500 137
Egypt Africa (Saharan) 86895099 15 73.45 121 23.35 68 6600 144
Libya Africa (Saharan) 6244174 107 76.04 85 18.40 104 11300 109
Morocco Africa (Saharan) 32987206 38 76.51 77 18.47 100 5500 155
Tunisia Africa (Saharan) 10937521 78 75.68 91 16.90 112 9900 118
Angola Africa (Sub-Saharan) 19088106 58 55.29 204 38.97 9 6300 147
Benin Africa (Sub-Saharan) 10160556 87 61.07 190 36.51 20 1600 202
Botswana Africa (Sub-Saharan) 2155784 144 54.06 209 21.34 77 16400 82
Burkina Faso Africa (Sub-Saharan) 18365123 59 54.78 206 42.42 5 1500 203
Burundi Africa (Sub-Saharan) 10395931 85 59.55 195 42.33 6 600 225
Cabo Verde Africa (Sub-Saharan) 538535 172 71.57 144 20.72 82 4400 167
Cameroon Africa (Sub-Saharan) 23130708 53 57.35 201 36.58 19 2400 188
Central African Republic Africa (Sub-Saharan) 5277959 117 51.35 217 35.45 23 700 223
Chad Africa (Sub-Saharan) 11412107 76 49.44 222 37.29 16 2500 184
Comoros Africa (Sub-Saharan) 766865 163 63.48 183 29.05 44 1300 209

9.7 Spearman’s Rank Correlation

Key Skills:

  • b2 Know how to calculate Spearman’s rank correlation coefficient and carry out hypothesis tests using it.

9.8 Product Moment Correlation

Key Skills:

  • b3 Be able to use suitable technology to find Pearson’s product moment correlation coefficient and to interpret the correlation coefficient.