Skip to main navigationSkip to main content
The University of Southampton
Practical Applications of Statistics in the Social SciencesResearch Question 3: GCSE Scores

Multiple Linear Regression

How do both sex and ethnicity influence total GCSE scores?

 

In social science research, it is rare that we would only want to include one explanatory variable in our regression analysis. It is more likely that we would want to investigate the effect that two or more factors have on an outcome, such as total GCSE score. This might be because variables may measure the same thing or have similar relationships – we want to know what the relationship is controlling for other variables. Multiple linear regression allows us to obtain predicted values for specific variables under certain conditions, such as GCSE scores between sexes, while controlling for the influence of other factors, such as ethnicity.

We are now going to add additional explanatory variables to our regression model and learn how to make predictions using a multiple linear regression model.

Multiple regression model:

Using the same procedure outlined on the previous pages for a simple model, you can fit a linear regression model with s1gcseptsnew as the dependent variable and both s1gender1 and the dummy variables for ethnic group as explanatory variables.

To fit a multiple linear regression, select Analyze, Regression, and then Linear.

In the dialogue box that appears, move s1gcseptsnew to the Dependent(s) box and s1gender1, MIXED, ASIAN, BLACK, and OTHER in the Independent(s) box. (Remember we are still using WHITE as a baseline, so you do not need to include this dummy variable in your multiple linear regression model.)

 

Your output should look like the tables on the right.

RQ3 MLR image
Multiple Linear Regression Output

For this model, our regression equation is:

s1gcseptsnew = 381.242 + (s1eth2 constant x ethnicity) + (s1gender1 constant x sex)

After our multiple linear regression, our predicted values are:

s1gcseptsnew = 381.242 + (-49.069 x 1) + (23.995 x 0) = 332.173 (Black Male)
s1gcseptsnew = 381.242 + (-49.069 x 1) + (23.995 x 1) = 356.168 (Black Female)
s1gcseptsnew = 381.242 + (7.646 x 1) + (23.995 x 0) = 388.888 (Asian Male)
s1gcseptsnew = 381.242 + (7.646 x 1) + (23.995 x 1) = 412.883 (Asian Female)
s1gcseptsnew = 381.242 + (3.623 x 1) + (23.995 x 0) = 384.865 (Mixed Male)
s1gcseptsnew = 381.242 + (3.623 x 1) + (23.995 x 1) = 408.86 (Mixed Female)
s1gcseptsnew = 381.242 + (29.328 x 1) + (23.995 x 0) = 410.570 (Other Male)
s1gcseptsnew = 381.242 + (29.328 x 1) (23.995 x 1) = 434.565 (Other Female)
s1gcseptsnew = 381.242 + (23.995 x 0) = 340.693 (White Male)
s1gcseptsnew = 381.242 + (23.995 x 1) = 340.693 (White Female)


Taking into consideration the trends we saw in the gender and ethnicity simple linear regression models, how closely do the results of our multiple linear regression follow the established patterns? Do women still have higher total GCSE scores? Are differences with respect to ethnicity still seen?

Run another multiple linear regression, including s1truan in the model along with s1gender1 and the ethnicity dummy variables. You’ll need to create dummy variables for the categories in s1truan, and then select one of them to be the baseline category, remembering to leave that baseline category out of the multiple linear regression model. Do the predicted scores change at all when you control for the influence of s1truan? Are the trends we saw previously still illustrated in this model?

Summary

 

Here, we’ve used multiple linear regression to determine the statistical significance of GCSE scores while controlling for sex and ethnic background. We’ve learned that there are still statistically significant relationships between GCSE score and ethnicity, and between GCSE score and sex. Finally, we’ve used the ethnicity and sex coefficients presented to us in the multiple linear regression to predict GCSE scores for people falling into the various ethnicity and sex categories.

Note: as we are making changes to a dataset we’ll continue using, please make sure to save your changes before you close down SPSS. This will save you having to repeat sections you’ve already completed.

Useful Downloads

Need the software?PDF Reader
Privacy Settings