Are football referees really biased and inconsistent?

In professional football disciplinary transgressions by players and sanctions taken by referees provide a rich source of subject material for debate among pundits, journalists and the general public. Although newspaper and television pundits routinely and piously deplore incidents involving foul play or physical confrontation, there is no doubt that a violent incident, immediately followed by the referee’s theatrical action of brandishing a yellow or red card in the direction of the miscreant, makes an important contribution to the popular appeal of the football match as a spectacle. Due to the ever-increasing scope of television coverage of football especially at the highest level, together with improvements in video technology, the actions of players and referees have never been more keenly and intensely scrutinized than they are in the modern-day game.

Here you have performance analysis in football the margins separating success from failure can be slender, and often depend ultimately on split-second decisions taken by referees and players in the heat of battle. Yet the financial implications of success or failure for individual football clubs and their players can be huge. The football authorities are under intense pressure from all sides to take steps to ensure that refereeing decisions are as fair, consistent and accurate as is humanly possible. In an attempt to learn more about disciplinary sanction we carried out a statistical analysis of patterns in the incidence (or frequency) of disciplinary sanction using data on the 4,940 matches played in the English Premier League between the 1996-97 and 2008-09 seasons, inclusive.

The statistical analysis allows for tests of several hypotheses concerning patterns in the incidence of disciplinary sanction, including:

  • A home advantage hypothesis: the tendency for away teams to incur more disciplinary points than home teams is solely a corollary of home advantage, or the tendency for home teams to win more often than away teams
  • A refereeing consistency hypothesis: the propensity to take disciplinary action does not vary between referees
  • A time consistency hypothesis: the overall incidence of disciplinary sanction is stable over time and unaffected by changes to the content or interpretation of the rules

The following questions are also examined in the analysis:

  • Does the average rate of disciplinary sanction against each team depend upon which team is the favorite and which is the underdog?
  • Does it depend upon whether the match itself is competitive (between two evenly balanced teams) or noncompetitive?
  • Does it depend upon whether end-of-season outcomes are at stake for either team?
  • Is it affected by the stadium audience?

The incidence of disciplinary sanction and variation across referees

We attempt to explain the variation in the total number of disciplinary ‘points’ incurred by the home and away teams in each match. The disciplinary points are calculated by awarding one point for a yellow card and two points for a red card. Table 1 (below) shows the frequency distribution for the number of disciplinary points incurred by the home and away teams in the 4,940 matches during the sample period. For example, there were 340 matches where neither side incurred disciplinary points, 443 matches in which the home team received no disciplinary points but the away team incurred one point, and so on. In our statistical models, the number of disciplinary points that the home and away teams can expect to incur in each match is conditional upon a number of factors that vary from match to match (and these relate to the hypotheses and questions of interest mentioned above).

Inconsistency in the standards applied by different referees is among the most frequent causes of complaint from football managers, players, supporters and media pundits. Table 2 summarises the average numbers of disciplinary points per match awarded against the home and away teams and against both teams combined, by each referee who officiated at least 30 Premier League matches during the study period. There appears to be considerable variation between the propensities for individual referees to take disciplinary action. For example, the most lenient referee (Keith Burge) averaged 2.526 disciplinary points per match over 57 matches, and the most prolific (Mike Reed) averaged 4.541 points over 85 matches.

Are referees inconsistent?

Does the variation in refereeing standards suggested by Table 2 constitute statistical evidence of inconsistency in refereeing standards? Our results say it does. In other words, individual referee effects make a significant contribution to the expected number of disciplinary points incurred by the home and away teams in each match, indicating that there are inconsistencies between referees in the interpretation or application of the rules. There is also evidence of variation between referees in the degree of home team bias and this variation contributes to the overall pattern of refereeing inconsistency.

Our research also found:

  • The tendency for away teams to incur more disciplinary points than home teams cannot be explained solely by the home advantage effect on match results. Even after controlling for team quality, a (relatively strong) away team can expect to collect more disciplinary points than a (relatively weak) home team with the same win probability.
  • The incidence of disciplinary sanction tends to be higher in matches between evenly balanced teams, in matches with end-of-season outcomes at stake, and in matches that attract high attendances.
  • Home teams appear to play more aggressively in front of larger crowds, but perhaps surprisingly crowd size does not influence the incidence of disciplinary sanction against the away team.
  • Despite an increase over time in the number of offences subject to disciplinary sanction, there is no consistent time-trend in the data: players and officials appear to have adjusted to changes in the rules so that in the long run the rate of disciplinary sanction remained approximately constant.

Conclusion

By providing a comprehensive statistical analysis of patterns in the award of yellow and red cards, this study has given the English football authorities and other interested parties a firm factual basis for policy decisions and debate concerning the interpretation and implementation by referees of the rules governing disciplinary sanction.

References

P Dawson, SM Dobson, JA Goddard and J Wilson (2007), Are football referees really biased and inconsistent? Evidence on the incidence of disciplinary sanction in the English Premier League. Journal of the Royal Statistical Society, Series A, 170 No.1 pp 231-250.

SM Dobson and JA Goddard (2011), The Economics of Football, Cambridge University Press (2nd edition).

Table 1 Frequency distribution for home team and away team disciplinary points

HOME AWAY
0 1 2 3 4 5+ Total
0 340 443 277 172 85 26 1343
1 215 460 413 258 119 75 1540
2 124 311 292 215 104 90 1136
3 35 126 160 119 79 54 573
4 10 36 64 65 32 26 233
5+ 4 19 34 30 15 13 115
Total 728 1395 1240 859 434 284 4940

Source: The Football Association

Table 2 Average number of disciplinary points awarded per match by referee

MATCHES HOME TEAM AWAY TEAM BOTH TEAMS
Reed, Mike 85 1.788 2.753 4.541
Willard, Gary 60 1.900 2.350 4.250
Harris, Rob 52 1.750 2.327 4.077
Barber, Graham 168 1.690 2.327 4.018
Riley, Mike 282 1.709 2.309 4.018
Dean, Mike 193 1.798 2.114 3.912
Dowd, Phil 148 1.527 2.291 3.818
Clattenburg, Mark 87 1.897 1.874 3.770
Wilkes, Clive 30 1.400 2.333 3.733
Styles, Rob 211 1.645 2.076 3.720
Knight, Barry 75 1.600 2.080 3.680
Bennett, Steve 225 1.516 2.107 3.622
D’Urso, Andy 119 1.479 2.109 3.588
Messias, Matt 45 1.489 2.089 3.578
Mason, Lee 39 1.641 1.923 3.564
Poll, Graham 276 1.514 2.040 3.554
Bodenham, Martin 44 1.455 2.045 3.500
Lodge, Stephen 102 1.392 2.108 3.500
Webb, Howard 150 1.473 2.027 3.500
Jones, Peter 112 1.411 1.991 3.402
Rennie, Uriah 176 1.614 1.773 3.386
Ashby, Gerald 33 1.212 2.152 3.364
Wilkie, Alan 81 1.358 1.975 3.333
Elleray, David 129 1.295 1.984 3.279
Atkinson, Martin 98 1.418 1.857 3.276
Barry, Neale 157 1.318 1.949 3.268
Dunn, Steve 178 1.331 1.882 3.213
Marriner, Andre 58 1.155 2.052 3.207
Wiley, Alan 248 1.448 1.742 3.190
Alcock, Paul 78 1.000 2.026 3.026
Winter, Jeff 165 1.121 1.891 3.012
Foy, Chris 127 1.260 1.717 2.976
Gallagher, Dermot 194 1.191 1.716 2.907
Walton, Peter 100 1.200 1.700 2.900
Durkin, Paul 169 1.166 1.450 2.615
Halsey, Mark 217 1.037 1.498 2.535
Burge, Keith 57 0.877 1.649 2.526