Select Committee on Education and Skills Minutes of Evidence


Memorandum submitted by Roger Porkess (QCA 32)

1.  PERSONAL POSITION

1.1  The argument "Let sleeping dogs lie"

  1.1.1  It can be argued that what happened to A Levels this summer should be kept in perspective. Lots of students don't get the examination results they are hoping for or even perhaps deserve. Examining is not an exact science. Most of those involved are now at University and it is much more important for them to be looking forward and getting on with their new courses than harking back to what might have been. So we should draw a line under the whole episode and forget about it.

  1.1.2  The danger with that argument is that it allows precedent to be established on two key principles.

1.2  Adjusting module thresholds to influence qualification outcomes

  1.2.1  The grading problems occurred because certain modules were marked down in order to reduce the numbers of candidates getting particular A Level grades. This is a fundamentally wrong thing to do in a modular syllabus.

    (i)  It breaks faith with the candidates, in effect secretly moving the goal posts.

    (ii)  It is unsound examining practice since it causes the candidates to be ranked incorrectly.

  1.2.2  Modular A Levels have been around for some 10 years, but, to my knowledge, never before have module thresholds been adjusted to influence qualification outcomes.

1.3  Fairness to candidates

  1.3.1  Until now it has always been a principle of our examinations that the candidates' interests are paramount.

  1.3.2  To the extent that it is humanly possible, every effort has been made to ensure that each candidate receives the correct grade.

  1.3.3  This summer tens of thousands of candidates have received incorrect grades but nothing is being done about it, even though their grades could easily be set right.

1.4  The integrity of A Levels

  1.4.1  The future integrity of A Levels can only be guaranteed if these two key principles are re-established, and that in turn depends on re-grading this summer's candidates.

2.  BACKGROUND

2.1  Curriculum 2000

  2.1.1  Work began on Curriculum 2000 in the middle of 1998 and I was keen to do all I could to help make it a success. During the later part of that year there were a number of seminars on particular issues, most of which I attended. In several cases I followed these up by writing discussion papers to help QCA take matters forward.

  2.1.2  One of these related to the question of how to aggregate AS and A2 marks without causing grade inflation, a matter which was causing concern to those designing Curriculum 2000. In that paper I suggested a mechanism that had worked well with our MEI syllabus for the previous eight years. However, in the event QCA neither accepted my suggestion nor any other but let the curriculum go through with this flaw built in.

  2.1.3  It became clear to me at that time that those involved were uncomfortable with the mathematical nature of the problem. At meetings eyes would glaze over. I suspect that this had a lot to do with its never being resolved.

2.2  Syllabus submissions (1999-2000): specimen papers

(This information is included in the light of some of the questions to earlier witnesses)

  2.2.1  Syllabuses (renamed "specifications") were submitted in 1999 and most were approved towards the end of that year although some dragged on into 2000. These submissions including specimen examination papers and mark-schemes. In the case of the MEI syllabus, the design thresholds are also written into the approved syllabus.

  2.2.2  It is thus untrue to say that the AS and A2 standards were undefined. QCA put a great deal of effort into looking at the specimen papers, and held up approval for substantial periods of time until they were satisfied.

  2.2.3  As it happens, in mathematics QCA got the AS standard badly wrong, contributing to the very high failure rate (29.1%) in June 2001.

  2.2.4  I have seen no evidence of any attempt by QCA to ensure comparability of standards across subjects. It remains the case that pass rates are much higher in the arts subjects than in the science; mathematics remains firmly at the bottom of the list.

3.  A CASE STUDY FROM THIS SUMMER

3.1  Rationale for this section

  3.1.1  At this point I would like, as a case study, to describe the events surrounding the award of one particular syllabus. For reasons of confidentiality this is presented as a separate Appendix.

4.  MODELLING THE SITUATION

4.1  Description

  4.1.1  The rest of this submission is a report that was issued on 15 November 2002.

  4.1.2  Most of this describes the calculations that led me to the conclusion that tens of thousands of candidates have received a lower grade than they should have done.

4.2  Calculations

  4.2.1  While the actual calculations are correct, their validity depends upon assumptions about data which are held by the examinations boards and are not in the public domain.

  4.2.2  Publication of full data would allow a more accurate estimate of the number of candidates affected to be made. In the absence of such data, these figures stand.

  4.2.3  An exact answer to the question "How many candidates?" can only be obtained by re-grading all syllabuses.

Appendix to Section 4 Report: The effect of moving grade thresholds

SUMMARY

  The Tomlinson Inquiry restricted its scope to the most extreme movements of grade thresholds.

  Consequently many of this summer's candidate's have lost an A Level grade.

  As a mathematician I estimate the number affected to be over 20,000.

BACKGROUND

  During this summer's A Level award, there were many cases where the thresholds set by Awarding Committees were subsequently made substantially more severe by the examination boards.

  The reason for this was to ensure that the numbers of students getting high grades were in line with those in 2001, before Curriculum 2000 was introduced.

  Because Curriculum 2000 is modular, where action was taken it involved particular modules take in June 2002.

  Adjusting the results on particular modules to influence the overall outcome is an intrinsically unsound practice; it introduces inconsistency in standards across modules and discriminates against candidates who took certain modules.

  When the Tomlinson Inquiry was announced many of us expected that in all cases where grade thresholds had been moved the original thresholds would be restored, and candidates re-graded. This did not happen.

  Instead a cut-off was decided upon. Only those modules with threshold shifts of six marks or more were considered for re-grading (and two others where there had been many complaints).

  The application of the cut-off will inevitably have left some candidates with a lower A Level grade than would have been the case if all thresholds had been restored to those set by the Awarding Committees.

  The Tomlinson Inquiry did not address the question of how many students lost a grade in this way.

  There is also evidence that where re-grading did occur, the original thresholds were not fully restored.

A STATISTICAL ESTIMATE

  The Tomlinson Inquiry set a cut-off point of a threshold adjustment of six marks on a module examination. A natural group to consider are those just below this cut off point. Here is a question.

    "Thresholds on AS/A modules are increased five marks at one sitting. What percentage of candidates lose an A Level grade as a consequence?"

  Until now this question would seem not to have been answered.

  Perhaps the reason is because there is no single neat mathematical answer. It depends on the mark distribution for the particular A Level syllabus this summer, on how tightly the thresholds are packed together for the modules in question and on how many modules a candidate took at that sitting.

  To deal with such a problem you need to make realistic assumptions, in this case about the mark distribution, the spacing of the thresholds and the number of modules taken, and then work through the consequences.

    —  The mark distribution is assumed to be that in the attached graph.

    —  The module thresholds are taken to be five marks apart, so that the Tomlinson cut off point corresponds to one module grade.

    —  The threshold adjustment is made only at the grade A boundary.

    —  Candidates take two examination modules.

  All of these are reasonable assumptions, and they lead to the conclusion that 16.1% of candidates of that syllabus would have lost an overall A Level grade.

  The next stage is to vary the assumptions and so obtain a range of values.

    —  Looking at other mark distributions gave a range of outcomes: 16.8%, 14.8% and for a very bottom-heavy distribution, 12.4%. A realistic "average" figure would seem to be about 15%.

    —  In coursework modules the thresholds are usually much closer together and so the final outcome would be higher, over 20% if one of the modules is coursework.

    —  In some subjects threshold adjustments were made to both A and E boundaries (and so to all those in between). In that case the final outcome would be about twice as large and so could be over 30%.

    —  Very few candidates would have taken only one module in June, but quite a lot took three as some schools had forbidden January entries. The final outcome for those who took three would have been one and a half times as great, so over 20%.

  In conclusion, a low estimate of the percentage of candidates losing an A Level grade in such a "cut-off" syllabus is 15%, and it could be quite a lot higher.

  The Tomlinson Inquiry identified syllabuses from all three examination boards which had had threshold adjustments above the cut-off level, but by far the majority of them were in one board, and in that one board it would seem that the average threshold adjustment was about five marks per module.

  So an estimate of the number of candidates who lost an A Level grade from that one board alone is 15% of the total A Level entry

  or about 35,000 candidates.

  There will be some more from the other two boards but the available data are rather restricted, making it hard to do more than guess at the number. I prefer to stay with 35,000 than to guess higher.

  The assumptions that underlie that figure have been deliberately on the cautions side. As a further act of caution, I will allow a large margin for error, and conclude that the evidence suggest a figure in excess of 20,000.

December 2002

APPENDIX

AN EXAMPLE

  When an examination paper is marked it is given a raw mark. This is then converted into a uniform mark, which is independent of the difficulty of the paper. In the conversion, the value of one raw mark varies according to the grade band width.

  A grade band width of five raw marks is quite common and this converts into 10 uniform marks. In that case one raw mark is worth two uniform marks. In most cases this is close to reality and so provides a helpful rule-of-thumb.

  However, in coursework modules band widths may be as small as two raw marks, and in that case one raw mark is worth five uniform marks.

  The effect of any change in candidates' uniform marks on their grading is illustrated for a typical distribution on the attached graph. In this example the cut-off for two modules is taken to correspond to 20 uniform marks.

    —  The black vertical lines are drawn at the aggregated A Level thresholds of 240 (E), 300 (D), . . ., 480 (A).

    —  The red vertical lines illustrate the effect of a change of 20 uniform marks at the A thresholds, with proportional effects at B, C and D.

    —  The grey shaded regions represent those candidates losing a grade. In this example, 16.1% of the candidates fall into this category.

Roger Porkess

    —  Roger Porkess is a mathematician and a Fellow of the Royal Statistical Society.

    —  He is Project leader for Mathematics in Education and Industry (MEI), a long established independent curriculum development body and is responsible for one of the largest Mathematics A Level syllabuses (MEI Structured Mathematics).

    —  He was responsible for the development of the first modular A Level in any subject; this established the principles for the assessment of such courses and is the model upon which Curriculum 2000 is based.

    —  He has long experience of examining, as a setter, revisor, awarder and marker.

    —  He is author, co-author or series editor of over 50 books, mostly on mathematics, and contributed numerous articles to various journals.

    —  In his earlier career he taught mathematics in a variety of schools in the UK and third world countries.

    —  Being employed by an independent body, allows him a freedom of speech and association on professional matters that few others in the examination world enjoy.


 
previous page contents next page

House of Commons home page Parliament home page House of Lords home page search page enquiries index

© Parliamentary copyright 2003
Prepared 14 April 2003