The luck factor

Warning: long and boring and technical entry coming up… Feel free to skip it…

I want to talk about the way candidates are marked in their A-level English Oral exams.  No I’m not complaining, but I want to see if any of you has anything to say about the system.  Ok here’s the story:

It’s the Oral exams from 13 Mar to 22 Mar – that’s 8 days of examination.  I’m lucky enough to be one of the examiners.

Each day, I examine 7 groups of candidates (4 in each group) – that’s 28 of them (sometimes 27 or less).  After they’ve given their oral presentation and a discussion, I give them a score from 1 to 7 (1 being the worst, 7 being the best).  Now the score is subjective and based on impression marking; there’s neither criteria nor benchmark nor performance standards (actually there ARE guidelines but I won’t go into that).  I simply compare all 28 of them and sort of rank them.  I’m supposed to fit their scores into something like a normal distribution curve, namely the following "acceptable range" of marks:

1 mark: 0-2 candidates
2 marks: 3-4 candidates
3 marks: 5-6 candidates
4 marks: 8-9 candidates
5 marks: 5-6 candidates
6 marks: 3-4 candidates
7 marks: 0-2 candidates

For example, the following pattern would be acceptable:

1 XX
2 XXX
3 XXXXX
4 XXXXXXX
5 XXXXXX
6 XXXX
7 X

The following pattern would obviously be UNacceptable, for it’s statistically unlikely that the numbers of poor, average and outstanding candidates are all equal:

1 XXXX
2 XXXX
3 XXXX
4 XXXX
5 XXXX
6 XXXX
7 XXXX

The following pattern is also UNacceptable because it’s skewed to one side, indicating that the examiner is unduly harsh in his marking:

1 XX
2 XXXXX
3 XXXXXXX
4 XXXXXXXXX
5 XXXX
6 X
7

Now the obvious question is: How is it possible to make all candidates from EVERY DAY fit in the same normal curve?  I mean, is it possible that the overall standard of candidates on, say, Monday is higher than that on Wednesday?  That is, a certain student getting a 2 on Monday would get a 3 on Wednesday, simply because he’s being compared to a different, less able, batch.  Is this likely?  The Examinations Authority says that it’s theoretically possible but very rare in practice because the candidature of each exam centre is completely random.  Now I don’t know the exact odds for this (I’m no statistician), but this theoretical possibility direct results in a matter of pass or fail.  Why?  Because there has to be a dividing line between pass or fail.  Something like: you get a 2, you fail; you get a 3, you pass.  I know it’s a double-examiner system but that doesn’t change the essence.  Both examiners give a 2; the candidate fails.  Both give a 3; he passes.  You can see that the frequent claim "examiners are subjective" no longer applies here.  It doesn’t make a difference really.

So that leaves the question: Is it likely that a candidate who failed on one day would have passed on a different day?  Absolutely.  In fact, that’s what I experienced in just these 3 days.  On Monday (13 Mar), the standard of my 7 groups was indeed quite high (my partner agreed so).  Today (15 Mar), the standard was unexpectedly poor (echoed by my partner again).  In other words, if you happened to be one of the candidates in my group on Monday, then I feel sorry for you; but if you were in my group today, then congratulations, you would’ve got a high mark!

I know that the Oral component accounts for only 18% of the total Use of English scores, and that it’s again statistically insignificant for a person to fail the whole subject because of this, but the appearance of an "F" on the result slip is still unsightly.

So, what’s the solution to the problem?  Honestly I don’t know.  Can we increase the number of candidates from 28 to 40, to make the normal distribution more likely to be attained?  No, because no examiner can examine 10 groups of candidates in one go and still remain sane or reliable.  Can we change the scoring system altogether and make it a criterion-referenced one, like the LPAT (±Ð®v»y¤å°ò·Ç¸Õ)?  No, because fitting a candidate’s performance to a given level (based on some kind of descriptors) is very subjective and imprecise.  Actually this was done in the past but it was found that examiners often could not discriminate betwee the less able candidates, i.e. the boundary between Pass and Fail was a very fine line to tread.

So, the current system is supposed to be the best one there is.  Or is it?  I really have no idea.

All I know is that when I wish my students "good luck on the exams", I now mean it literally, especially on the Oral exams.

Log in to write a note
March 15, 2006

unfair to some students, actually but, the world is often not fair

March 15, 2006

MAN… The distribution curve seems to make it fair but actually it’s UNfair! We always knew we were marked according to the curve but never realised the consequences. Sigh…

nds
March 15, 2006

The problem is that the students sampled in each day is not necessarily representative of the student body at large, due to random skews in the sampling. The easiest way to deal with that problem is by repeated sampling. Repeat several rounds of such oral exams, each round using a different schedule. That way the effect of sampling all “good” students on the same day can get amortised.

nds
March 15, 2006

Too bad that also means our examiners would have to do several times more work. 🙁

March 15, 2006

No wonder I got poor grade in oral exam….>.< I was in bad luck!!

March 15, 2006

This system lacks flexibility.

March 15, 2006

this happens not only in CE / A-level, this happens in all public exams, plus in-house exams in school… every academic institute ranks students in this way, trying to fit all students in a normal distribution curve… many of the teaching staffs (i’m not saying you) believe students should fit into that curve, while they themselves are exceptionally good.

March 15, 2006

ÂI¸Ñ½g³¥¤@Á¿­^¤å d notes ´N·|ÅÜ«{°µ¯Â­^¤å¬[­ù..(¥ò­n­ø«Y chinglish~~~)¯u«Y«l §Ú°O±o¥H«e±`Å¥»¡ group discussion²Ä¤@­Ó·m¥ýÁ¿ªº´N°ª¤À..¨þ

March 15, 2006

Hummm…it does sound unfair… Good luck to all the students taking the A_level oral exam!

March 15, 2006

sigh

March 15, 2006

my two cents: 1) a group 28 students is definitely not a fair sample size for statistical reason, 2) for every student, will the highest and lowest mark be discarded?

March 15, 2006

Just score them all pass…what the hack !! hahaha :o)

March 16, 2006

oh dear…. this luck factor O.O in which case it might actually be “fair” of examiners just give their marks based on impression…? do you think it works if you’re allowed to -not- follow the curve, and give a candidate any mark you think he or she deserves?? Re: thanks for your support^^~ but i’m actually just a tiny tiny potato contributing to the anthology :-p

March 16, 2006

hhmmm… a bit sleepy tonight.. can i read this entry later? haha.. RYN: ¼M!! *Shhhhhhh* ­ø¦nËݤjÁn§r!!! ÂI¸Ñ§A¤S·|¦ô¨ìgeh?? §A¯u«Y¤ñ§Ú·Q¶H¤¤Áo©ú¦n¦h!!! ^^

March 16, 2006

Yup! I fall in love with a man finally……….. Hope everything goes well la!

March 16, 2006

oral exam has killed many candidates…

March 16, 2006

of course there will be bias, but i consider that it would be the same in the working enviornment as in the oral exam, and students that had done better will still receive better marks, right? and we also need the “luck” factor in daily life, it’s the reality.

March 16, 2006

RE: ¾¾¡A¦pªG¯àÅܦ¨¦³®Éµo¾`¡A®É®Éµo¥ú·íµM¬O§ó¶i¤@¨B¡A¦ý¦³®Éµo¥ú¤w¸g¬O«ÜÃø±oªº°Õ¡C

March 17, 2006
March 17, 2006

rere: students do not mind, if “exams” are in daily basis… that reflects the truth competence too.

March 17, 2006

re: À°¸É¤Uma

March 17, 2006

And that’s how luck realms! But after all, I think the stronger ones will always get what they deserve. Only that it won’t be so fair for those in between, so unevenly distributed on different days. Just recall how I once had my A-level two years ago though I’d rather go back to that period instead of being “tortured” by my studies now. Have a lovely holiday anyway! Take care…

March 17, 2006

RYN: hahaha.. ^^ Well, ¨ä¹ê§Ú»¡±Ð®Ñªº°é¤luncool¥u¬Oªx«ü³o¦æ·~, ·íµM·í¤¤·|¦³«Ü¦h­Ó§Ocases/¨Ò¥~ªº¤H… ´N¦n¹³§A°Õ.^^ ÁöµM§Ú¤£»{²{¹ê¥Í¬¡¤¤ªº§A, ¦ý±qodùØ·P¨ì§A¬O¤@­Ó­·½ì, «ÕÀq, ¥¿­±, ©M«D±`¦³±Ð¾Ç¼ö¸Û¤Î·R¤ßªº¤H. §Aªº¯d¨¥¥Ã»·¥O¤H·|¤ß·L¯º, ¦]¬°§AÁ`¬O±q¤@¨Ç«ÕÀqªº¨¤«×¥Xµo, ©Ò¥H´Nºâ«ÜÁV¿|«Üªq³àªº¨Æ±¡, µ¹§A’«Õ¤W¤@Àq’«á, ³£·|Åܱoless unbearable©O. §Aªº¯d¨¥¯u¥¿¦³’cheer ppl up’ªº§@¥Î.

March 17, 2006

¬O¤F, §O­n»~·|§Ú¯S¦a©^¥à§A°Ú. ^0^ ³o¨Ç³£¬O§Ú¯u¥¿ªº·Pı©O…

March 17, 2006

Yes good luck to them! :-)… Thank you for your kind words you left me! Have a great day!! See I not only saved money… but my lungs and life as well by not smoking. *smiles*

March 17, 2006

Oh, so boring…not you OD, but the marking job…=_=

March 17, 2006

RE(2): there will not be a perfect marking scheme. exams is a system full of “the luck factor”, and it usually cannot represent the real talent of students. but are there any better ways to solve the problem?

March 25, 2006

©Ò¿×¤@©R¤G¹B¤T­·¤ô¡A¦ÒÃä¬ì³£¦n¡A³£­nÚ»¤U¦n­ø¦n±m¥ý¦³¾÷·|°ª¤À