‘the dark irony and cruelty of hegemony is that teachers take pride in acting on the very assumptions that work to enslave them’
Graded observations must go. I’ve argued before that they are this year’s brain gym; I’ve shown the damaging impact on teachers in around 50 anecdotes; and I set out several alternative solutions other than grading. Now I want to compare some observation rubrics that schools use.
Matt O Leary asked me a while back to review his book Classroom Observation. Here is his argument:
The high-stakes nature of performance management-driven observation for monitoring and measuring militates against professional development; school leaders must challenge the hegemony of graded observations and redesign observation as a tool for reciprocal learning, decoupled from summative high-stakes grading.
‘Good teaching is a contested area and cannot be easily defined … any search for a simple model of good teaching is ultimately doomed to failure.’
‘There’s a mismatch between teachers’ work and the means used to evaluate it – managerial models fail to capture the complexity – they fail because they are flawed.’
‘Some of the key qualities of the good teacher are often not observable – humour, compassion, enthusiasm, determination…’
‘Why do we insist on grading teachers against this arbitrary Ofsted 4-point scale?’
‘There is no published research which shows that meaningful grading is possible.’
‘Other professions such as medicine and law are not subjected to this arbitrary system of measurement. Why can’t the same apply to teachers?’
‘The failure of the teaching profession to reach any agreed consensus on what constituted best practice meant that it was at the mercy of whichever political wind was blowing.’
While I completely agree with Matt O Leary’s diagnosis, I’m uneasy with this prescription:
The jargon of ‘maximising potential’ ’hypothesis-building’ ‘inclusivity’ and ‘resilience’ makes me feel uneasy. And here lies the rub: any selective list of what makes good teaching will never be agreed upon.
Let’s look at some examples. One school uses these observation criteria:
It seems suspiciously designed to fit the acronym BRAD PIT.
Another school uses this apparatus:
What these two and many other observation rubrics have in common is the 1-4 numerical grading judgements. But how are the criteria decided on? The first neglects subject knowledge; the second is more exhaustive but demands an extraordinary 20 numerical judgements per lesson observation.
As an experiment, school leaders were asked to write down what they thought was necessary to be a good teacher:
Such lists are endless; combining and prioritising them into a selective list of 10 or 20 is intractable and interminable. Itemising and quantifying things like tenacity, creativity and idiosyncrasies (undoubtedly important parts of being a teacher) is a fool’s errand.
The problem is, we are asking the wrong question. Debates over ‘what makes a good teacher?’ and ‘what makes an outstanding teacher?’ or even ‘what makes good teaching?’ are circular. There are as many possible answers as there are teachers in the world.
Instead, we should be asking: how can observations most help improve our teaching? Not by grading or quantifying or judging. Observations most improve teaching when they are disconnected from performance management, appraisal and pay, and only formative. Observations most help when they are low-stakes, frequent and give one clear, prioritised, next-step piece of feedback. As Paul Bambrick Santayo argues:
‘By receiving weekly observations and feedback, a teacher develops as much in one year as most teachers do in twenty.’
He proposes this rubric:
One colleague I trust says he can’t find any way to improve on this. I think it’s missing two vital components: planning and assessment. So here is the rubric I propose:
The vital difference is not the criteria. The vital difference is that it is only used formatively. Numerical judgement must not take place at all if teachers are to get the best out of observations for their professional development. Beyond that, the criteria are only used as guidance, a tool for focusing rather than judging.
In a world without graded observations, these two ideas might have an impact on CPD:
Video allows teachers to reflect on their teaching and analyse their pupils’ behaviour, concentration and effort beyond their lessons. Matt O Leary makes the case for them in his book, and says that in the best schools, teachers ask for their worst classes to be observed. Why? Because of the culture of trust and learning.
Observing to learn: one obligatory, unscheduled weekly observation to learn from another teacher would create an open-door culture. I rarely found enough time to observe other teachers, but when I did they were some of the most valuable experiences I had in teaching. If I’d been forced to make this into a regular habit, a bit like practicing a musical instrument, I would have learned far more.
The research that shows grading is ineffective is out there. School leaders now have a responsibility to replace graded observations with an alternative that works.
You can already hear the pleas of panic: but how will we manage performance and underperforming teachers without graded observations? How will appraisal and capability work? What will we tell OFSTED when they arrive? Those are questions I take up in my next blogposts.