In the midst of a controversial LA Times series linking teacher performance in that district to test scores, a new briefing paper was released today by the Economic Policy Institute cautioning against the use of test scores, the Value Added Model, to judge teacher performance.
The 27-page paper — by a blue ribbon collection of educaion researchers including Eva L. Baker, Paul E. Barton, Linda Darling-Hammond, Edward Haertel, Helen F. Ladd , Robe rt L. Linn, Diane Ravitch, Richard Rothstein, Richard J. Shavelson, and Lorrie A. Shepard - lists many negative impacts from judging teachers largely on student test scores. They also point to studies that cite the unreliability of scores.
My first response to this paper is wonder if there is any school system with the time, resources or staffing to conduct the thoughtful and deeper evaluations that these researchers recommend. The comprehensive evaluation model they suggest could be applied in many professions, except that it calls for resources and time that I don’t think too many companies have any longer. And I certainly don’t think schools do in this current bleak climate that will likely persist for a few more years.
Here are key passages, but please read the entire paper. I think many of you will be applauding its position. Also, I have added the link to the LA Times investigation, which is a powerful piece of journalism and creating quite a stir. So, forget about the wash and the grocery store, read both of these and let us know what you think:
A review of the technical evidence leads us to conclude that, although standardized test scores of students are one piece of information for school leaders to use to make judgments about teacher effectiveness, such scores should be only a part of an overall comprehensive evaluation. Some states are now considering plans that would give as much as 50% of the weight in teacher evaluation and compensation decisions to scores on existing tests of basic skills in math and reading. Based on the evidence, we consider this unwise.
Any sound evaluation will necessarily involve a balancing of many factors that provide a more accurate view of what teachers in fact do in the classroom and how that contributes to student learning.
For a variety of reasons, analyses of VAM results have led researchers to doubt whether the methodology can accurately identify more and less effective teachers. VAM estimates have proven to be unstable across statistical models, years, and classes that teachers teach. One study found that across five large urban districts, among teachers who were ranked in the top 20% of effectiveness in the first year, fewer than a third were in that top group the next year, and another third moved all the way down to the bottom 40%. Another found that teachers’ effectiveness ratings in one year could only predict from 4% to 16% of the variation in such ratings in the following year.
Thus, a teacher who appears to be very ineffective in one year might have a dramatically different result the following year. The same dramatic fluctuations were found for teachers ranked at the bottom in the first year of analysis. This runs counter to most people’s notions that the true quality of a teacher is likely to change very little over time and raises questions about whether what is measured is largely a “teacher effect” or the effect of a wide variety of other factors.
A study designed to test this question used VAM methods to assign effects to teachers after controlling for other factors, but applied the model backwards to see if credible results were obtained. Surprisingly, it found that students’ fifth grade teachers were good predictors of their fourth grade test scores. Inasmuch as a student’s later fifth grade teacher cannot possibly have influenced that student’s fourth grade performance, this curious result can only mean that VAM results are based on factors other than teachers’ actual effectiveness.
VAM’s instability can result from differences in the characteristics of students assigned to particular teachers in a particular year, from small samples of students (made even less representative in schools serving disadvantaged students by high rates of student mobility), from other influences on student learning both inside and outside school, and from tests that are poorly lined up with the curriculum teachers are expected to cover, or that do not measure the full range of achievement of students in the class.
The paper concludes:
Although some advocates argue that admittedly flawed value-added measures are preferred to existing cumbersome measures for identifying, remediating, or dismissing ineffective teachers, this argument creates a false dichotomy. It implies there are only two options for evaluating teachers—the ineffectual current system or the deeply flawed test-based system. Yet there are many alternatives that should be the subject of experiments. The Department of Education should actively encourage states to experiment with a range of approaches that differ in the ways in which they evaluate teacher practice and examine teachers’ contributions to student learning. These experiments should all be fully evaluated.
There is no perfect way to evaluate teachers. However, progress has been made over the last two decades in developing standards-based evaluations of teaching practice, and research has found that the use of such evaluations by some districts has not only provided more useful evidence about teaching practice, but has also been associated with student achievement gains and has helped teachers improve their practice and effectiveness.
Structured performance assessments of teachers like those offered by the National Board for Professional Teaching Standards and the beginning teacher assessment systems in Connecticut and California have also been found to predict teacher’s effectiveness on value-added measures and to support teacher learning. These systems for observing teachers’ classroom practice are based on professional teaching standards grounded in research on teaching and learning. They use systematic observation protocols with well-developed, research-based criteria to examine teaching, including observations or videotapes of classroom practice, teacher interviews, and artifacts such as lesson plans, assignments, and samples of student work. Quite often, these approaches incorporate several ways of looking at student learning over time in relation to the teacher’s instruction.
Evaluation by competent supervisors and peers, employing such approaches, should form the foundation of teacher evaluation systems, with a supplemental role played by multiple measures of student learning gains that, where appropriate, should include test scores. Given the importance of teachers’ collective efforts to improve overall student achievement in a school, an additional component of documenting practice and outcomes should focus on the effectiveness of teacher participation in teams and the contributions they make to school-wide improvement, through work in curriculum development, sharing practices and materials, peer coaching and reciprocal observation, and collegial work with students.
111 comments Add your comment
J.B. STONER
August 29th, 2010
8:12 pm
and I wonder what ‘Proud black man thinks..
J.B. STONER
August 29th, 2010
8:24 pm
And to Glenn B eck, thanks for yesterday, god bless you.
nter your comments here
Dr. John Trotter
August 29th, 2010
9:29 pm
MrNumbersMan: You must hail from the Columbus area, right? My parents are still kickin’ at 85. Daddy still tells some funny school stories. Of course, Bo is a riot in and of himself! Patti retired as a First Grade teacher a couple or three years ago, and her kids loved her because she had such a sense of humor with them. Two quick stories…
When Patti was a young teacher, she was faced with the burden of telling a parent that her child just needed to be held back another year in First Grade. The kid just wasn’t picking up on anything, as much as Patti worked with him. Well, it was in the Fall, and Patti had arranged for a conference witht the mother and the principal. After sweetly and gently talking with the mother and giving her rationale why it would be better for the child to repeat the First Grade, Patti gingerly asked the mother if she has any questions. Patti braced herself. The mother asked: “Yes, when is the Halloween Party?”
Daddy tells this story about one of the coaches back in the 1950s at Jordan High (where my father was an assistant principal at the time). This coach was a gem of a coach (especially baseball) but was not known for his academic prowess in the classroom. But, he was no-nonsense when it came to discipline. There was also a sweet, petite lady who taught Chemistry but the kids ran all over her. She was just terrible as far as discipline was concerned. She just didn’t have that commanding presence which scared the students. She was a walk-over, but she knew her Chemistry. My father called the coach in (in fact, I saw this coach at a banquet the other day in Columbus; I think that he is 85 like my father). He said: “Coach, I want you and Miss __________ to Team Teach the Chemistry class. You just sit there with your paddle, and the moment any of the students try to give Miss ___________ any trouble at all, you just pull them out into the hall and paddle them.” The Team Teaching plan worked marvelously! The students gave Miss ____________ no trouble, and she was able to teach. You just have to be creative…something that they don’t want to let you be anymore.
MrNumbersMan, they tie up your hands and don’t let you be effective, but they do want the paperwork done so that they can show their bosses how hard they are working. The paperwork justifies their jobs.
Good to hear from you!
thankateacher
August 29th, 2010
10:26 pm
Echo….
PLEASE, PLEASE, PLEASE tell MACE to come to Douglas County. There are many teachers literally sick to their stomachs when they heard what Central Office wanted to do with teacher evaluations and how to tie it into GTEP.
As I have said before, they want to tie COACH post test scores into teacher evaluations. Teachers have been told that 80% of their classes have to make 75% or higher on the COACH post test. They are also looking into Class Keys and how to tie that into teacher evaluations. Teachers that do not “perform” will be put on a PDP automatically.
To make a long story short, if you have an administrator that does not like you, he or she will find a way to make your evaluation look like crap. We need to fight this!!!
Dr. John Trotter
August 29th, 2010
10:34 pm
Thankateacher: You guys just need to call the MACE Office and talk to one of our Representatives. MACE protects and empowers classroom educators…one member at a time. MACE never claims to change the whole culture of a school or a school system. MACE protects and empowers its members.
Anonymous Teacher
August 30th, 2010
8:16 am
There are some awesome administrators out there. Unfortunately, they are often used up, over worked, or run off by the incompetant ones.
My school had the most awesome AP for the past 6 years. Notice the use of the word “had.” This person cared about the students, the teachers, and the community. He worked with the “hard to love” students who had discipline problems and assigned them adult mentors to help them get straightened out. He rarely missed school and was constantly preoccupied with the safety of our students. He was run off by a narcisstic principal who had had numerous complaints lodged against him to the Professional Standards Commission for speaking out when the principal was insisting that some questionable things be done.
A building principal is an absolute dictator. They set the tone for how things go in the school. If you have one who constantly threatens and berates the entire faculty because one or two are not doing what they should be doing, then it creates resentment among all. If you have one who conducts meetings that go on for hours while other schools in the same district (who have the same information to cover) get out in a third of the time, then it creates resentment because it is clear the administrator is on a power trip instead of focusing on teacher’s preparing for students.
And who can teachers complain to? They can complain to their spouse or they can complain anonymously online to a blog. That’s it. Because if they speak up in any other way, they become the next target to lose their job.
Mac
August 30th, 2010
11:12 am
This is what I have been worried about with VAM all along (we started using a variation of it a few years back to look at teacher growth) – the main variable that that is not taken into account is year to year make up and dynamics of the classroom due to the student mix and maturity levels. This changes each year dramatically and is even in flux during the course of the year in each classroom (talking elementary here primarily). Every other administrator or education data guru I mentioned this to just poo-pooed it and repeated the ‘it’s a better measure, times are changing’ mantra. All for change, just not for it’s own sake and only if it is good for kids. I don’t see that here.
Ole Guy
August 30th, 2010
4:39 pm
And it took a team of high-powered, high-speed researchers to arrive at this simple conclusion…AMAZING, SIMPLY FRIQIN AMAZING!
Here it is..........
August 30th, 2010
5:08 pm
http://empoweredga.org/Articles/whatrtttmeans.html
Nikole
August 30th, 2010
7:21 pm
Thanks for the link to the paper. I will use it in a graduate course this semester.
National debate is under way: What is the fairest way to measure a teacher’s classroom performance? » iThinkEducation.net!
September 1st, 2010
3:33 pm
[...] I wrote about the Economic Policy Institute policy brief that challenged the use of student scores to measure teacher effectiveness and the controversial [...]