GothamSchools — daily independent reporting on NYC public schools

call to action

Report: “Meaningless” teacher evaluations need improvement

picture-1A new report is urging school districts across the country to beef up their methods of evaluating teachers, which the report describes as so slipshod as to be “largely meaningless.” The report, by a nonprofit group that has clashed with teachers unions in the past, describes the poor evaluations as “just one symptom of a larger, more fundamental crisis—the inability of our schools to assess instructional performance accurately or to act on this information in meaningful ways.”

The report goes on:

This inability not only keeps schools from dismissing consistently poor performers, but also prevents them from recognizing excellence among top-performers or supporting growth among the broad plurality of hardworking teachers who operate in the middle of the performance spectrum. Instead, school districts default to treating all teachers as essentially the same, both in terms of effectiveness and need for development.

The report, conducted by The New Teacher Project, a nonprofit founded by the lightning-rod D.C. schools chancellor Michelle Rhee, calls on districts to develop more robust teacher evaluation systems that reward successful teachers and easily identify less successful teachers.

The report comes amid a growing push to improve teaching quality across the country. President Obama has said that teachers who are not helping students learn should be removed from classrooms, and even the national American Federation of Teachers union is working internally to build a new method of evaluating teacher quality.

The report bases its findings on surveys of thousands of teachers and administrators across four states and 12 school districts, plus a scouring of the districts’ evaluation records. New York City was not one of the districts studied.

Many of the districts evaluate teachers based on less than a handful of observations, each of which last under an hour, and in some cases just 15 minutes. Across all districts, almost no teachers are rated unsatisfactory, and virtually none are dismissed for poor performance; in districts that rate teachers either competent or not, the ratio rated unsatisfactory versus satisfactory stands at 1 to 99.

That’s despite teachers’ and principals’ reports, in surveys, that at least one of their tenured colleagues is performing poorly, a statement that 81 percent of administrators and 58 percent of teachers surveyed told The New Teacher Project they agreed with. The percentages of educators reporting that tenured colleagues are not competent rises as a school’s students become more impoverished, as the chart above shows.

In New York City, principals rate teachers as either “unsatisfactory” or “satisfactory” at the end of each school year, based on observations they have conducted during the year. A sense that the system was too perfunctory led the Department of Education to create new teacher data reports last year that look at how teachers affect their students’ test scores. The reports have been criticized for their shaky statistical grounding, and a law lobbied for by the city teachers union prevents principals from using the data reports when making tenure decisions.

The New Teacher Project has been criticized in the past by teachers union president Randi Weingarten, who called its studies of the New York City teacher market biased because TNTP has a contract with the city to help bring new teachers into the public schools. TNTP also is the group behind a report urging the city to save money by terminating teachers who are currently without positions. To write this study, TNTP worked with union leaders and school district leaders in the districts studied, which include Chicago; Toledo, Ohio; Denver, Colo.; and Little Rock, Ark.

In a statement, Weingarten said she embraced the thrust of the new report but noted that it ignores a pioneering method of teacher evaluation, peer assistance and review, that has shown gains where it is being used, most notably in Toledo. “We are excited that TNTP shares our goal of redesigning teacher evaluations, and we look forward to working with TNTP and others to improve the quality of instruction in our schools,” she said.

  • http://edintheapple peter

    The law, regulations and union contracts in NYC provide all the tools to both attempt to dismiss and provide protections to tenured teachers.

    http://mets2006.wordpress.com/2009/06/01/what-is-tenure-a-just-cause-discipline-standard-or-a-protection-for-inept-teachers-and-why-isnt-it-working/

    Current evaluations, primarily based on compliance teacher observations, are archaic and ineffective. Ironically the Los Angeles Green Dot Charter union contract provides a nuanced teacher evaluation system … with coniderable responsibilities on the part of both teachers and the school leaders. BTW NYS law currently encourages teacher peer review in the tenure granting process … unfortunately school distirct leadership is too often wedded to outdated, topdown models, and, it’s easier to blame teacher union than to work to create meningful evalaution systems.

  • Kt

    There is such a wide gap between and especially within the categories of “satisfactory” and “unsatisfactory” that it becomes fairly meaningless to be satisfactory. I especially am pleased that the report mentions the fact that better teacher evaluations are not just a means to “get rid of” teachers but also to provide the right professional development to the right teachers. What I need at this point in my career is not at all what I needed five years ago, and hopefully not what I will need to learn in ten more years. We’ll see if/how this report is used and received, but I strongly agree with the message that U or S are not really the end-all be-all of performance evaluations.

  • http://www.thisweekineducation.com john thompson

    Speaking of “shaky statistical grounding,” the TNTP report on page 15 said that Toledo dismissed only .7% of probationary teachers over the last five years. Its my understanding that they misplaced a decimal point. And it is my understanding that the 450 probationary teachers dismissed under Toledo is still accurate.

    Regardless, check out the quote by the rookie pricnipal on page 21. If they understand Toledo. the TNTP report threw another low blow with that quote.

    This is significant. They are aware of the AFT’s efforts. Even if they just made another typo, they are overtly mis-characterizing the alternatives.

    And what about the mistakes the TNTP made last year with its original Teachers Reserve report?

    My background is academics, and I’ve written a legal history book, thus familarizing me with the ethics of the legal system. The cheap shots taken by the TNTP in reports are shameful. Even a workman’s compensation attorney would be risking his license if he committed so many wilfull mis-statements of evidence.

    And they want us to trust them with our careers?

    And they want us to just had one day to challenge evidence against us?

    Have they ever been taught ethics? Or are they just too arrogant to believe that rules of evidence and debate apply to them?

  • ceolaf

    I’m concerned about criticisms of short observations.

    I ask anyone who has worked in a professional or white color position outside of schools, how were you evaluated? Did they measure your output/production by some objective measure? Did they just sit and watch you work for an hour or so?

    Of course not!

    They looked at what you accomplished — qualitiatively and quantiatively, where appropraite. They worked *with* you on something to see how think. They did like micro-evaluations in all kinds of informal ways. Hopefully, they even had some conversations about your goals and intentions in your work, so that you and whomever were sure to have a common understanding of what you were supposed to be doing.

    Ken Marshall and others have written compellingly for years about mini-observations (or is it micro-?). There’s nothing wrong with short observations.

    Obviously, there is something wrong with small sample sizes. An couple hours out of apporximately 1000 hours instruction is clearly not enough to make valid inferences. But that some of that data might come in smaller — and consequently more frequent — chunks is not a problem in and of itself.

    There are many problems with how we evaluate teachers, but the fact that some observations might be short is not one of them.

    I also am unclear as to why observations of less than a hour would be a problem. The school day is usually split into segments of less than a hour in our schools. Why would it even need to be noted that they are this short, except to imply something (i.e. they are too short) that cannot be logically supported if explicitly stated.

    ********************

    I also have to argue with the phrase “less than a handful of observations.”

    First, it does not appear anywhere in the report, so I have to ask Elizabeth what it means. What is less than a handful, when it comes to obsevations? Whose hands are we talking about, how how bulky is an observation? A handful of M&Ms would number in the dozens, but a handful of dinner rolls would number less than half a dozen. Untenured traditional public school teachers are supposed to be formally evaluated six times a year. Is that “less than a handful”?

    Second, there’s an editorial nature to the phrase. If you (Elizabeth) think that they are not enough observations, say so. Perhaps you could guide to the reader to what an appropraite number might be.

    I think that I also have to criticize Elizabeth for not listing the four states examined in this report (i.e. Arkansas, Colorado, Illinois, Ohio). I don’t think that these states are necessarily representaive of the entire country, and therefore Elizabeth would have done well to mention which states they are. There are no Northeastern, Mid-Atlantic, Sothern or West Coast districts included among the 12 TNTP looked into. That seems odd to me. If general statements are going to be made about practices nationally, I would think they’d have a more nationally representative sample, or at least they’d provide some sort of explanation of for why we might make national inferences from the sample they selected.

    In fact, it is not clear why they chose these four states, unless I am missing something.

    I guess that I am asking Elizabeth to present reports a bit more critically. I don’t have a problem with her reporting on their findings, but I think that she should frame with at least a little but of guidance as to why and why not we should take them seriously. And, of course, that she should do this explicitly.

  • MeanTeacher

    Hi John,

    Did you happen to read who sat on the Advisory Panel?

    Francine Lawrence
    President, Toledo Federation of Teachers

  • ceolaf

    MeanTeacher,

    I am not sure what your point is? The advisory panels did not have any authority to approve or edit the report. Rather, “advisory panel members were given the opportunity to provide a written response to the process and recommendations.” (p. 38)

    These responses are posted on the report’s website.

    http://www.widgeteffect.org/panel-responses/

  • ceolaf

    Ceolaf,

    To be fair, if you read carefully to the end you’d see that Elizabeth *does* provide the reader with info on which states were included, even if she doesn’t do so when she mentions the the report covered just “four states and 12 school districts.”

  • http://www.thisweekineducation.com john thompson

    Francine,

    I had just noted the people who wrote the report. The Adisory Panel shows that the authors knew about the Toledo Plan, and just chose to not acknowledge it or explain its role. Shouldn’t a study of evaluations include an account of those evaluations?.

    The pattern is what bothers me. When the wrote the ATR report last year, would it have killed them to combine scary sounding percentages like “six times greater” to have received Unsatisfactory rating with the unscary raw number? Couldn’t they have included any caveats in their evidence, such as the number of teachers seeking work? Other reports described the ATR by showing the complexity of the situtation.

    The more the TNTP demonizes the union, the tougher they make it for reformers in the union. And again, they just want teachers to have a single day in front of an arbitrator.

    I couldn’t follow all of the moving words on their fancy web site but I liked the conclusion of the Cincinnati Federation of Teachers that “…Comprehensive
    evaluation systems like ours are very expensive to run and we can only
    evaluate 1/5 of the teachers each year. If this is where our country is going
    we will need to find many, many more dollars to do this, particularly
    if every teacher is comprehensively evaluated every year. Again, I caution
    us all to consider changing the larger context of school structure first.”

Tips, questions, feedback?

Contact us at .

Word from Our Sponsor

Follow GothamSchools

RSS
Subscribe to the daily email digest:

Chalk It Up

Recent Comments

2 comments so far today

Archives

May 2013
M T W T F S S
« Apr  
 12345
6789101112
13141516171819
20212223242526
2728293031