GothamSchools — daily independent reporting on NYC public schools

Improvement in progress report grades: real or random?

Last year, the first round of progress reports attracted anger and ridicule. Perhaps because far fewer schools received low grades, the response this year has been more muted, making room for measured, evidence-based discussion of the DOE’s methodology in constructing the reports.

Over at Eduwonkette, Harvard education professor Daniel Koretz offers a lengthy critique of the progress report methodology. He notes that test scores alone are not a legitimate way to evaluate schools; New York State’s tests were not designed to be used in “value-added” analysis like that behind the progress reports; and the progress reports, like all accountability systems, place pressure on school administrators that likely leads to score inflation. In addition, he writes that the DOE’s formula does not take into account “interval scaling,” or the reality that different amounts of “value” are required to move students from one proficiency level to the next at different points on the proficiency spectrum. (In June, I wrote about how interval scaling might contribute to the finding that No Child Left Behind has helped high-performing students less than their low-performing peers.)

But those problems exist in many test-based, value-added accountability systems — Koretz writes that New York’s progress report system has its own set of errors. The tremendous variation in schools’ grades from last year to this year probably has less to do with school improvement than sampling and measurement error, he writes.

Here’s an illustration of the effect of error. I first calculated the variation in schools’ grades between last year and this year and then graphed it against their enrollments. It’s obvious that larger schools were less likely to see sizable changes in their grades than smaller ones. No school with more than 1,500 students went up or down more than one grade, while all schools whose grades changed the maximum amount possible had fewer than 1,000 students; most of those that increased by that amount had 500 students or fewer.

A substantive explanation for this distribution might be that large schools don’t do a good job moving their students forward, and smaller schools can give more attention to each student’s individual needs. But I’m with Koretz that correct explanation is more likely to be methodological — and rooted in error. A school with 400 students sounds like it would produce stable results. But consider that elementary progress reports only look at two grades’ worth of students — those with two years of test scores. The progress report grade for a school with 400 students could depend on just 100 students’ test scores — hardly a sample that allows chance differences among students, and in each student’s year-to-year test experiences, to wash out.

3 Comments

Subscribe to comments with RSS or TrackBack

  1. Philissa, Really nice job with this analysis - since smaller schools are more likely to have both significant upward *and* downward mobility, I think that supports the error argument rather than the idea that large schools don’t do a good job moving their students forward.

  2. Ravi

    Philissa makes a great point. For an in-depth analysis of the statistical properties of school test score measures (including the one that Philissa discusses), check out this paper by Thomas Kane and Doug Staiger:

    ”The Promise and Pitfalls of Using Imprecise School Accountability Measures,” Journal of Economic Perspectives, 16(4):91-114, fall 2002

    http://www.dartmouth.edu/~dstaiger/Papers/kanestaigerjeparticle.pdf

  3. skoolboy

    Great insight by Philissa here. I took her analysis one step further, and looked at the stability of the progress report grades from last year to this year in a slightly more formal way (a gamma coefficient, which is a measure of ordinal association, for the crosstabulation of last year’s grade by this year’s grade, separately for elementary schools, K-8 schools, and middle schools.) I divided schools at each level into those that were above the mean in enrollment and those below the mean (roughly 600 students for each level). At the elementary school level, the progress report grades were significantly more stable among the larger schools than among the smaller schools. At the K-8 and middle school levels, school size was not significantly related to the stability of the progress report grades from last year to this year.

Leave a Reply

Tips, questions, feedback?

Contact us at .

Mapping the Budget Cuts

Post a comment about the budget cuts at your school on our interactive comment map. more »

Chalk It Up

Our Twitter Updates

  • Citywide Council on High Schools meeting is set to proceed as scheduled, for now. Same goes for the PEP meeting rescheduled from Jan. 26. 18 hrs ago
  • From the DOE: In anticipation of inclement weather, the Specialized High School open houses scheduled for Weds. have been postponed. 18 hrs ago
  • @datadiva What do you see as the biggest changes? We're having trouble figuring out what to make of the 2010-2011 changes. in reply to datadiva 19 hrs ago
  • NY is near the top RT @alexanderrusso: State by state list of National Merit cut scores shows gaps, equity problems http://bit.ly/dw1Ztv 19 hrs ago
  • RT @HillcrestHigh: The DOE states that ARIS Internet Parent Link is out of service until 2/12. 3 days ago

Events Calendar

Archives

February 2010
M T W T F S S
« Jan  
1234567
891011121314
15161718192021
22232425262728

GothamSchools by Email

Technology in Education

The blogroll is a work-in-progress; to be added or if you've been miscategorized, send us an email at .