GothamSchools — daily independent reporting on NYC public schools

testing testing

Making state tests public may also make them easier, experts say

Here’s one more reason state tests might be getting easier to pass: a longstanding State Education Department practice of publicly releasing every question on each year’s exam.

The unusual practice makes it harder for test-makers to gauge how difficult a test is, said Howard Everson, chair of the state’s Technical Advisory Group, an oversight committee that monitors state testing.

Many states release some test questions but keep others private so they can be used again to compare one year’s test to another’s, said Daniel Koretz, a Harvard University education professor who studies testing. But New York has long had a practice of releasing every single test question to the public soon after students sit for the exams.

“In the interests of transparency the State Education Department has a decades long practice of making our tests publicly available soon after their administration,” education department spokesman Tom Dunn said in an email. Dunn said the department has never re-used test questions from year to year as a way of comparing the difficulty of the exams.

Releasing every question soon after the test is given prevents test makers from field testing exam items in an actual test setting, Everson said.

Field tests allow test makers to figure out how hard questions should be and set the scale used to judge students. Exams like the SAT include field-test questions folded into the actual exam, but students don’t know which questions won’t count toward their scores. The experimental questions are then used again on future tests to gauge their difficulty.

Unable to field test questions in this real-world setting, the state must rely on no-stakes tests given to a sample of students on a different date. Dunn said that students who sit for the field tests are told that the exams are only experimental.

But when students know they won’t get a grade for the field test, they might be less motivated to do their best, Everson said.

As a result, field tests often suggest that questions are more difficult than they actually are. And because they’re used to set the scale by which the real tests are graded, the end result is an easier state exam, Everson said.

“This is not ideal,” Koretz wrote in an e-mail. “What we don’t know is how much of an impact this has had.”

Koretz and Everson have both been calling for an investigation of test score credibility for over a year. Koretz has frequently said that until such an investigation happens, it’s impossible to know whether the test score gains in recent years truly reflect real learning.

Releasing some or even a large portion of test questions to the public in the interests of transparency is common and has its benefits, testing experts said this week.

When the New York legislature first considered requiring the publishers of post-secondary admissions tests like the SAT and GRE to make their test questions publicly available, testing companies initially lobbied against the law, said Paul Kanarek, founder of the Princeton Review of California who was also involved in New York’s legislative discussions of the requirement.

But test-makers have since acknowledged the benefits of the practice. And releasing real test questions is good for students, he said, both because it helps them prepare for exams and also because it help ensure that the tests are equitable.

“Whenever you are forced to show what you are giving kids the light of the day, all of the sudden a lot of the obnoxious style questions drop out of the tests,” Kanarek said. “They’re no longer asking questions that are so class-based, for example.”

But both Kanarek and Michael Zieky, a test developer for the Educational Testing Service, said that it was very unusual for a test-maker to release its experimental field-test questions.

  • http://www.classsizematters.org Leonie Haimson

    This is very confusing. You write, “Exams like the SAT include field-test questions folded into the actual exam, but students don’t know which questions won’t count toward their scores.” Presumably, those questions are not released to the public since they are not counted for the actual scoring, no? Why couldn’t the state do the same?

  • Michael M.

    Tests getting passed more easily due to releasing the prior year’s questions, per Mr. Everson, seems in stark contradiction to prior Gotham Schools story featuring Mr. Everson (Higher Pass Rates Could Be Due to Tougher Tests Expert Says, Aug 18).

  • Maura Walz

    Michael — Everson’s account of what has happened with the tests actually strikes me as consistent in as far as it goes: An unusual field testing method leads test-makers to believe that their test questions are harder than they actually are. The test-makers, thinking that they have an exam full of very difficult questions, then adjust the grading scale down so that fewer correct answers are required to pass. I admit that I’ve been piecing this account together backwards, and it certainly seems to be evolving, but I think the explanation follows logically, at least.

    I’m not sure this explains everything — for example, New York state’s practice of releasing all the test questions and field testing this way is a long-standing practice, so why the huge jump this year? There are probably a lot of other questions still to ask. (And please, tell me what they are!)

    I’m also not sure that this explanation is meant to provide an excuse for test flaws. But it is an interesting angle on how mistakes might have happened.

    But basically, I don’t think Everson’s two points contradict one another. Again, if I’m wrong here, please do tell me why.

  • Michael M.

    Maura,
    Say you’re correct. How do you tell which effect is stronger, etc?

    I won’t dare attempt an alternate “string theory” of the two Everson statements. I hope for both of us, Mr. E weighs in to explain same.

    Two kids on a teeter-totter. One side rises. Is that the side with the lighter kid, or a heavier kid sitting closer to the fulcrum? Or maybe both kids are on the high side, Lake Woebegone style.

    But when it comes to these darn tests, the poobahs would say: gravity took a holiday on the higher side, or gravity is working overtime on the lower side, or… before Mayoral Control, both sides were low.

    I jest of course. Because I remain cornfused. And I’m not the type to say “oh yeah” when I’m stuck on “say WHA?”

  • Ellen

    Regents tests and answers have been released every year since …forever. Kids buy or connect to Barron’s and take practise tests. High School Juniors/Seniors take practise PSATs, SATs, LSAT, MCATS. This is not new. In fact it’s older than I am. What am I missing in this discussion? Are we saying that testers should not release their experimental test questions or any test questions? And I get the impression taht folks think it was “harder” to pass the tests in the past. Were the tests “harder” in 1980? 1970? 1960? All of the test questions that could be released were released then. How do we make the comparison between tests?

  • Fred Smith

    A few points: 1) Truth-in-Testing is a good thing. 2) Maura, you have not been putting the story together backwards and you are right about how the explanations keep evolving. The real problem is how the state’s appointed test defenders/apologists (they are called advisors or information officers)–how their evasions keep evolving as the public is getting wiser to the issues of field testing and standard setting. Now Professor Everson is letting us in on heretofore undisclosed problems concerning the limits of field testing and how it has misled the state into setting lower cut off scores. 3) Micheal M., you’re right to be vigilant about the doublespeak coming from Professsor Everson and the state education people: The NY Times (Sept. 14) quotes state officials as saying the “test questions themselves have actually gotten harder.” Item statistics provide evidence that is claim is untrue. 4) Finally, is NYSED’s Office of State Assessment and Commissioner Mills on record as saying the cutoff points (i.e., the test standards) are completely established before the operational test is given each year? Or is there some tweaking, adjusting, call-i-what-you-will of the Level 2 and Level 3 cut score thresholds after the test has been given and the score distributions are known by NYSED? Can Merryl Tisch get back to us on this…

  • Michael M.

    Thanks Fred.
    Note that if the questions are getting harder, and despite that the “Level” scores are getting better, and yet the School Progress Report Grades are getter better faster still… the Mayor is being unduly modest, yes modest, right here in Gotham City, which starts with “M,” which stands for “MIKE.”

    The schools are getting better even faster than the better he’s already bragging about. Weekly now, not just annually. Per glossy mailers filling landfills everywhere.

    I heard that next week’s mailer is going to credit Mayor Mike for inventing the whole concept of Public Education, while defenders in the Sloping Earth Society are already amping up talking points accusing his opponent of wanting to return us to the stone age.

    Of course, even in that scenario, your kid would STILL come home with a list of staples: slate tablet, chisel, mallet, etc.

  • Ellen McHugh

    …and ride to school on the dino-bus

  • QueensParent

    A quite circuitous route to arguing that “kids are actually dumber than we think they are.” I didn’t see any of you around saying these tests were too hard when NYC was in the low 30s and the suburban schools were the only ones getting high test scores but the minute NYC and other Big 5 school districts start doing well on these tests, well, something’s wrong, the tests must be too easy, or well, NYC hasn’t improved on a FEDERAL test, so the state test must be all wrong. How come the tests were fine as long as urban kids weren’t doing well on them?

Tips, questions, feedback?

Contact us at .

Follow GothamSchools

RSS

Feb. 10: You’re invited!

Chalk It Up

Recent Comments

0 comments so far today

Events Calendar

Our Twitter Updates

  • Despite some tense confrontations between protesters and police, nothing ever got physical and a lieutenant just said there were no arrests. 1 hr ago
  • He's been frozen in that stoic position all night MT @lisafleisher: A protester speaks with his middle finger. http://t.co/xLar4NRU 1 hr ago
  • Last of the occupy protesters just walked out together, shouting expletives and insults on their way out. #toughcrowd 1 hr ago
  • Frank Thomas, DOE spokesman just told me no arrests have been made tonight at PEP despite confrontation between protesters & police earlier. 2 hrs ago
  • RT @leoniehaimson: It's been shown repeatedly that as one schl closes another overwhelmed w/ high needs kids that small schls won't take 2 hrs ago
  • More updates...

Archives

February 2012
M T W T F S S
« Jan  
 12345
6789101112
13141516171819
20212223242526
272829  
?>