Education
School rankings don't tell you what you need to know
This is part 2 of a series on education in DC. See part 1.When choosing between public schools or deciding whether to send a child to private school or move to the suburbs, many parents look at the test scores for schools listed online. But DCPS, as in many states, reports just the percentage of students who scored "proficient" or higher in math and reading. That number actually doesn't tell parents what they need to know.
Let's take 2 children going to 2 different schools whose teachers are equally good. One year, a number of other kids, who happen to be doing worse, move from one school to the other. One school's proficiency numbers would go up, while the other would go down. But for all of the other kids, nothing may have changed.
A parent considering those 2 schools, however, would suddenly think one was better or worse than the other, and make choices on that basis. If those other kids influence things like classroom discipline, it might matter, but it could well be that both schools remain just as good as they were before. In short, the numbers are misleading.
How much will a school help your child?
Every parent wants the best for his or her child. A parent who has the ability to choose among places to live, and is choosing those based on the schools, will look at these test score rankings, but that's not really what they need to know.
What they need to know is simple: if I send my kid to this school versus that, will he or she come out more high-achieving? Or, another way to look at it is: if one could clone a kid and send him or her simultaneously to every school in the area, would the kid have more proficiency at the end at some of the schools versus others?
It doesn't inherently matter if at one school the other kids achieve more than at another; all a parent really cares about is his or her own kid. They don't need the percentage of proficiency. What they really want is what educators call a "value added" measure.
We could broadly say that there are 3 components of a kid's achievement:
- The influence of parents and others at home
- The quality of the teaching and instructional resources at the school
- The influence of other students
#3 matters if the other students have an effect. For example, a high achieving student might not learn as much if there are a lot of low-achieving students in the class because the teacher has to spend more time on basic concepts. Or, if most kids bully and mock higher-achieving students, that can encourage many kids to pretend not to know answers in class.
On the other hand, having a higher-achieving environment can push lower-achieving kids to work a little harder, perhaps, unless things are too hard, and a peer environment that rewards hard work can also positively reinforce a student's efforts.
If you look at a DCPS school profile, however, it doesn't separate #1, #2, and #3. A school with higher proficiency could only mean that the kids come from more privileged backgrounds (factor #1) and nothing about the experience at the school. It might be that sending your child to that school versus another has no actual benefit.
In fact, we can estimate some of this. Race does not equal income, but at DCPS it's very highly correlated. While Hardy Middle School in Georgetown has lower average proficiency than Deal Middle School, near Tenleytown, if you only look at white students the proficiency is the same.
Given that a kid's race (or income) is not going to change based on what school he or she goes to, there's no reason to believe the educational experience at one is better or worse. Yet a number of parents in the area don't want to send their kids there. Is it just a generalized fear of being in a school with poor and minority children?
Rankings can drive segregation
We know that lower-income students on average come into school worse off achievement-wise than their higher-income peers. There are many reasons for this. Their parents are less likely to have been able to spend as much time reading to the kids and teaching them outside of school. They are less likely to have sent the kid to camps and other programs with academic enrichment. Also, they might not have as many books available at home, and so on.
Moreover, studies have shown that lower-income kids perform worse on standardized tests in general. All of these factors add up to the fact that a school with only higher-income kids might have higher test scores than a school with a mix of incomes even if the intellectual ability and teacher quality are exactly the same.
Let's assume that you have a school with a bunch of terrific teachers that is doing a great job educating its kids. One year, a bunch of lower-income kids come into the school. Let's say that the teachers do just as terrific a job educating the existing kids and the new kids. Existing kids don't lose out at all. Yet the school's average test score will go down.
This reinforces the fact that we are measuring and reporting the wrong thing. But most people don't necessarily know this, nor do they have better data available, so they'll understandably choose the highest-performing school they can, even if "highest-performing" only really means "school with the fewest lower-income kids."
There are better metrics
Steve Glazerman pointed out some of these same flaws and recommended using a "value added" measure instead. This is the kind of calculation the IMPACT teacher evaluation uses, but DCPS could report an average across all teachers for the school.
Instead of answering the question, what percentage of kids at a school are doing well, which is very dependent on who goes there, this number would say how much each kid will probably gain from going to the school, which is really what will help parents make choices.
Have you used the school ranking data in DC or elsewhere? What would make it more useful?
Comments
- Young kids try to assault me while biking
- Focus transportation on downtown or neighborhoods?
- Metro bag searches aren't always optional
- Endless zoning update delay hurts homeowners
- DDOT agrees to repave 15th Street cycle track
- Redeveloping McMillan is the only way to save it
- Vienna Metro town center won't have a town center







by Megan on Jul 18, 2012 12:16 pm • link • report
An average "value-added" score for teachers at a school would help parents make choices, but only if that measuring tool for "value-added" is accurate, well-designed and statistically defensible.
I understand it is not. Is there any evidence that this tool measures what it says it measures?
by Trulee_Pist on Jul 18, 2012 12:17 pm • link • report
by citywalker on Jul 18, 2012 12:31 pm • link • report
An average "value-added" score for teachers at a school would help parents make choices, but only if that measuring tool for "value-added" is accurate, well-designed and statistically defensible.
I understand it is not. Is there any evidence that this tool measures what it says it measures?
Why do you think it isn't?
The main issue with the value-added metric has been how DCPS is using it. It works pretty well for analyzing aggregate performance, but should never be used to compare performance over multiple small sample sizes.
We can simplify it this way: each teacher has some quality level, and she is able to increase test scores for students at some rate consistent with that quality level. A value-added metric allows us to control for differences in students so that we can estimate that quality level. However! What we can't control for is the idiosyncratic differences in scores from day to day. If five students in a class of 25 arrive on a late bus and miss their breakfast before the test, we shouldn't use those results to argue that their teacher is worse than she should be. But if we take the value-added metric across many more students, we can allow those negative shocks to be roughly canceled out by positive shocks, and get a better measure of overall quality.
The additional issue with DCPS's use of the metric is that some teachers have cheated on the tests. This worked great for them, but no so great for their students' next teachers, who were told that they actually subtracted value from their students.
by Gray on Jul 18, 2012 12:32 pm • link • report
That's why the test metric is only half of the score and not the whole thing. The other half is made up of reviews from principals and multiple classroom observations.
by MLD on Jul 18, 2012 12:39 pm • link • report
You've just described my local elementary school. Great building, great teachers, motivated and high-performing students in the lower grades. But the students who are in the testing grades are all out-of-boundary, and almost none of the kids in the testing grades has been there for more than a year or two. (Not one of the fifth graders has been there since K).
Of course, the fact that the test scores in the upper grades are so low (coupled with looming middle school) make it less likely that parents will stay which simply reinforces the problem.
by oboe on Jul 18, 2012 12:45 pm • link • report
That's why the test metric is only half of the score and not the whole thing. The other half is made up of reviews from principals and multiple classroom observations.
Yep, that's why, but it shouldn't make up half either. It isn't an appropriate metric for evaluating individual teachers.
by Gray on Jul 18, 2012 12:50 pm • link • report
I think the bottom line is that as much as I love data and realize its value, choosing a school must also include visiting individual schools, considering intangibles such as school atmosphere, thinking of the particular desires for my kid (language immersion), and particular desires of my kid (arts integration, etc.).
by SE on Jul 18, 2012 12:52 pm • link • report
Do you know any studies/academic papers backing up what you're saying here? I'd be interested. Thanks.
by SE on Jul 18, 2012 12:54 pm • link • report
http://www.edweek.org/ew/articles/2012/03/01/kappan_hammond.html
by Gray on Jul 18, 2012 1:03 pm • link • report
by DCster on Jul 18, 2012 1:09 pm • link • report
by JustMe on Jul 18, 2012 1:13 pm • link • report
by KL on Jul 18, 2012 1:29 pm • link • report
Don't forget that you should also look at the specific programs a school offers that may be of interest to your specific kid. For example, if your kid is interested in math/science, does the school have a magnet program or partner with organizations such as NIH for special programs? Or if your kid is more linguistically oriented, is there an IB program? Also, extracurricular offerings are important too which isn't something that's captured in test scores. For example, not every school in Arlington even has tennis courts if your kid is a tennis player. H-B Woodlawn only has one sports team for the entire school -- ultimate frisbee.
by Falls Church on Jul 18, 2012 1:34 pm • link • report
David downplays his item #3, "The influence of other students," writing mostly about how teachers would be able to allocate time, or if there are problems with bullying. But of course the influence of other students is important, because being in an environment in which one's peers are thinking about the same problems and asking the same questions has synergistic effects. Being surrounded by others who are passionate about any subject or pursuit makes one more passionate about that subject, and the result of the positive feedback loop (in the engineering sense, not the words-of-praise sense) is dramatic. It's the same reason why high-tech companies still have physical offices in which creative and technical folks gather daily.
Finally, the Value Added Modeling, which is a component of IMPACT, is meaningless numerology. Here's an overview of quantitative methods of evaluation teacher performance which references this examination of VAM methods. Quoting
And another study they reference found All of which I take to mean that no amount of fancy statistical methodology can get around the basic maxim of "Garbage In, Garbage Out."by thm on Jul 18, 2012 1:47 pm • link • report
by Tom Veil on Jul 18, 2012 2:29 pm • link • report
The New Teacher Project (Admittedly on the opposite side of the spectrum from D-H) points out several ways that value-added is useful: http://tntp.org/ideas-and-innovations/view/myths-and-facts-about-value-added-analysis
The Gates Foundations' MET project (http://www.metproject.org/) also advances the policy of including value-added as one of several measurements of teacher effectiveness.
Of course, value-added data is only as good as the test it's based on, and small sample sizes can be problematic. But it's simply untrue that it's mere numerology.
by SchoolWatcher on Jul 18, 2012 2:44 pm • link • report
by Gray on Jul 18, 2012 3:03 pm • link • report
by SchoolWatcher on Jul 18, 2012 4:49 pm • link • report
The DC-CAS tests reflect information - starting in third grade - on test proficiency, race, economic status, and language.
1. Test proficiency is a limited predictor of parental education, mixed with teacher involvement.
2. Outside of white children(due to known information about incomes of white DC families), race or Hispanic background is a poor predictor of middle class/striver parents.
3. Economic status predicts only part of what you need to know if you're looking for middle class families, but the break isn't set at middle income vs. poor, it's set at poor vs. very poor.
4. The language competence thing is only relevant in DC's Salvadoran-heavy schools, of which there just aren't very many.
5. And I don't see anything in these "report cards" that reflects antisocial behavior or a teacher's added value.
Does anybody know how to get or get proxies for the following?
1. Some kind of proficiency data for preschool or Kindergarten, to see whether the kids are already behind by the time they start school.
2. How many children have at least one live-in parent with an accredited college degree.
3. The number of parents who can contribute significantly to school improvement with either time or money.
4. Parental unemployment rate.
5. Number of security incidents that took place, by year, by problematic student's grade in school.
6. Subjective, non-numerical reviews of the strengths and weaknesses of your kid's potential teachers by more than one qualified outsider.
7. A published curriculum for a given school year.
So - fault all of this as much as you want. These are the data I want as a potentially entering parent.
by arf on Jul 18, 2012 5:43 pm • link • report
It's definitely not "numerology," -- it's a regression, and founded in statistics -- even if there are folks who believe the variabilities of a single class are too high to use it in individual teachers' ratings.
I couldn't tell that you were responding to me because I never claimed it's magic or not founded in basic statistical theory.
Yes, it's a regression. And yes, the error term will often swamp any actual effect at the class-year level. It's a useful tool at the aggregate level, but applying it to measure the quality of individual teachers is not a statistically sound practice.
by Gray on Jul 18, 2012 8:02 pm • link • report
arf - While you won't find many of the statistics about family or parental income levels, you should be able to get information about a course curriculum from a principal. They may also be willing to share average student growth on widely-used comprehensive tests (GOLD is one example for DCPS in the ECE environment.)
by SchoolWatcher on Jul 18, 2012 10:27 pm • link • report
But asserting that VAM can measure what it says to measure does not make it so. Any regression analysis presumes, among many other things, that the dependent variable (e.g. student test scores) can be expressed as a certain linear combination of a certain set of independent variables (e.g. teacher quality, income level, parents education), and 'controlling' for various factors means changing around the parameters that are included and re-running the computations. If the underlying model is wrong, you'll still get numerical results, but they will be meaningless.
This is what the result from Rothstein (cited by, but not written by Darling-Hammond) demonstrates. If the methods of VAM show that fifth-grade teachers predict fourth-grade scores, then there is something very wrong with the model and the methods. I don't see how one can get around that. That's why I call it 'numerology.'
The New Teacher Project 'fact sheet,' and the Brookings report it references to support the claim that there exist academics who believe VAM has "an important role to play" do not even attempt to address this problem. They simply assert that VAM does what it purports to do, which Rothstein has demonstrated that it doesn't.
by thm on Jul 19, 2012 2:35 am • link • report
I would agree that a Value Added measure that was believable would be good. I haven't seen one yet but efforts are being made in that direction (impact etc) The DC CAS scores tend to give you an objective feel for the competence of the kids in a school. It doesn't tell you why they score well (teacher, family, peers).
As a parent of 4 DCPS and Charter students I am most concerned about peers, which DC CAS addresses. My kids have had great teachers at "so-so" schools and middling teachers at "great" schools. So that is hard to control for. Peers are what matter most and what you can get a feel for in DC.
by LeeinDC on Jul 19, 2012 9:41 am • link • report
What they found was that the graduation rate went up. Prior to the top 10% rule, kids who graduated in the middle of their high school class but had high SAT scores got in and kids who graduated high but had low scores didn't. The theory is that graduating high in your class shows work ethic and work ethic is far important to graduating from college then basic math and English skills. So, which high school you go to is only important to the extent that it does or does not teach you how to work hard and efficiently.
by David C on Jul 20, 2012 4:22 pm • link • report
Add a Comment