Psychometric Quality of Measures of Learning Outcomes in Low- and Middle-Income Countries

Masha Bertling; Abhijeet Singh; Karthik Muralidharan

by

Masha Bertling

,

Abhijeet Singh

and

Karthik Muralidharan

March 21, 2023

We investigate the properties of measures of learning outcomes, as these are the tools commonly used to monitor the progress toward identifying the most effective interventions. We review test properties across 158 studies and conduct item-level psychometric analysis of a subset of these studies to show that current tests vary widely in scope, content, administration, and analysis. Researchers rarely provide details about the properties of their test scores. Only 4 percent of studies we review provide reliability estimates of their tests, and 10 percent archive item-level replication data to evaluate test quality post hoc. The interpretation of any estimates is necessarily sensitive to the measurement of the core variables, even where treatments are randomly assigned. Since estimates of treatment effects are often expressed in standard deviation units, measurement error can bias treatment effects toward zero. Content analysis of question wordings reveals substantial variation in content coverage of the skills tested, even when students of similar grades are being tested in similar subjects. The findings indicate that comparisons of treatment effects must consider degrees of measurement error that are often unavailable and the content breadth of the tests to contextualize why effects may differ on substantively different outcome variables.

Topics

Education and Child Well-Being

CITATION

Bertling, Masha, Abhijeet Singh, and Karthik Muralidharan. 2023. Psychometric Quality of Measures of Learning Outcomes in Low- and Middle-Income Countries. Center for Global Development.

DISCLAIMER & PERMISSIONS

CGD's publications reflect the views of the authors, drawing on prior research and experience in their areas of expertise. CGD is a nonpartisan, independent organization and does not take institutional positions. You may use and disseminate CGD's publications under these conditions.

Thumbnail image by: Adobe Stock

WORKING PAPER

Psychometric Quality of Measures of Learning Outcomes in Low- and Middle-Income Countries

Recommended

WORKING PAPER

How Big Are Effect Sizes in International Education Studies?

Blog Post

A “Rosetta Stone” for Comparing Test Scores

Topics

CITATION

DISCLAIMER & PERMISSIONS

WORKING PAPER

Psychometric Quality of Measures of Learning Outcomes in Low- and Middle-Income Countries

Recommended

WORKING PAPER

How Big Are Effect Sizes in International Education Studies?

Blog Post

A “Rosetta Stone” for Comparing Test Scores

Topics

CITATION

DISCLAIMER & PERMISSIONS

More Reading

Blog Post

From 1,056 Studies to 49 Candidates for Tracking Learning’s Long-Run Effects

Blog Post

How and Where We Ask Matters: New Evidence on Measuring Violence Against Children in Schools

Blog Post

Introducing IDEA—the International Development Economics Association

CGD NOTE

Identifying Studies for the Return to Learning Initiative: An Updated Candidate Studies Dataset

Sign up to get weekly development updates: