Standards, accountability, assessment, and curriculum

Search EdWorkingPapers

Search EdWorkingPapers by author, title, or keywords.

The Prevalence and Policy Implications of Between-School Heterogeneity in Learning Outcomes: Evidence from Six Public Education Systems

Daniel Rodriguez-Segura, Savannah Tierney. 04/2024

While learning outcomes in low- and middle-income countries are generally at low levels, the degree to which students and schools more broadly within education systems lag behind grade-level proficiency can vary significantly. A substantial portion of existing literature advocates for aligning curricula closer to the proficiency level of the “median child” within each system. Yet, amidst considerable between-school heterogeneity in learning outcomes, choosing a single instructional level for the entire system may still leave behind those students in schools far from this level. Hence, establishing system-wide curriculum expectations in the presence of significant between-school heterogeneity poses a significant challenge for policymakers — especially as the issue of between-school heterogeneity has been relatively unexplored by researchers so far. This paper addresses the gap by leveraging a unique dataset on foundational literacy and numeracy outcomes, representative of six public educational systems encompassing over 900,000 enrolled children in South Asia and West Africa. With this dataset, we examine the current extent of between-school heterogeneity in learning outcomes, the potential predictors of this heterogeneity, and explore its potential implications for setting national curricula for different grade levels and subjects. Our findings reveal that between-school heterogeneity can indeed present both a severe pedagogical hindrance and challenges for policymakers, particularly in contexts with relatively higher levels of performance and in the higher grades. In response to meaningful between-system heterogeneity, we also demonstrate through simulation that a more nuanced, data-driven targeting of curricular expectations for different schools within a system could empower policymakers to effectively reach a broader spectrum of students through classroom instruction.

Download 04/20245.63 MB

Does One Plus One Always Equal Two? Examining Complementarities in Educational Interventions

Umut Özek. 03/2024

Public policies targeting individuals based on need often impose disproportionate burden on communities that lack the resources to implement these policies effectively. In an elementary school setting, I examine whether community-level interventions focusing on similar needs and providing resources to build capacity in these communities could improve outcomes by improving the effectiveness of individual-level interventions. I find that the extended school day policy that targets lowest-performing schools in reading in Florida significantly improved the effectiveness of the third-grade retention policy in these schools. These complementarities were large enough to close the gap in retention effects between targeted and higher-performing schools.

Download 03/20241.38 MB

The Notorious SBG: Administrators’ Perceptions of Standards-Based Grading Practices

Sarah Ruth Morris, Andy Parra-Martinez, Jonathan Wai, Robert Maranto. 02/2024

This mixed-methods study synthesizes Standards-Based Grading (SBG) literature, analyzes 249 Arkansas administrators' survey responses using OLS regressions, and identifies themes through in-vivo coding of qualitative feedback. Results show more SBG support among liberal, elementary-level administrators in larger, economically diverse districts. Qualitative insights highlight structural barriers and mindsets against SBG, emphasizing its importance for mastery-focused assessment and grading alignment. These findings underscore the influence of principals' beliefs on SBG support and suggest researching the contextual and ideological factors influencing SBG's implementation.

Download 02/2024835.17 KB

Taking Teacher Evaluation to Scale: The Effect of State Reforms on Achievement and Attainment

Joshua Bleiberg, Eric Brunner, Erica Harbatkin, Matthew A. Kraft, Matthew G. Springer. 02/2024

Federal incentives and requirements under the Obama administration spurred states to adopt major reforms to their teacher evaluation systems. We examine the effects of these reforms on student achievement and attainment at a national scale by exploiting their staggered implementation across states. We find precisely estimated null effects, on average, that rule out impacts as small as 0.017 standard deviations for achievement and 1.2 percentage points for high school graduation and college enrollment. We highlight five factors that likely limited the efficacy of teacher evaluation at scale: political opposition, decentralization, capacity constraints, limited generalizability, and the absence of compensating wages.

Download 02/20241.68 MB

GED® College Readiness Benchmarks and Post-Secondary Success

Blake Heller. 02/2024

In 2016, the GED® introduced college readiness benchmarks designed to identify testers who are academically prepared for credit-bearing college coursework. The benchmarks are promoted as awarding college credits or exempting “college-ready” GED® graduates from remedial coursework. I show descriptive evidence that those identified as college-ready by these benchmarks enroll and persist in college at significantly higher rates than others who pass the GED® exam, but at lower rates than recent graduates with traditional high school diplomas. Regression discontinuity estimates show that crossing a college readiness threshold does not substantially influence testers' college enrollment or persistence during the two years following their first test attempt. Relatedly, I observe little exam retaking by those who fall narrowly short of the minimum college readiness score thresholds. This contrasts strongly with retaking behavior near the lower GED® passing threshold that determines eligibility for a high school equivalency credential. Those who narrowly fail a GED® subject test are over 100 times more likely to retest than those who fall just short of a college readiness benchmark in the same subject. GED® college readiness benchmarks do not currently appear to promote better college outcomes, but in the absence of more detailed test score information they offer a simple heuristic to predict short-run college enrollment and persistence among GED® graduates, particularly for those who identify educational gain as a primary reason for testing. The results highlight the promise and challenges associated with building pathways for non-traditional students to earn credit for prior learning.

Download 02/20241.18 MB

Racialized Reactivity: How Metrics-Formation Contributed to a Racialized Organizational Order in Medical Education

Heather McCambly, Quinn Mulroy, Andrew Stein. 02/2024

A common point of contention across education policy debates is whether and how facially race-neutral metrics of quality produce or maintain racialized inequities. Medical education is a useful site for interrogating this relationship, as many scholars point to the 1910, Carnegie-funded Flexner Report—which proposed standardized quality metrics—as a main driver of the closure of five of the seven Black medical schools. Our research demonstrates how these proposed quality metrics, and their philanthropic and political advocates, instantiated a racialized organizational order that governed the distribution of resources, the development of state certification processes, and the regulation of medical schools. This analysis provides traction for uncovering how taken-for-granted standards of quality come to maintain racialized access to opportunity in education.

Download 02/20241010 KB

Misclassification of Career and Technical Education Concentrators: Analysis and Policy Recommendations

Yue Huang, Hojung Lee, Arielle Lentz, Kenneth A. Shores. 01/2024

Career and Technical Education (CTE) prepares students for life beyond high school by providing practical labor skills, workforce credentials, and early post-secondary credits. States are required to report the number of CTE concentrators to receive federal Perkins funding, but systems of identifying students as concentrators vary among states. We analyzed two distinct concentrator identification strategies, one based on local education agency administrator reporting and another universal screening system using transcript data. Analyses revealed moderate amounts of mismeasurement in concentration status and modest amounts of systematic mismeasurement penalizing students who qualify for free or reduced-price lunch, English language services, and special education services.

Download 01/2024757.41 KB

Bending Without Breaking - COVID-19 Tests the Resilience of State Education Policymaking Institutions

David Menefee-Libey, Carolyn Herrington, Kyoung-Jun Choi, Julie Marsh, Katrina Bulkley. 12/2023

COVID-19 upended schooling across the United States, but with what consequences for the state-level institutions that drive most education policy? This paper reports findings on two related research questions. First, what were the most important ways state government education policymakers changed schools and schooling from the moment they began to reckon with the seriousness of COVID-19 through the first full academic year of the pandemic? Second, how deep did those changes go – are there indications the pandemic triggered efforts to make lasting changes in states’ education policymaking institutions? Using multiple-methods research focused on Colorado, Florida, Louisiana, Michigan, and Oregon, we documented policies enacted during the period from March 2020 through June 2021 across states and across sectors (traditional and choice) in three COVID-19-related education policy domains: school closings and reopenings, budgeting and resource allocation, and assessment and accountability systems. We found that states quickly enacted radical changes to policies that had taken generations to develop. They mandated sweeping school closures in Spring 2020, and then a diverse array of school reopening policies in the 2020/2021 school year. States temporarily modified their attendance-based funding systems and allocated massive federal COVID-19 relief funds. Finally, states suspended annual student testing, modified the wide array of accountability policies and programs linked to the results of those tests, and adapted to new assessment methods. These crisis-driven policy changes deeply disrupted long-established patterns and practices in education. Despite this, we found that state education governance systems remained resilient, and that at least during the first 16 months of the pandemic, stakeholders showed little interest in using the crisis to trigger more lasting institutional change. We hope these findings enable state policymakers to better prepare for future crises.

Download 12/2023849.28 KB

Can States Sustain and Replicate School District Improvement? Evidence from Massachusetts

Beth Schueler, Liz Nigro, John Wang. 11/2023

The improvement of low-performing school systems is one potential strategy for mitigating educational inequality. Some evidence suggests districtwide reform may be more effective than school-level change, but limited research examines district-level turnaround. There is also little scholarship examining the effects of turnaround reforms on outcomes beyond the first few years of implementation, on outcomes beyond test scores, or on the effectiveness of efforts to replicate district improvement successes beyond an initial reform context. We study these topics in Massachusetts, home to the Lawrence district representing a rare case of demonstrated improvements in the early years of state takeover and turnaround and where state leaders have since intervened in three other contexts as a result. We use statewide student-level administrative data (2006-07 to 2018-19) and event study methods to estimate medium-term reform impacts on test and non-test outcomes across four Massachusetts-based contexts: Lawrence, Holyoke, Springfield, and Southbridge. We find substantial district improvement was possible although sustaining the rate of gains was more complicated. Replicating gains in new contexts was also possible but not guaranteed.

Download 11/20237.02 MB

Early Algebra Affects Peer Composition

Quentin Brummet, Lindsay Liebert, Thurston Domina, Paul Yoo, Andrew Penner. 11/2023

Although existing research suggests that students benefit on a range of outcomes when they enroll in early algebra classes, policy efforts that accelerate algebra enrollment for large numbers of students often have negative effects. Explanations for this apparent contradiction often emphasize the potential role of teacher and peer effects, which could create positive effects for individual students placed into early algebra that would not translate to larger-scale policies. We use detailed data from Oregon that contain information on the teachers and peers to whom students are exposed in order to investigate these explanations. Our regression discontinuity analyses replicate key findings from prior studies, indicating that placement in eighth-grade algebra boosts student achievement in math and English language arts. We then demonstrate that eighth-grade algebra placement positively affects the achievement level of students’ classmates, as well as the years of experience and value added of students’ math teachers. The effects on peer composition that we observe are large enough to plausibly explain the majority of the effects of eighth-grade algebra on student test scores.

Download 11/20231.46 MB