Search EdWorkingPapers

Search for EdWorkingPapers here by author, title, or keywords.

Standards, accountability, assessment, and curriculum

Emily Morton, Paul Thompson, Megan Kuhfeld.

Four-day school weeks are becoming increasingly common in the United States, but their effect on students’ achievement is not well-understood. The small body of existing research suggests the four-day schedule has relatively small, negative average effects (~-0.02 to -0.09 SD) on annual, standardized state test scores in math and reading, but these studies include only a single state or are limited by using district-level data. We conduct the first multi-state, student-level analysis that estimates the effect of four-day school weeks on student achievement and a more proximal measure of within-year growth using NWEA MAP Growth assessment data. We conduct difference-in-differences analyses to estimate the effect of attending a four-day week school relative to attending a five-day week school. We estimate significant negative effects of the schedule on spring reading achievement (-0.07 SD) and fall-to-spring achievement gains in math and reading (-0.06 SD in both). The negative effects of the schedule are disproportionately larger in non-rural schools than rural schools and for female students, and they may grow over time. Policymakers and practitioners will need to weigh the policy’s demonstrated negative average effects on achievement in their decisions regarding how and if to implement a four-day week.

More →


John Papay, Ann Mantil, Richard J. Murnane.

Many states use high-school exit examinations to assess students’ career and college readiness in core subjects. We find meaningful consequences of barely passing the mathematics examination in Massachusetts, as opposed to just failing it. However, these impacts operate at different educational attainment margins for low-income and higher-income students. As in previous work, we find that barely passing increases the probability of graduating from high school for low-income (particularly urban low-income) students, but not for higher-income students. However, this pattern is reversed for 4-year college graduation. For higher-income students only, just passing the examination increases the probability of completing a 4-year college degree by 2.1 percentage points, a sizable effect given that only 13% of these students near the cutoff graduate.

More →


Walter Herring.

Because high-stakes testing for school accountability does not begin until third grade, accountability ratings for elementary schools do not directly measure students’ academic progress in grades K through 2. While it is possible that children’s test scores in grades 3 and above are highly correlated with children’s outcomes in the untested grades, research provides reasons to believe that this might not be the case in all schools. This study explores whether measures of school quality based on test scores in grades 3 through 5 serve as a strong proxy for children’s academic outcomes in grades K through 2. The results show that directly accounting for children’s test scores in the early grades could lead to meaningful changes in schools’ test-based performance ratings. The findings have important implications for accountability policy.

More →


Morgan S. Polikoff, Laura M. Desimone, Andrew C. Porter, Michael S. Garet, Amy Stornaiuolo, Katie Pak, Toni M. Smith, Mengli Song, Nelson Flores, Lynn S. Fuchs, Douglas Fuchs, T. Philip Nichols.

Standards have been at the heart of state and federal efforts to improve education for several decades. Most recently, standards-based reforms have evolved with a focus on more ambitious "college- and career-ready" (CCR) standards. This paper synthesizes the results of a seven-year national research center focused on the implementation and effects of CCR standards. The paper draws on evidence from a quasi-experimental longitudinal study using NAEP data, a cluster-randomized trial of an alignment feedback intervention, and detailed implementation data from state-representative surveys and case studies of five districts. Situating our work in a "policy attributes theory," we find important gaps in the theory of change underlying current standards-based reform efforts. We conclude that the CCR standards movement is not succeeding in achieving its desired outcomes. We make specific suggestions for improving instructional policy, including a) providing more specific instructional guidance, b) reconceptualizing professional learning, c) building buy-in through the involvement of trusted leaders, d) providing better supports for differentiation, and e) devoting attention and guidance to the intersection of content and pedagogy, and f) addressing persistent deficit thinking among educators. 

More →


Monnica Chan, Zachary Mabel, Preeya Pandya Mbekeani.

Performance-based funding models for higher education, which tie state support for institutions to performance on student outcomes, have proliferated in recent decades. Some states have designed these policies to also address educational attainment gaps by including bonus payments for traditionally low-performing groups. Using a Synthetic Control Method research design, we examine the impact of these funding regimes on race-based completion gaps in Tennessee and Ohio. We find no evidence that performance-based funding narrowed race-based completion gaps. In fact, contrary to their intended purpose, we find that performance-based funding widened existing gaps in certificate completion in Tennessee. Across both states, the estimated impacts on associate degree outcomes are also directionally consistent with performance-based funding exacerbating racial inequities in associate degree attainment.

More →


Dan Goldhaber, Zeyu Jin, Richard Startz.

We present new estimates of the importance of teachers in early grades for later grade outcomes, but unlike the existing literature that examines teacher “fade-out,” we directly compare the contribution of early-grade teachers to later year outcomes against the contributions of later year teachers to the same later year outcomes. Where the prior literature finds that much of the contribution of early teachers fades away, we find that the contributions of early-year teachers remain important in later grades. The difference in contributions to eighth-grade outcomes between an effective and ineffective fourth-grade teacher is about half the difference among eighth-grade teachers. The effect on eighth-grade outcomes of replacing a fourth-grade teacher who is below the 5th percentile with a median teacher is about half the underrepresented minority (URM)/non-URM achievement gap. Our results reinforce earlier conclusions in the literature that teachers in all grades are important for student achievement.

More →


Todd Pugatch, Paul Thompson.

Can public university honors programs deliver the benefits of selective undergraduate education within otherwise nonselective institutions? We evaluate the impact of admission to the Honors College at Oregon State University, a large nonselective public university. Admission to the Honors College depends heavily on a numerical application score. Nonlinearities in admissions probabilities as a function of this score allow us to compare applicants with similar scores, but different admissions outcomes, via a fuzzy regression kink design. The first stage is strong, with takeup of Honors College programming closely following nonlinearities in admissions probabilities. To estimate the causal effect of Honors College admission on human capital formation, we use these nonlinearities in the admissions function as instruments, combined with course-section fixed effects to account for strategic course selection. Honors College admission increases course grades by 0.10 grade points on the 0-4 scale, or 0.14 standard deviations. Effects are concentrated at the top of the course grade distribution. Previous exposure to Honors sections of courses in the same subject is a leading potential channel for increased grades. However, course grades of first-generation students decrease in response to Honors admission, driven by low performance in natural science courses. Results suggest that selective Honors programs can accelerate skill acquisition for high-achieving students at public universities, but not all students benefit from Honors admission.

More →


Edward J. Kim.

This study introduces the signal weighted teacher value-added model (SW VAM), a value-added model that weights student-level observations based on each student’s capacity to signal their assigned teacher’s quality. Specifically, the model leverages the repeated appearance of a given student to estimate student reliability and sensitivity parameters, whereas traditional VAMs represent a special case where all students exhibit identical parameters. Simulation study results indicate that SW VAMs outperform traditional VAMs at recovering true teacher quality when the assumption of student parameter invariance is met but have mixed performance under alternative assumptions of the true data generating process depending on data availability and the choice of priors. Evidence using an empirical data set suggests that SW VAM and traditional VAM results may disagree meaningfully in practice. These findings suggest that SW VAMs have promising potential to recover true teacher value-added in practical applications and, as a version of value-added models that attends to student differences, can be used to test the validity of traditional VAM assumptions in empirical contexts.

More →


Ana P. Cañedo, Paul T. von Hippel.

Von Hippel & Cañedo (2021) reported that US kindergarten teachers placed girls, Asian-Americans, and children from families of high socioeconomic status (SES) into higher ability groups than their test scores alone would warrant. The results fit the view that teachers were biased.

This comment asks whether parents’ lobbying for higher placement might explain these results. The answer, for the most part, is no. Measures of parent-teacher contact explained little variation in children’s ability group placement, and did not account for the higher placement of girls, Asian-Americans, or high-SES children. In fact, Asian-American parents had less teacher contact than did white children. It appears that the biases observed by von Hippel & Cañedo resided primarily in teachers, not in parents.

We also ask whether teachers who used more objective assessment techniques were less biased in placing children into higher and lower ability groups. The answer, again, was no. Unfortunately, biases persisted in the face of objective information about students’ skill. Fortunately, the biases were not terribly large.

More →


Heather McCambly, Quinn Mulroy.

Public discussions of racial inclusion and equal opportunity initiatives in the U.S. are often met with claims that expanding access to an institution, space, or public good is likely to diminish its quality. Examples of this pattern include: anticipated (and real) property value declines when predominantly white neighborhoods become more racially diverse; fears that the excellence of white schools will decline when the population of Black and brown students grows; apprehensions that equitable hiring practices necessarily entail lower standards for job candidates. In this paper, we examine how a federal agency, the Fund for the Improvement of Postsecondary Education (FIPSE), charged with addressing the aftermath of the ‘access wave’ of new college students promulgated by the Higher Education Act of 1965, came to reconcile its commitments to educational equity and quality. Through a novel examination of the historical development of what we term (e)quality politics in the administration of civil rights policy in higher education, we trace how two concepts - equity and quality – became discursively linked and contested in American politics. (E)quality politics refers to the introduction of a policy paradigm that reframes equity discussions and goals around the professed need to preserve and advance institutional “quality” using measures and standards that are, importantly, defined and instantiated under the era of segregation that precedes equal access policies. In particular, we uncover the discursive patterns by which the perceived threats to “quality” posed by racial diversity can prompt administrators to compensate, protect, and maintain the prerogatives of high-status institutions or groups that benefited under previous eras of exclusion. Understood as part of a backlash to egalitarian reforms, we argue, these quality measures undermine equity goals.

More →