Search for EdWorkingPapers here by author, title, or keywords.
Educator preparation, professional development, performance and evaluation
Student surveys are widely used to evaluate university teaching and increasingly adopted at the K-12 level, although there remains considerable debate about what they measure. Much disagreement focuses on the well-documented correlation between student grades and their evaluations of instructors. Using individual-level data from 19,000 evaluations of 700 course sections at a flagship public university, we leverage both within-course and within-student variation to rule out popular explanations for this correlation. Specifically, we show that the relationship cannot be explained by instructional quality, workload, grading stringency, or student sorting into courses. Instead, student grade satisfaction -- regardless of the underlying cause of the grades -- appears to be an important driver of course evaluations. We also present results from a randomized intervention with potential to reduce the magnitude of the association by reminding students to focus on relevant teaching and learning considerations and by increasing the salience of the stakes attached to evaluations for instructor careers. However, these prove ineffective in muting the relationship between grades and student scores.
To study beliefs about ability and STEM major choice, I conduct a field experiment where I provide students with information that they are above average in their top fields of study. I find that STEM students are more likely to switch out of their major and that non-STEM students fail to switch into STEM at the same rates as other fields. I also find that learning you are above average in your top field of study increases STEM major choice by almost a third, as STEM students appear more like to persist and non-STEM students increase their switching into STEM fields.
Using administrative data from D.C. Public Schools, I use exogenous variation in the presence and intensity of teacher monitoring to show it significantly improves student test scores and reduces suspensions. Uniquely, my setting allows me to separately identify the effect of pre-evaluation monitoring from post-evaluation feedback. Monitoring's effect is strongest among teachers with a large incentive to increase student test scores. As tests approach, unmonitored teachers sacrifice higher-level learning, classroom management, and student engagement, even though these pedagogical tasks are among the most effective. One possible explanation is teachers ``teach to the test'' as a risk mitigation strategy, even if it is less effective on average. This is supported by showing teaching to the test has a smaller effect on student test score variance than other teaching approaches. These results illustrate the importance of monitoring in contexts where teachers have the strongest incentive to deviate from pedagogically sound practices.
As states and districts expand their goals for equitable mathematics instruction to focus on cultural responsiveness and rigor, it is critical to understand how teachers integrate multiple teaching approaches. Drawing on survey data from a larger study of professional learning, we use mixture modeling to identify seven unique ways that middle school mathematics teachers integrate ambitious, traditional, and culturally responsive (CR) mathematics instruction. The resulting typology is driven almost exclusively by variation in CR teaching. About half of teachers reported rarely engaging in CR teaching. Teachers who emphasized CR teaching tended to be teachers of color and have high CR teaching self-efficacy. Findings suggest that tailoring teacher development to how teachers blend multiple approaches may best support equitable mathematics instruction.
Multiple meta-analyses have now documented small positive effects of teacher professional development (PD) on pupil test scores. However, the field lacks any validated explanatory account of what differentiates more from less effective in-service training. As a result, researchers have little in the way of advice for those tasked with designing or commissioning better PD. We set out to remedy this by developing a new theory of effective PD based on combinations of causally active components targeted at developing teachers’ insights, goals, techniques, and practice. We test two important implications of the theory using a systematic review and meta-analysis of 104 randomized controlled trials, finding qualified support for our framework. While further research is required to test and refine the theory, we argue that it presents an important step forward in being able to offer actionable advice to those responsible for improving teacher PD.
There is broad consensus across academic disciplines that access to same-race/ethnicity teachers is a critical resource for supporting the educational experiences and outcomes of Black, Hispanic, and other students of color. While theoretical and qualitative lines of inquiry further describe a set of teacher mindsets and practices aligned to “culturally responsive teaching” as likely mechanisms for these effects, to date there is no causal evidence on this topic. In experimental data where upper-elementary teachers were randomly assigned to classes, I find large effects upwards of 0.45 standard deviations of teachers of color on the short- and longer-term social-emotional, academic, and behavioral outcomes of their students. These average effects are explained in part by teachers’ growth mindset beliefs that student intelligence is malleable rather than fixed, interpersonal relationships with students and families, time spent planning for and differentiating instruction for individual students’ needs, and the extent to which teachers lead well-organized classrooms in which student (mis)behavior is addressed productively without creating a negative classroom climate.
Paraeducators perform multiple roles in U.S. classrooms, including among others preparing classroom activities, working with students individually and in small groups, supporting individualized programming for students with disabilities, managing classroom behavior, and engaging with parents and communities. Yet, little research provides insights into this key group of educators. This study combines an analysis of national administrative data to describe the paraeducator labor market with a systematic review of collective bargaining agreements and other job-defining documents in ten case-study districts. We find a large and expanding labor market of paraeducators, far more diverse along ethnic and racial lines than certified teachers but with far lower wages, fewer performance incentives, less professional development, and fewer opportunities for advancement within the profession.
Personnel evaluation systems have historically failed to identify and remediate low-performing teachers. In 2012, Chicago Public Schools implemented an evaluation system that incorporated remediation and dismissal plans for low-rated teachers. Regression discontinuity estimates indicate that the evaluation reform increased the exit of low-rated tenured teachers by 50 percent. The teacher labor supply available to replace low-rated teachers was higher performing on multiple dimensions, and instrumental variables estimates indicate that policy-induced exit of low-rated teachers significantly improved teacher quality in subsequent years. Policy simulations show that the teacher labor supply in Chicago is sufficient to remove significantly more low-performing teachers.
Principals shape the academic setting of schools. Yet, there is limited evidence on whether principal professional development improves schooling outcomes. Beginning in 2008-09, Pennsylvania’s Inspired Leadership (PIL) induction program required that newly hired principals complete targeted in-service professional development tied to newly established state leadership standards within five years of employment. Using panel data on all Pennsylvania students, teachers, and principals, we leverage variation in the timing of PIL induction across principal-school cells and employ difference-in-differences and event study strategies to estimate the impact of PIL induction on teacher and student outcomes. We find that PIL induction increased student math achievement through improvements in teacher effectiveness, and that the effects of PIL induction on teacher effectiveness were concentrated among the most economically disadvantaged and urban schools in Pennsylvania. Principal professional development had the greatest impact on teacher effectiveness when principals completed PIL induction during their first two years in the principalship. We also find evidence that teacher turnover declined in the years following the completion of PIL induction. We discuss the implications of our findings for principal induction efforts.
Narrative accounts of classroom instruction suggest that external interruptions, such as intercom announcements and visits from staff, are a regular occurrence in U.S. public schools. We study the frequency, nature, and duration of external interruptions in the Providence Public School District (PPSD) using original data from a district-wide survey and classroom observations. We estimate that a typical classroom in PPSD is interrupted over 2,000 times per year, and that these interruptions and the disruptions they cause result in the loss of between 10 to 20 days of instructional time. Administrators appear to systematically underestimate the frequency and negative consequences of these interruptions. We propose several organizational approaches schools might adopt to reduce external interruptions to classroom instruction.