Research Report

April 202640 min read

The Literacy Crisis: How We Taught Millions of Kids Not to Read

A report on the structural failure of American reading instruction — documented across pedagogy, policy, demographics, publishing, and a fifty-year war over a question science already answered.

When Jaylen enters third grade in a mid-size American city, his teacher notices that he reads slowly, guesses at words he doesn't know, and skips lines when the text gets hard. She refers him for evaluation. The assessment finds him reading nearly two years below grade level. The label attached to his file — "learning disability" — will follow him through elementary school, into middle school, and into the statistical record that documents what American education does to low-income Black boys who cannot decode.

What Jaylen's file will not reflect is that his reading failure was not inevitable. It was not neurological. It was not the product of poverty alone, though poverty is always present in this story. It was the product of a decision made before he was born — a decision about how to teach children to read — that was wrong when it was made, was known to be wrong within a decade, and was nonetheless repeated in the classroom where Jaylen sat for four years learning to guess rather than decode.

Jaylen is not unlucky. He is American. And he is one of approximately 1.6 million fourth-graders who scored below the Basic reading level on the 2024 National Assessment of Educational Progress — the largest share since 2002, in a country that has been spending more on education, not less, for the entirety of his life.

This report documents how that happened: the scientific consensus that was ignored, the pedagogical ideology that replaced it, the publishing industry that profited from the replacement, the teacher preparation pipeline that transmitted the error into hundreds of thousands of classrooms, the demographics of who paid the price, and the reform movement that is now — perhaps too late for Jaylen, perhaps not too late for the children behind him — trying to undo forty years of damage.

The structural causes are understood. The interventions are documented. The question the data does not answer — the one each generation will have to answer for itself — is why a failure this thoroughly mapped has proven this resistant to repair.

Part One: The Numbers Are Not Improving

The standard framing of the literacy crisis is that the pandemic broke something that was previously working. That framing is inaccurate, and the inaccuracy matters because it misidentifies the timeline and therefore the cause.

American reading scores were in decline before COVID-19. The 2019 NAEP — the last pre-pandemic assessment — showed fourth-grade reading scores that had already fallen from a 2013 peak. The pandemic accelerated and deepened a trend that was structural, not accidental. This distinction is not semantic. If COVID caused the crisis, the solution is recovery. If the crisis predates COVID and COVID merely exposed and deepened it, the solution is something more fundamental.

The 2024 NAEP results make the structural argument difficult to dismiss. Average fourth-grade reading scores fell to their lowest point since 2002, with 40% of fourth-graders scoring below the Basic level — the threshold at which students demonstrate partial mastery of grade-level skills. At eighth grade, a record 33% scored below Basic, the worst result in the assessment's history. No state recorded a statistically significant gain in fourth-grade reading between 2022 and 2024. The lowest-performing students — those at the 10th percentile — posted reading scores not seen since the early 1990s, and their pandemic-era losses grew 70% larger between 2022 and 2024 rather than recovering. More than $189 billion in federal pandemic relief funds were distributed to schools with no measurable effect on these numbers.

The international picture is not more reassuring. The 2022 PISA assessment, which tests 15-year-olds across 81 countries, recorded an average reading score decline of approximately 10 points across OECD nations — the largest single-cycle drop the assessment has ever measured. The 2021 Progress in International Reading Literacy Study found that 21 of 32 countries with comparable trend data scored lower in fourth-grade reading than they had in 2016. Pre-pandemic declines were already underway in Australia, Canada, New Zealand, and the Netherlands before the first school closed in 2020. Singapore was the only country to improve consistently across all five PIRLS cycles.

The adult literacy data confirms that what is failing in classrooms does not repair itself over time. The OECD's 2023 PIAAC survey — the most comprehensive assessment of adult skills in the developed world — found that 28% of American adults scored at or below the lowest measurable literacy level, up from 19% in 2017. That is a 9-percentage-point increase in six years. The average American adult literacy score fell 12 points between 2012 and 2023. Canada, which shares many of the same pedagogical traditions as the United States, now reports that 48% of its adults lack the literacy skills needed for everyday life, up from 42% in 2003. Australia reports roughly 44%. The OECD average adult literacy score has declined in 70% of member nations over the past decade.

The aggregate economic consequence is not speculative. Low literacy costs the United States an estimated $2.2 trillion annually in lost productivity, higher public assistance usage, reduced tax revenue, and increased healthcare and criminal justice expenditures — the last item reflecting the well-documented correlation between reading failure in third grade and eventual incarceration. Children who cannot read at grade level by third grade are four times less likely to graduate from high school than proficient readers, and 85% of children in the juvenile justice system have literacy problems. These are not coincidences. They are the downstream accounting of an instructional failure that was made at the front of a classroom, in a specific grade, with specific children, using specific methods that we now know did not work.

Part Two: The Reading Wars — A Self-Inflicted Wound

The roots of the current crisis extend further than the pandemic, further than the past decade of declining scores, and further than the publishing industry that will feature prominently later in this report. They reach back to 1967, to a single paper, to a specific argument that was contested from the moment of its publication and nonetheless won the ideological battle that followed.

In 1967, University of Arizona linguist Kenneth Goodman published "Reading: A Psycholinguistic Guessing Game" — a paper arguing that skilled readers do not decode words letter by letter or sound by sound, but instead predict words from context, sentence structure, illustrations, and the first letter of the word in question. Reading, in Goodman's framing, was not primarily a phonological process. It was a meaning-making process, and the way to teach it was to immerse children in rich language environments and let the process emerge naturally. Phonics instruction, in this view, was not merely unnecessary but potentially harmful — a mechanical drill that disrupted the natural development of reading as comprehension.

In the same year, Harvard professor Jeanne Chall published the results of a landmark comparative review of reading research. Her book Learning to Read: The Great Debate examined decades of studies comparing code-emphasis instruction — explicit phonics, systematic decoding — with meaning-emphasis approaches. Her conclusion was direct: code-emphasis instruction was superior for beginning readers across every measure examined, including reading achievement, word recognition, spelling, and oral reading. The children who learned to decode explicitly learned to read better.

The two publications represented genuinely incompatible theories of how reading works in the human brain. One of them was right. The research record accumulated over the subsequent fifty years would make it increasingly clear which one. But in 1967, the ideological climate of American education — shaped by progressive child-centered philosophy, by John Dewey's legacy, by a professional culture that valued teacher autonomy and distrusted systematic instruction — was predisposed to embrace Goodman's vision and resist Chall's.

What followed is sometimes called the Reading Wars, a framing that implies two roughly equivalent factions engaged in legitimate disagreement. The framing is misleading. The scientific consensus was never genuinely contested among reading researchers. The debate was between researchers and the education establishment — between what the science established and what the pedagogy culture preferred.

The whole language movement that Goodman's theory generated swept through American elementary education across the 1970s and 1980s. California formally adopted a whole language framework in 1987, making it the most consequential state-level instructional decision of the era. The results arrived with unusual clarity. In 1994, California fourth-graders scored at the absolute bottom of the NAEP reading assessment, tied with Louisiana and Guam. Fifty-nine percent scored below the Basic level. A state that had invested heavily in a pedagogical theory had produced, at scale, exactly the reading failure that theory's critics had predicted.

California reversed course. Congress commissioned the National Reading Panel, which in April 2000 delivered a 449-page synthesis of reading research that represented something rare in education policy: a genuine scientific consensus. The panel identified five essential components of effective reading instruction — phonemic awareness, phonics, fluency, vocabulary, and comprehension — and found that systematic phonics instruction produced measurably better outcomes than instruction that was unsystematic or absent. The report was not ambiguous. It was not contested by serious researchers. It was the science of reading, documented exhaustively, available to every school district, every teacher preparation program, and every educational publisher in the country.

It was not acted upon comprehensively. And the reasons it was not acted upon — a federal program that collapsed under scandal, an education culture resistant to external mandates, and a publishing industry with a billion-dollar stake in the status quo — constitute the second phase of the self-inflicted wound.

Part Three: Reading First, and How It Fell

The political response to the National Reading Panel was the most significant federal intervention in reading instruction in American history. President George W. Bush's No Child Left Behind Act, signed in January 2002, included Reading First — a six-year, $6 billion initiative that required states receiving funds to adopt "scientifically based" reading curricula aligned with the NRP's five pillars. For a brief period, federal money was tied to evidence.

What happened to Reading First stands as one of the most consequential policy collapses in education history, not because the program failed — though it ultimately did — but because the scandal surrounding it discredited the phonics movement for nearly a decade and created the political and cultural space for balanced literacy to fill.

A 2006 investigation by the Department of Education's Inspector General found that federal officials administering Reading First had directed states and school districts toward specific commercial reading programs in which those officials had undisclosed financial interests. The conflicts were extensive and well-documented. Regional technical assistance centers — funded by the federal government to help states implement Reading First — were in some cases run by organizations connected to the same publishers whose products were being recommended. The investigation documented what amounted to a pay-to-play structure embedded in a program ostensibly designed around scientific objectivity.

The program's congressional appropriation was eliminated in 2008. In the same year, an independent evaluation commissioned by the Department of Education found that Reading First had not improved students' reading comprehension, though it had improved decoding and fluency instruction. The combination — a corruption scandal and a mixed efficacy finding — was enough. Reading First died. And with it, for nearly a decade, died the political will to mandate phonics-based instruction at the federal or state level.

The vacuum left by Reading First's collapse was filled by what became known as balanced literacy — a pedagogy that positioned itself as a pragmatic synthesis of whole language and phonics but retained the theoretical architecture of the former while offering the latter as a supplement rather than a foundation. The dominant commercial expression of balanced literacy was developed at Columbia University's Teachers College, primarily by Lucy Calkins, whose Units of Study curriculum would become the most widely used reading program in American elementary education.

Part Four: The Billion-Dollar Mistake

To understand how a disproven reading method persisted in American classrooms for four decades, it is necessary to understand the commercial ecosystem that sustained it — and the scale at which that ecosystem operated.

Lucy Calkins was not a fringe figure. She was among the most influential reading educators in the history of American schooling, a Columbia University professor whose Teachers College Reading and Writing Project trained thousands of literacy coaches and curriculum directors who then disseminated her methods across the country. Her Units of Study curriculum was mandated across New York City's 1.1 million students in 2003, a decision that former NYC Schools Chancellor Joel Klein would later describe as one of his greatest regrets. At its peak, Calkins' curriculum was estimated to be in use in roughly one in four American elementary schools.

Her publisher was Heinemann, a Portsmouth, New Hampshire imprint that served as the primary revenue driver for its parent company, Houghton Mifflin Harcourt. APM Reports documented that Heinemann generated sales from Calkins' and related balanced literacy materials that grew on an unbroken streak from 2006 through 2019, reaching revenue that made the company one of the most profitable educational publishers in the country. Separate Fountas and Pinnell materials — a leveled reading system developed by Irene Fountas and Gay Su Pinnell, also published by Heinemann — added substantially to that total. APM Reports estimated Heinemann earned at least $1.6 billion over a decade from these lines.

The commercial structure was self-reinforcing in ways that made disruption extraordinarily difficult. A school district that adopted Calkins' curriculum purchased not just the core units but aligned professional development, classroom libraries organized by Fountas and Pinnell's leveling system, assessment tools, supplemental materials, and coaching contracts — all from the same ecosystem, all premised on the same theoretical framework. Teachers were trained in that framework during professional development sessions led by consultants who had been trained by the same program. Literacy coaches hired to support classroom implementation had often been trained at Teachers College. The curriculum, the training, the assessment, and the professional culture were all interlocked.

The three-cueing strategy that sat at the heart of this ecosystem — teaching children to use meaning, syntax, and visual cues (the first letter of a word) to guess unknown words rather than decode them phonetically — was not a minor methodological preference. It was a direct implementation of Goodman's guessing game theory, translated into a classroom technique that teachers were explicitly trained to use and to value. The strategy was documented, named, and taught. It was also, as the science of reading made clear, counterproductive — training children to engage in exactly the behavior that skilled readers do not engage in, and that struggling readers use as a compensatory strategy when decoding fails them.

The financial scale of the balanced literacy ecosystem created an enormous institutional interest in its perpetuation. Publishers had invested in curriculum development and production. Districts had invested in materials and professional development. Teachers had invested years in learning and internalizing the methods. Literacy coaches had built careers on them. The sunk costs — financial, professional, and psychological — ran into the billions, and the ecosystem's stakeholders were not passive in the face of challenges to its legitimacy.

Part Five: The Failure of Teacher Preparation

The commercial ecosystem produced a damaged curriculum. The teacher preparation pipeline ensured that the damage was installed in classrooms for decades, by waves of teachers who had been trained to deploy it.

A 2023 evaluation by the National Council on Teacher Quality examined teacher preparation programs across the country and found that only 23% adequately taught all five components of evidence-based reading instruction as identified by the National Reading Panel in 2000. That is, twenty-three years after the federal government published a definitive synthesis of reading research, fewer than one in four programs that train elementary teachers were teaching those teachers what the science established. Forty percent of programs were still explicitly teaching the three-cueing strategy that the science of reading identifies as counterproductive. Two out of three programs failed to adequately address phonemic awareness — the ability to hear and manipulate the sounds in spoken words, which is the foundational precondition for phonics instruction to work.

The implications are not theoretical. NCTQ estimates that if all elementary teachers used evidence-based reading instruction, the reading failure rate could fall from its current 20-30% range to 2-10% — meaning that somewhere between 700,000 and one million additional children could reach proficiency by fourth grade each year. The gap between what teacher preparation programs teach and what the science establishes is not a gap in resources. It is a gap in will — the same gap that has characterized this crisis at every level.

The problem is compounding in ways that legislative reform alone cannot address. Many of the teachers currently in classrooms were trained before any state passed science of reading legislation. They are being asked — now, in some cases under legal mandate — to unlearn methods they have practiced for years and that were validated by their professional preparation, their district's purchased curriculum, and their coaching support. The cognitive and professional dissonance involved is real. Education Week reported that even in states with strong science of reading laws, a 2022 survey found 61% of teachers still using three-cueing strategies. As one North Carolina principal put it: "Your philosophy on reading is as deep as religion." Laws change curricula. They change what is on the shelf. They do not automatically change what happens in the moment when a child is struggling with a word and a teacher reaches for the response she has practiced ten thousand times.

The Southern Regional Education Board found that teachers in Southern states — many of which have passed aggressive science of reading legislation — reported receiving, on average, fewer than three hours of professional development annually on early literacy instruction. Three hours is not sufficient to retrain a professional. It is sufficient to create awareness of a problem without providing the tools to address it.

Part Six: Compounding Factors — Screens, COVID, and Structural Inequality

Flawed instruction is the central driver of the literacy crisis, but it operates within a context of compounding factors that amplify its effects and determine who suffers most from them.

The decline of reading for pleasure represents a change in the ecosystem of literacy that structural reform cannot easily address. Reading volume — the sheer amount of time a child spends with text — is one of the strongest predictors of reading proficiency, and reading volume has been declining for decades as screens have competed for and won children's attention. Only 14% of American 13-year-olds reported reading for fun "almost every day" in 2023, compared to 27% in 2012. In the United Kingdom, the National Literacy Trust found that only 34.6% of children aged 8-18 said they enjoyed reading in 2024 — the lowest figure since the organization began tracking the question in 2005. A 2024 study published in Advanced Science used causal inference methods to demonstrate that increased screen time reduces reading time, which in turn measurably affects the cognitive development and brain architecture associated with reading comprehension.

The consequences are not uniform. Children from households with more books, more reading models, more engaged parental involvement in literacy, and more access to libraries can compensate for reduced in-school reading time in ways that children from less resourced environments cannot. The displacement of reading by screens is a crisis for all children, but it is a catastrophe specifically for children whose home environment does not provide the kind of rich language exposure that partially compensates for inadequate instruction.

COVID-19 school closures delivered what the NAEP described as the largest single-cycle reading score decline for 9-year-olds in 30 years — a 5-point drop that erased nearly all the gains accumulated since 1990. Schools that spent more time in remote instruction lost more ground. The losses were not distributed evenly. Low-income students lost roughly five months of reading progress during the pandemic period, compared to two to three months for their more affluent peers. This differential is consistent with what researchers call the "summer slide" — the well-documented phenomenon in which children from lower-income households lose more ground over extended breaks from school because they have less access to the compensatory literacy experiences that higher-income households provide. The pandemic was an extreme version of a pattern the research had established long before it.

The $189.5 billion in Emergency School Relief funds distributed to address pandemic learning loss has not produced measurable recovery. A Defense of Freedom Institute analysis found no statistically significant relationship between ESSER spending and NAEP score improvements. The reasons are complex — the money was broadly distributed, the implementation was uneven, tutoring programs were often poorly designed or underscaled — but the result is that four years after schools reopened, the pandemic reading losses have not been recovered, and in the lowest-performing quartile, the gap has actually widened.

Poverty remains the deepest structural determinant of reading outcomes, operating through mechanisms that interact with and amplify instructional failure at every level. Children from low-income families enter kindergarten an estimated 18 months behind their more affluent peers in vocabulary — a gap that reflects differences in the number of words heard, the complexity of language used, and the presence of books and other print in the home environment. More than half of families living in poverty have no children's books at home. These children then encounter instructional methods that require them to use context and meaning to compensate for decoding they have not been taught, and they lack the vocabulary breadth that makes meaning-based guessing even partially workable. They attend schools that receive less funding, are more likely to be staffed with less experienced teachers, and are more likely to use whichever curriculum was available rather than one selected on the basis of efficacy.

The funding inequity is structural and persistent. Approximately one-third of U.S. states provide less funding to high-poverty districts than to low-poverty ones. Districts serving the highest proportions of students of color receive on average $2,700 less per student in state and local funding than districts serving predominantly white students. The Learning Policy Institute has documented that a 10% increase in per-pupil spending sustained over 12 school years raises high school graduation rates by 7 percentage points for all students and by 10 percentage points for low-income students specifically. The inverse — reduced or inequitably distributed funding — produces predictable results in the same direction.

Part Seven: Who Bears the Weight

The literacy crisis is not an equal-opportunity failure. Its costs fall with precise and predictable disproportionality on children who are poor, non-white, male, learning English as a second language, or coping with learning differences — and these categories overlap substantially, creating compounding disadvantages that the data captures with uncomfortable clarity.

On the 2024 NAEP, White fourth-graders averaged a score of 226 while Hispanic students averaged 204 and Black students 202 — a 24-point White-Black gap and 22-point White-Hispanic gap that has persisted, in roughly this magnitude, for decades. Asian students led all groups at 243. Since 2011, the gap between Asian and Black students has widened from 31 points to 42 points. In the bottom quarter of all fourth-grade reading scores, more than 40% of students are Hispanic, approximately 25% are Black, over 67% are economically disadvantaged, approximately a third have a disability classification, and a quarter are English language learners. In the top quarter, 61% are White, 14% are Asian, and fewer than 5% are Black. These are not random distributions. They are the demographic signature of a system that has failed certain children more completely than others, and that has done so with the consistency that only structural causes produce.

The gender gap in reading is universal across every country that participates in international literacy assessments, and it is large. Girls outperformed boys by 10 points on the 2024 fourth-grade NAEP and by an average of 24 points across OECD countries on the 2022 PISA. Globally, for every 100 girls who achieve minimum reading proficiency, only 87 boys do — and in middle-income countries, that ratio falls to 72 boys per 100 girls. Boys are more likely to score below basic proficiency in every country assessed. They are more likely to be diagnosed with reading disabilities. They are less likely to read for pleasure. The explanations for this gap are contested — biological, developmental, and instructional factors have all been proposed — but the gap itself is not contested, and its consistency across every educational system in the developed world suggests causes that transcend any single country's policy choices.

English language learners face a 32-point reading gap relative to their non-ELL peers on NAEP — a gap that reflects the extraordinary difficulty of learning to decode in a language one is simultaneously learning to speak. ELL students constitute roughly a quarter of students in the bottom performance quartile on NAEP while making up fewer than 3% of the top quartile. The three-cueing system that asks children to use meaning and context to compensate for decoding limitations is particularly harmful for ELL students, who lack the English vocabulary and syntactic familiarity that make context-based guessing even partially viable.

Students with dyslexia occupy perhaps the most painful position in this story. Dyslexia is estimated to affect between 15% and 20% of the population — making it far more common than any other recognized learning disability — and it is fundamentally a difficulty with phonological processing, meaning that the specific skill systematic phonics instruction addresses directly. Decades of research, including work from the Yale Center for Dyslexia and Creativity, demonstrates that the vast majority of students with dyslexia can learn to read at grade level with explicit, systematic phonics instruction. Many cannot learn to read adequately without it. In a system that provided such instruction universally, most students with dyslexia would never be identified as having a reading disability because their reading would be adequate. Instead, the students who most needed the phonics instruction that whole language and balanced literacy withheld are the ones most likely to have been labeled, placed in special education, and tracked into futures their reading level, rather than their intelligence, determined.

Part Eight: A Reform Wave in Progress

The turning point came not from policymakers, not from the publishing industry, and not from university research departments. It came from a journalist.

In September 2018, APM Reports correspondent Emily Hanford published "Hard Words: Why Aren't Our Kids Being Taught to Read?" — a long-form audio documentary that asked, with unusual directness, why schools across America were using reading methods that the science had discredited. The piece reached audiences far beyond the education policy world. Parents recognized their children's struggles. Teachers recognized the methods being described. Principals recognized their curricula. In October 2022, Hanford's six-part podcast series "Sold a Story" was released and downloaded more than 3.5 million times — an extraordinary reach for education journalism — winning a duPont-Columbia Award and triggering the most significant legislative response to reading instruction in American history.

By late 2024, 40 states and the District of Columbia had passed laws or policies requiring evidence-based reading instruction, the vast majority of them enacted after 2019. These laws vary in comprehensiveness but share a common architecture: mandatory systematic phonics instruction, prohibition or restriction of three-cueing strategies, universal K-3 literacy screening, teacher preparation reform requirements, and in many cases third-grade retention policies for students who cannot read at grade level. The legislative wave has been strikingly bipartisan — co-sponsored by Democrats and Republicans in at least 14 states, reflecting a rare convergence of political interest around an issue with obvious, measurable consequences for children across the political spectrum.

At the curriculum level, the reckoning has been swift and in some cases dramatic. New York City, which had mandated Calkins' curriculum across 1.1 million students for two decades, abandoned it entirely in May 2023, requiring all 700 elementary schools to adopt one of three approved phonics-based curricula. In September 2023, Columbia University's Teachers College formally dissolved the Reading and Writing Project that Calkins had led for 40 years — an institutional acknowledgment, however reluctant, that the program's foundational approach had not served the children it was designed to help. Heinemann's sales fell approximately 75% from their 2019 peak. In December 2024, a class-action lawsuit was filed against Calkins, Fountas and Pinnell, Heinemann, and Teachers College, alleging deceptive marketing of curricula the defendants knew did not produce the outcomes they promised.

The most compelling evidence that reform works comes from a state that almost no one would have predicted. Mississippi, which ranked 49th in the nation in fourth-grade reading as recently as 2013, is now — when adjusted for student demographics — the top-performing state in the country on NAEP fourth-grade reading. Black students in Mississippi outperform Black students in 47 other states. The transformation took approximately six years and cost roughly $15 million per year — $32 per student annually.

The Mississippi model is worth examining in detail because it was comprehensive rather than partial, and because its comprehensiveness appears to be the source of its success. The Literacy-Based Promotion Act of 2013 combined several mutually reinforcing elements: mandatory teacher training in evidence-based reading instruction, placement of literacy coaches in schools, a requirement that teachers use curricula aligned with the science of reading, universal K-3 literacy screening to identify struggling readers early, a third-grade retention policy for students who had not reached reading proficiency, and sustained administrative accountability for implementation. No single element was sufficient alone — states that adopted only one or two of these elements saw little improvement. Mississippi adopted all of them and sustained them across multiple governors and administrations. The result was not a miracle. It was what evidence-based, comprehensively implemented, sustained instruction produces.

England's experience adds an international data point. The country mandated systematic synthetic phonics in 2007 and introduced a universal Year 1 Phonics Screening Check in 2012, which tracks every student's phonics decoding skills at the end of first grade. Pass rates on the screening rose from 58% in 2012 to approximately 80% by the early 2020s. England was one of only six education systems whose PIRLS fourth-grade reading scores did not decline between 2016 and 2021 — a distinction that places it in stark contrast with the global trend and directly correlates with its instructional commitment.

Part Nine: The Resistance and What Drives It

Despite the scientific consensus, the legislative momentum, and the evidence from Mississippi and England, the reform movement faces resistance that is organized, well-resourced, and deeply embedded in the institutions responsible for implementing change.

Teachers' unions have been the most visible opponents of science of reading legislation, framing curriculum mandates as assaults on teacher professionalism and autonomy. The California Teachers Association was instrumental in killing AB 2222, the state's science of reading bill, arguing that mandating specific instructional approaches violated teachers' professional judgment. The Massachusetts Teachers Association opposed literacy reform legislation that ultimately passed the state House 155 to 0. The Australian Education Union attacked Victoria's 2024 phonics mandate as disrespecting teachers' expertise. The union argument is not without any merit — mandates imposed without adequate professional development, without coaching support, and without acknowledgment of the genuine complexity of teaching can fail for reasons unrelated to the quality of the underlying approach. But the pattern of opposition to science of reading legislation specifically, regardless of its implementation provisions, suggests something beyond reasonable concerns about professional development. It suggests ideological resistance to the acknowledgment that what teachers have been doing has not been working.

The financial resistance is less publicly visible but more structurally significant. The reading curriculum market is estimated at approximately $6 billion annually, and the transition from balanced literacy to evidence-based curricula represents a complete replacement of the products, professional development programs, classroom libraries, and assessment systems that publishers have built and that districts have purchased. The companies that profited from the old order — and their investor owners — have significant financial interests in slowing the transition. Heinemann's experience illustrates the stakes: a 75% revenue decline from peak to 2024 represents the commercial consequence of a paradigm shift, and it was preceded by years of lobbying, marketing, and institutional defense of an approach the company's own authors were simultaneously revising in response to public pressure.

Academic resistance persists in education schools, where constructivist philosophy — the theoretical tradition that holds that children construct knowledge through experience rather than receiving it through direct instruction — has shaped the culture for decades. A meaningful number of education school faculty continue to publish papers questioning the scientific consensus on phonics, arguing variously that the research has been misinterpreted, that the political context of reading debates renders the science suspect, or that the complexity of reading instruction resists the kind of prescriptive recommendations the science of reading movement makes. These arguments have not gained traction in the neuroscience and cognitive psychology literature where reading research actually lives — but they have provided professional cover for education schools that have been slow to change their preparation programs.

The structural fragmentation of American public education may be the most durable barrier. The United States has approximately 13,000 school districts with significant local control over curriculum decisions. Even when states pass comprehensive reform legislation, implementation varies enormously by district, by school, and by classroom. A 2023 analysis by researchers at Michigan State University found that while 41 states had early literacy policies on the books, only 12 could be considered truly comprehensive — and only those 12 showed measurable gains in student reading outcomes. The law on paper and the instruction in the classroom are connected by a chain of implementation that runs through state education agencies, district curriculum directors, building principals, literacy coaches, and finally individual teachers, each of whom brings their own training, beliefs, and habits to the question of how to respond when a child struggles with a word.

Part Ten: What Policy Would Fix

The policy solutions to the literacy crisis are not mysterious. They are documented, available, and in most cases already written into draft legislation that has not passed or regulations that have not been enforced.

Universal implementation of science of reading instruction — explicit, systematic phonics as the foundation of reading instruction in every kindergarten through third-grade classroom in the country — would, by NCTQ's estimates, reduce the reading failure rate from its current 20-30% to 2-10%. The instruction is not expensive. Effective phonics curricula cost no more than ineffective ones. What costs money is the professional development required to retrain the existing teaching workforce, and the coaching support required to sustain the change once training is complete. Mississippi spent $32 per student per year to accomplish this at scale. The question is not whether the country can afford this. It is whether the institutions responsible for implementing it will choose to.

Comprehensive reform of teacher preparation programs would address the crisis at its source rather than chasing it downstream. If every teacher entering an elementary classroom was trained in evidence-based reading instruction before their first day, the requirement for mass retraining of the existing workforce would diminish within a decade. The National Council on Teacher Quality has developed specific standards for what adequate reading preparation looks like, and a growing number of states are now requiring teacher preparation programs to meet those standards as a condition of accreditation. Progress is measurable. In 2020, 27% of programs covered all five essential reading components adequately. By 2023, that figure had risen to roughly 50%. The trajectory is in the right direction. The speed is not adequate for the urgency.

Universal K-3 literacy screening — a brief, validated assessment that identifies struggling readers at the earliest possible point, before reading failure becomes entrenched — is perhaps the single highest-leverage investment available to any school system. Mississippi, Louisiana, and Arkansas have made universal screening mandatory and have paired it with tiered intervention systems that provide additional instruction to identified students. The earlier a reading difficulty is caught, the less severe the intervention required and the higher the probability of full remediation. Children who receive appropriate intervention in kindergarten require significantly less support than children identified in third grade, and essentially none of the intervention required by children who are not identified until middle school — at which point the probability of ever catching up fully drops substantially.

High-dosage tutoring — intensive, one-on-one or small-group reading instruction delivered by trained tutors outside of regular classroom time — has produced the most consistent evidence of learning recovery of any intervention studied in the pandemic era. An analysis of 96 rigorous studies found significant improvement in 87% of tutoring programs, equivalent to approximately half a year of additional learning. Tennessee has embedded literacy tutoring into its state funding formula at $500 per fourth-grader annually. Louisiana has paired mandatory K-5 tutoring with a $30 million annual appropriation. The ESSER-funded tutoring programs that proved most effective were those that operated at high dosage — three or more sessions per week — with trained tutors and structured, evidence-based materials. Most of those programs are now in jeopardy as ESSER funds have expired and have not been replaced.

The funding equity problem that concentrates reading failure in high-poverty districts will not be solved by instructional reform alone. The documented relationship between per-pupil funding and student outcomes — particularly for low-income students — is robust enough that states cannot achieve equitable reading outcomes while maintaining funding systems that systematically under-resource the schools serving the children most at risk.

Conclusion: The Inheritance

The literacy crisis is a present condition, not a future risk. It is a condition distributed unevenly across race, income, gender, and geography, but expanding its reach in all directions. The NAEP scores that document the current state of reading achievement in America are not projections. They are measurements of what is happening now, in classrooms that exist, to children who have one chance at a literate childhood.

What distinguishes this crisis from many others is not its complexity. It is the completeness with which its structural causes have been identified, the availability of documented interventions, and the persistence of the gap between what is known and what is done. The science of reading has been settled for longer than many of the teachers currently in classrooms have been alive. The National Reading Panel published its findings in 2000. The Mississippi model demonstrated at scale that comprehensive implementation works. England's phonics mandate demonstrated that a national commitment to systematic decoding instruction produces measurably better readers.

Each generation experiences the failure of this system differently but is shaped by it regardless. Baby Boomers were educated in an era when phonics instruction, however imperfect, was more common in American classrooms than it would be by the time their children entered school. Gen X absorbed the first wave of whole language, entering classrooms in the late 1970s and 1980s as the movement reached its peak — many of them readers whose school experience gave them less explicit instruction than their parents received. Millennials were educated at the height of balanced literacy's dominance, in classrooms where Lucy Calkins' methods were at their widest distribution. Gen Z is bearing the weight of all of it — the worst NAEP scores in a generation, the steepest pandemic losses, the widening gaps, and the knowledge that a reform movement is finally underway that may not move fast enough to reach them before they age out of the window when reading remediation is most effective.

Jaylen, whose composite story opened this report, is not an abstraction. He is the demographic reality of a system that made specific, documentable choices — about what to teach, how to train teachers, which curricula to mandate, whose research to honor and whose to ignore — and distributed the consequences of those choices in ways that were entirely predictable from the data available at the time the choices were made. His reading failure was not inevitable. It was constructed, one institutional decision at a time, over fifty years.

The system was not broken by neglect alone. It was shaped by choices — ideological, financial, and political. It will only be repaired by different ones. The children currently in kindergarten do not have the luxury of waiting for the institutions responsible for their education to reach consensus on a question the science answered in 2000.

What the research establishes, with the clarity that fifty years of accumulated evidence produces, is this: the difference between a child who learns to read and one who does not is not primarily about intelligence, not primarily about poverty, and not primarily about family. It is primarily about instruction. We know what instruction works. We have known for a long time. The question that remains — the one that will define the educational legacy of the current generation of policymakers, administrators, and teachers — is whether we will choose to use it.

Share Your Voice

Join the conversation to share your thoughts and help others understand this topic better.

Join the Conversation

Community Feedback

No comments yet. Be the first to share your thoughts!