Computational Model of Reading Development

Data Scientist · 2023 · 2 min read · Updated Feb 7, 2026

Built a connectionist neural network that simulates how children learn to read, achieving 97% accuracy in phonological production and generalizing to novel non-words.

Python · Connectionist Neural Networks · PostgreSQL

Overview

Developed a temporal connectionist model that learns to map orthography (written words) to phonology (speech sounds), serving as a 'digital twin' of the human reading system to explore cognitive mechanisms underlying reading development.

Problem

Understanding how children learn to read—and why some struggle—requires computational models that can capture the temporal dynamics of word recognition. Existing models were limited in handling words of varying lengths and generalizing to unseen inputs.

Approach

Designed a connectionist model with temporal processing mechanisms that learns orthography-to-phonology mappings from a large word corpus. Trained the model to produce phonological output patterns from printed input, then evaluated on both seen words and novel non-words to test generalization.
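As a rough illustration of what an orthography-to-phonology training pair might look like, the sketch below encodes each letter as a one-hot vector (one per time step) and each pronunciation as a list of phoneme indices. The tiny phoneme inventory, the two example words, and all function names are assumptions made for this example; the project's actual representations are not described here.

```python
import string

LETTERS = string.ascii_lowercase  # assumed 26-letter orthographic inventory
# Toy phoneme inventory for illustration; a real model would use a full set.
PHONEMES = ["k", "ae", "t", "d", "o", "g"]


def one_hot(index, size):
    """Return a vector of zeros with a single 1.0 at `index`."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def encode_orthography(word):
    """One one-hot letter vector per time step, so length follows the word."""
    return [one_hot(LETTERS.index(ch), len(LETTERS)) for ch in word.lower()]


def encode_phonology(phonemes):
    """Map phoneme symbols to integer indices into PHONEMES."""
    return [PHONEMES.index(p) for p in phonemes]


# Hypothetical word/pronunciation pairs standing in for a large corpus.
pairs = [("cat", ["k", "ae", "t"]), ("dog", ["d", "o", "g"])]
dataset = [(encode_orthography(w), encode_phonology(p)) for w, p in pairs]
```

Because the input is a sequence rather than a fixed-width slot vector, the same encoding applies to words of any length.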

Constraints

Model must handle words of arbitrary length, not just fixed-size inputs
Model must generalize to non-words, showing that it has learned spelling-to-sound regularities rather than memorized the training words
Architecture must capture the temporal processing dynamics of real reading behavior

Key Decisions

Temporal processing architecture over static feed-forward network

Reading is inherently sequential—letters are processed over time. A temporal architecture better captures the dynamics of how the reading system activates phonological representations.
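To make the idea of processing letters over time concrete, here is a minimal sketch of a simple recurrent (Elman-style) forward pass. It is a stand-in only: the architecture actually used in this project is not specified, and the layer sizes, weight scale, and function names below are hypothetical. The point is that a hidden state carried across time steps lets one set of weights handle words of any length.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weights; the scale is an arbitrary choice for the sketch."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def run_network(letter_vectors, w_in, w_rec, w_out):
    """Process one letter vector per time step, carrying hidden state forward."""
    hidden = [0.0] * len(w_rec)
    outputs = []
    for x in letter_vectors:
        pre = [a + b for a, b in zip(matvec(w_in, x), matvec(w_rec, hidden))]
        hidden = [math.tanh(p) for p in pre]
        # Sigmoid output units: one graded activation per phonological unit.
        outputs.append([1.0 / (1.0 + math.exp(-z)) for z in matvec(w_out, hidden)])
    return outputs


def one_hot(i, n):
    return [1.0 if j == i else 0.0 for j in range(n)]


rng = random.Random(0)
n_letters, n_hidden, n_phon = 26, 8, 6  # hypothetical layer sizes
w_in = make_matrix(n_hidden, n_letters, rng)
w_rec = make_matrix(n_hidden, n_hidden, rng)
w_out = make_matrix(n_phon, n_hidden, rng)

# The same untrained weights accept a two-letter and a six-letter input.
short = run_network([one_hot(2, 26), one_hot(0, 26)], w_in, w_rec, w_out)
longer = run_network([one_hot(i, 26) for i in (18, 19, 17, 4, 4, 19)], w_in, w_rec, w_out)
```

A static feed-forward network would instead need a fixed number of letter slots, which is the limitation this decision avoids.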

Alternatives: Static feed-forward neural network · Recurrent sequence-to-sequence model

Connectionist framework over symbolic rule-based model

Connectionist models naturally capture graded effects like word frequency and spelling-sound consistency, which are well-documented in behavioral reading research.
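One way a connectionist model can come to show graded frequency effects is by letting frequent words contribute more to learning, for example by scaling each word's error with a log-compressed function of its corpus count. Whether this project used that scheme is not stated; the sketch below is an assumption, and the word counts are made up.

```python
import math


def frequency_weight(count, max_count):
    """Log-compressed weight in (0, 1]; the compression is an assumed choice."""
    return math.log(count + 1) / math.log(max_count + 1)


def weighted_squared_error(output, target, weight):
    """Squared error over output units, scaled by the word's frequency weight."""
    return weight * sum((o - t) ** 2 for o, t in zip(output, target))


counts = {"the": 50000, "cat": 300, "yacht": 4}  # made-up corpus counts
max_count = max(counts.values())
weights = {word: frequency_weight(c, max_count) for word, c in counts.items()}
```

With weights like these, a rare word such as "yacht" still influences training, but less than a common one, so differences in performance across the frequency range emerge gradually rather than as an all-or-nothing rule.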

Result & Impact

97% phonological accuracy

Successfully captured key behavioral effects including word frequency and spelling-sound consistency influences. Presented at the 2024 Society for the Scientific Study of Reading Annual Meeting in Copenhagen.
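For readers wondering how a figure like phonological accuracy can be computed, the sketch below decodes the most active phoneme unit at each time step and counts a word as correct only if its whole phoneme sequence matches the target. This is one plausible scoring rule, not necessarily the one behind the 97% result; the phoneme labels and activations are illustrative.

```python
def decode(output_steps, phonemes):
    """Pick the most active phoneme unit at each time step."""
    return [
        phonemes[max(range(len(step)), key=step.__getitem__)]
        for step in output_steps
    ]


def word_accuracy(predictions, targets):
    """Fraction of words whose whole phoneme sequence matches the target."""
    correct = sum(1 for p, t in zip(predictions, targets) if p == t)
    return correct / len(targets)


phonemes = ["k", "ae", "t"]  # toy inventory
outputs = [[0.9, 0.05, 0.05], [0.1, 0.8, 0.1], [0.2, 0.1, 0.7]]
decoded = decode(outputs, phonemes)

predictions = [["k", "ae", "t"], ["k", "ae", "k"]]
targets = [["k", "ae", "t"], ["k", "ae", "t"]]
score = word_accuracy(predictions, targets)
```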

Learnings

Temporal dynamics are critical for modeling reading—static snapshots miss important processing characteristics.
A model that reads non-words convincingly has likely learned generalizable orthographic-phonological rules, not just memorized training examples.
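Scoring non-words takes some care, because a non-word has no single correct pronunciation. A minimal sketch, assuming each non-word comes with a hand-built list of acceptable pronunciations and is confirmed to be absent from the training set, might look like this. The items and phoneme symbols are illustrative, not taken from the project.

```python
def is_novel(item, training_words):
    """A test item only probes generalization if it was never trained on."""
    return item not in training_words


def nonword_correct(predicted, acceptable):
    """A non-word counts as correct if it matches any plausible pronunciation."""
    return tuple(predicted) in {tuple(a) for a in acceptable}


training_words = {"cat", "save", "have"}
nonwords = {
    # "mave" could rhyme with "save" or with "have".
    "mave": [["m", "ey", "v"], ["m", "ae", "v"]],
}

checks = {
    "novel": is_novel("mave", training_words),
    "rhymes_with_have": nonword_correct(["m", "ae", "v"], nonwords["mave"]),
    "implausible": nonword_correct(["m", "iy", "v"], nonwords["mave"]),
}
```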
