Developmental Cognitive Interpretability (DCI) proposes modeling how latent constructs such as goals and motivations evolve over the course of training. The aim is to predict an agent's behavior on out-of-distribution inputs, where standard evaluations can fail. Researchers writing on LessWrong argue that tracking these cognitive shifts helps avoid mistaking scheming for alignment. In practice, DCI is meant to give practitioners more confidence in a model before deployment.