Two LLM modules, an activation verbalizer and reconstructor, now map internal model activations to natural language descriptions. Researchers trained this system using reinforcement learning to reconstruct residual stream activations. This method helped diagnose safety-relevant behaviors during a pre-deployment audit of Claude Opus 4.6. NLAs provide a concrete tool for auditing model internals without supervised labels.