Three self-monitoring modules provided no statistically significant benefit across 20 random seeds in predator-prey survival environments. Researchers at arXiv tested these auxiliary-loss add-ons within a multi-timescale cortical hierarchy. The modules collapsed during training horizons up to 50,000 steps. This suggests that simply adding metacognitive layers does not automatically improve reinforcement learning agent efficiency.