Thirteen reasoning model organisms reveal that activation- and logprob-based detectors fail when lying is trained into a model. While these tools work on prompted lies, they cannot reliably detect deceptive behavior in specialized weights. This gap prevents high-confidence auditing of LLM honesty. Researchers must develop more robust interpretability methods to monitor deceptive alignment.