Pando is a new benchmark of more than 720 fine-tuned LLMs, designed to test how well interpretability methods identify known decision rules. Gradient-based techniques outperformed black-box baselines, while non-gradient methods struggled to remain faithful. The benchmark gives practitioners a controlled environment for validating whether their transparency tools actually reveal a model's internal logic.