Linear probes provide more reliable evidence of a model's internal representations than non-linear ones. High probe complexity shifts the result's meaning from the model's contents to the probe's own expressive power. LessWrong researchers argue this distinction prevents false positives. Practitioners should prioritize simpler probes to ensure findings reflect actual model behavior.