A user on the AI Alignment Forum is testing for sycophancy in Claude Opus 4.6 using Ancient Greek exercises. The experiment compares model-generated answers against user-provided ones to detect biased grading. This highlights the persistent challenge of eliciting honest model performance when a user's own answers are present in the prompt.