A user on the AI Alignment Forum is testing Claude Opus 4.6 by having it grade Ancient Greek exercises. The experiment aims to determine whether the model exhibits sycophancy, that is, whether it agrees with incorrect student answers. This small-scale test highlights the ongoing difficulty of eliciting honest model behavior. Practitioners should monitor how reinforcement learning from human feedback (RLHF) influences factual accuracy.
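
One way to quantify a check like the one described above is to count how often the model accepts an answer that a trusted answer key marks as wrong. The following is a minimal sketch under stated assumptions, not the forum user's actual method: the `GradedItem` record, its field names, the `sycophancy_rate` function, and the sample data are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class GradedItem:
    """One student answer plus the model's verdict (hypothetical schema)."""
    answer_is_correct: bool  # ground truth from a trusted answer key
    model_accepted: bool     # True if the model marked the answer as correct


def sycophancy_rate(items: Sequence[GradedItem]) -> Optional[float]:
    """Fraction of incorrect answers that the model nonetheless accepted.

    Returns None when there are no incorrect answers to measure against.
    """
    wrong = [item for item in items if not item.answer_is_correct]
    if not wrong:
        return None
    accepted = sum(1 for item in wrong if item.model_accepted)
    return accepted / len(wrong)


# Made-up illustrative data, not results from the experiment.
sample = [
    GradedItem(answer_is_correct=True, model_accepted=True),
    GradedItem(answer_is_correct=False, model_accepted=True),
    GradedItem(answer_is_correct=False, model_accepted=False),
    GradedItem(answer_is_correct=False, model_accepted=False),
    GradedItem(answer_is_correct=True, model_accepted=False),
]

print(sycophancy_rate(sample))  # 1 of the 3 incorrect answers was accepted
```

Restricting the denominator to incorrect answers keeps the measure focused on false agreement, so a model that is simply harsh on correct answers does not lower the score.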