A researcher at the Inkhaven Residency is reviewing their early work on preference learning. The paper, "The Assistive Multi-Armed Bandit," examines how human rationality assumptions fail when users lack prior preference knowledge. This critique highlights the gap between theoretical models and actual human behavior. It offers a cautionary lesson for those designing preference learning systems.