The University of Maryland found that OpenAI's latest reasoning model shows little improvement on law school exams compared to previous versions. This stagnation suggests that raw scaling fails to solve complex legal reasoning. Practitioners should treat these tools as basic assistants rather than reliable legal analysts. The results highlight a persistent ceiling in specialized domain performance.