Five large language models, including ChatGPT and Claude, scored picture description tasks with accuracy comparable to that of trained human raters. Researchers tested the models on responses from patients with dementia and from cognitively normal participants. Automating the scoring removes rater variability and reduces the time required for neuropsychological assessments, which in turn enables larger, multi-site research studies of cognitive impairment.