The new AI IQ site scores models like ChatGPT and Claude using human intelligence metrics. This approach sparks debate over whether human cognitive tests accurately measure machine reasoning. The results offer a simplified benchmark for users. However, practitioners should treat these scores as anecdotal rather than rigorous technical evaluations of model capabilities.