Google released a study showing that 70 % of LLMs exhibit consistent behavioral dispositions across prompts. The research compares model outputs to a behavioral taxonomy and finds that alignment improves when training data includes diverse scenarios. Practitioners can use these metrics to benchmark safety and predictability in downstream applications for developers.