A new exploratory project uses black-box LLM autoraters to generate 10-20 qualitative features from model transcripts. Researchers split data into user turns, thoughts, and responses to identify surprising behavioral correlations. This method helps AI Alignment practitioners find novel behaviors in deployment distributions. It offers a scalable alternative to manual qualitative analysis for model safety.