A new exploratory project uses black-box LLM autoraters to extract 10-20 qualitative features from model transcripts. Researchers split data into user turns, thoughts, and responses to isolate specific triggers. This method identifies novel behaviors and correlations within AI Alignment Forum datasets. It provides a scalable way for practitioners to audit target model distributions.