A new exploratory project uses a black box LLM autorater to identify 10-20 key features within model transcripts. The system splits data into user turns, thoughts, and responses to isolate specific behaviors. AI Alignment Forum researchers aim to uncover surprising correlations in deployment distributions. This method helps practitioners qualitatively audit target models for hidden risks.