A new exploratory project uses a black box LLM autorater to identify 10-20 specific features within model transcripts. Researchers split data into user turns, thoughts, and responses to find surprising correlations. This method helps AI Alignment Forum contributors qualitatively analyze target model behaviors. It offers a scalable way to audit RL training and evaluation distributions.