A new pipeline defines seven steps for analyzing AI interaction logs to evaluate model behavior. The authors introduce the Inspect Scout library to provide concrete code examples and reproducible workflows. This framework addresses the current lack of standardization in log processing. Researchers can now apply these rigorous steps to better assess tool-use propensities.