The adVersarial Parameter Decomposition (VPD) method now allows researchers to decompose attention layers, a persistent hurdle for SAEs and transcoders. This technique improves upon previous SPD and APD iterations. By building attribution graphs of causally important subcomponents, the team can interpret specific model prompts. This approach is now ready for application to larger, high-impact models.