The adVersarial Parameter Decomposition (VPD) method successfully decomposes attention layers, a known blind spot for SAEs and transcoders. Researchers used the tool to build attribution graphs of a small language model. This approach is now ready for scale. It provides a concrete path toward interpreting the internal weights of larger models.