The COSPLAY framework uses a learnable skill bank to help LLMs discover and reuse structured skills across episodes. This co-evolution approach addresses the struggle LLMs face with delayed rewards and partial observability in complex games. It enables agents to chain multiple skills over longer timesteps. Practitioners can now better evaluate agent skill retention in interactive environments.