The SpecMD framework standardizes how developers benchmark expert caching policies for Mixture-of-Experts (MoE) models. It tests how different hardware configurations interact with sparse parameter activation, with the goal of reducing inference latency. This helps researchers optimize memory movement and lets practitioners quantify the performance gains of specific caching strategies across hardware targets.
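
To make the idea of benchmarking an expert caching policy concrete, here is a minimal sketch in Python. It is not SpecMD's API: the function names `lru_hit_rate` and `synthetic_trace`, the choice of LRU as the policy, and the skewed synthetic routing trace are all assumptions made for illustration. The sketch replays a sequence of expert IDs through a fixed-capacity cache and reports the fraction of activations served from the cache.

```python
from collections import OrderedDict
import random


def lru_hit_rate(trace, capacity):
    """Replay expert IDs through an LRU cache and return the hit fraction.

    Hypothetical helper for illustration; not part of SpecMD.
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    cache = OrderedDict()  # keys are expert IDs, ordered oldest to newest
    hits = 0
    for expert_id in trace:
        if expert_id in cache:
            hits += 1
            cache.move_to_end(expert_id)  # mark as most recently used
        else:
            if len(cache) >= capacity:
                cache.popitem(last=False)  # evict the least recently used expert
            cache[expert_id] = True
    return hits / len(trace) if trace else 0.0


def synthetic_trace(num_experts, length, seed=0):
    """Generate a skewed routing trace (assumed distribution, not real model data)."""
    rng = random.Random(seed)
    # Lower-numbered experts are chosen more often, mimicking uneven routing.
    weights = [1.0 / (rank + 1) for rank in range(num_experts)]
    return rng.choices(range(num_experts), weights=weights, k=length)


if __name__ == "__main__":
    trace = synthetic_trace(num_experts=16, length=10_000)
    for capacity in (2, 4, 8):
        print(f"capacity={capacity}: hit rate={lru_hit_rate(trace, capacity):.3f}")
```

Hit rate is only a proxy here. Turning it into a latency estimate would also require the cost of fetching a missed expert on a given hardware target, which this sketch does not model.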