Apple developed SpecMD to benchmark how expert caching policies interact with different hardware configurations in Mixture-of-Experts models. The framework addresses a gap in understanding how sparse parameter activation translates to actual inference speed. It provides a standardized way to test ad-hoc cache policies. This helps researchers optimize memory throughput for large-scale sparse models.