A new filtering system targets "benchmaxxing" on the Open ASR Leaderboard to prevent artificial score inflation. Hugging Face now flags models that overfit to specific test sets rather than demonstrating general speech recognition. This change forces researchers to prioritize robust generalization over leaderboard climbing. Practitioners can now trust benchmark rankings more reliably.