A new OCR model trained on 10 million synthetic images enables fast, multilingual text recognition. Hugging Face leveraged synthetic data to bridge the gap in low-resource language datasets. This approach reduces reliance on expensive manual labeling. Developers can now deploy lightweight vision models that maintain high accuracy across diverse scripts and languages.