A new OCR model uses 10 million synthetic images to recognize text across multiple languages. Hugging Face researchers trained the system to handle diverse fonts and layouts without relying on scarce manual labels. This approach cuts data collection costs. Practitioners can now deploy faster, more accurate text extraction for non-English documents.