Apple researchers introduced MixAtlas to optimize data mixtures for multimodal LLM midtraining. The framework uses systematic domain decomposition and small proxy models to determine ideal data weights. This approach improves sample efficiency over traditional recipes that tune only by format or task. Practitioners can now refine training sets with less compute.