Microsoft trained its MAI models using unlicensed web data like Common Crawl. This contradicts company claims that the models relied solely on clean, commercially licensed datasets. The company now relies on fair use arguments. This creates a legal risk for enterprise customers who expected a fully compliant training pipeline for their deployments.