Indigenous communities in New Zealand are developing a te reo Māori text-to-speech model to counter data scraping by OpenAI and Anthropic. These Big Tech firms ingested community-produced audio without permission to train their LLMs. The new project prioritizes data sovereignty. It ensures linguistic control remains with the speakers rather than corporate entities.