An 8B parameter model called iLLaDA uses diffusion instead of traditional autoregressive generation to produce text. Developed by ByteDance and Renmin University, it matches Qwen2.5 in base performance. However, the model loses its edge after fine-tuning. This experiment tests whether diffusion architectures can realistically replace standard LLM token prediction for general tasks.