The RealChart2Code benchmark shows top proprietary models lose nearly 50% of their performance when processing complex visualizations. These models fail as datasets grow more intricate. This gap exposes a critical weakness in current multimodal reasoning. Practitioners should avoid relying on LLMs for high-fidelity data extraction from sophisticated charts.