Developers on Lobste.rs are testing how AI agents process complex documentation and long-form text. The discussion highlights persistent failures in long-context retrieval and logical reasoning. These gaps suggest that current LLM agents still struggle with deep reading, that is, locating specific details in a long document and reasoning correctly over them. Practitioners should expect continued reliability issues when deploying agents for autonomous technical research.