Six major LLMs, including ChatGPT and Claude, failed to provide a consistent pilot hole size for #8 screws in particleboard. The author tested these models against a specific DIY query to highlight factual reliability gaps. This failure demonstrates that LLMs still struggle with niche, concrete technical specifications where training data is sparse.