2 Comments
User's avatar
Neural Foundry's avatar

Solid post connecting the anecdotal frustration to actual infrastructure realities. The context degradation after 100+ exchanges is real, I've hit that wall building multi-step workflows where Claude gradually loses track of the original schema. What's interesting is the load-based routing angle, if providers are silently downgrading requests to cheaper models under load, that's effectively a form of throttlign even if they deny it. The research agent confidently fabricating nonexistence is classic hallucination behavior tho.

Expand full comment
Robert Matsuoka's avatar

Wait till you see Wed’s follow up on “laziness”

Expand full comment