Solid post connecting the anecdotal frustration to actual infrastructure realities. The context degradation after 100+ exchanges is real, I've hit that wall building multi-step workflows where Claude gradually loses track of the original schema. What's interesting is the load-based routing angle, if providers are silently downgrading requests to cheaper models under load, that's effectively a form of throttlign even if they deny it. The research agent confidently fabricating nonexistence is classic hallucination behavior tho.
Solid post connecting the anecdotal frustration to actual infrastructure realities. The context degradation after 100+ exchanges is real, I've hit that wall building multi-step workflows where Claude gradually loses track of the original schema. What's interesting is the load-based routing angle, if providers are silently downgrading requests to cheaper models under load, that's effectively a form of throttlign even if they deny it. The research agent confidently fabricating nonexistence is classic hallucination behavior tho.
Wait till you see Wed’s follow up on “laziness”