The Coding Agent Economy.
• $92 avg cost per active user / month
• Claude powers 92% of all coding agent spend (up from 68%)
• Cache hit rates jumped 52% → 86%
requesty.ai/coding-agent-eco…
The throughput density data suggests something counterintuitive:
the highest throughput providers are not necessarily serving the largest requests.
They are serving a massive number of relatively small generations extremely efficiently.
A lot of AI infrastructure…
The surprising thing in the latency data is how compressed the top providers have become.
For a lot of workloads, the gap between “fast” and “slow” providers is now smaller than the variance introduced by tool calls, long context, and agentic execution itself.
Model latency…
Most AI teams have zero control over which models employees and agents can actually use.
Today we’re launching Approved Models + Access Lists in Requesty.
You can now:
• approve models org-wide
• restrict models by API key or group
• enforce regional/compliance…
Discuss this model
Add corrections, implementation notes, pricing changes, or usage caveats for other readers.