How unique /fresh is the market data? Can you precompute it and cache push to a cdn for retrieval reducing the LLM calls ?
You could also do a rolling cost window circuit breaker type thing
If you’re worried about a botnet golang can do some magic with multiple concurrent requests for the same resource you can pull one payload and serve it back to everyone
{no channel}