What algorithm do you use for rate limits?
If I hit a rate limit, how long is the cooldown period?
Because tokens refill continuously, there’s no fixed cooldown period. As soon as a single token is refilled, you can make another request. The wait time depends on how many tokens you’ve used and on the refill rate: it could be just milliseconds if you’re slightly over the limit, or longer if you’ve exhausted your entire token bucket.
If you have a 600 requests/minute limit:
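Assuming the refill is spread evenly across the minute, a 600 requests/minute limit works out to 10 tokens per second, or one new token every 100 ms. The sketch below illustrates how the wait time follows from that. It is not the service's actual implementation: the `TokenBucket` class, the `bucket_for_per_minute_limit` helper, the bucket capacity (assumed equal to the per-minute limit), and the full starting bucket are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class TokenBucket:
    """Illustrative token bucket with continuous refill (hypothetical)."""
    capacity: float           # maximum tokens the bucket can hold (assumed)
    refill_rate: float        # tokens added per second
    tokens: float             # tokens currently available
    last_refill: float = 0.0  # time of the last update, in seconds

    def _refill(self, now: float) -> None:
        # Add tokens for the time elapsed since the last update,
        # never exceeding the bucket capacity.
        elapsed = max(0.0, now - self.last_refill)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.last_refill = now

    def try_acquire(self, now: float) -> bool:
        # Spend one token if at least one whole token is available.
        self._refill(now)
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

    def wait_time(self, now: float) -> float:
        # Seconds until one whole token is available (0.0 if one already is).
        self._refill(now)
        missing = max(0.0, 1.0 - self.tokens)
        return missing / self.refill_rate


def bucket_for_per_minute_limit(requests_per_minute: float) -> TokenBucket:
    # Assumption: capacity equals the per-minute limit and the bucket starts full.
    return TokenBucket(
        capacity=requests_per_minute,
        refill_rate=requests_per_minute / 60.0,
        tokens=requests_per_minute,
    )


if __name__ == "__main__":
    bucket = bucket_for_per_minute_limit(600)
    # Exhaust the whole bucket in a single burst at t = 0.
    for _ in range(600):
        bucket.try_acquire(now=0.0)
    print(bucket.wait_time(now=0.0))  # 0.1 seconds until the next token
```

Under these assumptions, a client that has emptied the bucket waits about 100 ms for the next request, while a client that has only partly refilled a token waits proportionally less. Timestamps are passed in explicitly so the arithmetic is easy to follow; a real client would more likely read them from a monotonic clock.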
How long will an API query run before timing out?