FAQs
Tokens-per-minute rate limiting
If you face any problems with proprietary models (like OpenAI models) where you are rate limited, we provide the option to set exponential_backoff=True
and to change delay_between_retries
to a value in seconds (defaults to 1 second).
For example:
See our models documentation for specific information about rate limiting.
In the case of OpenAI, they have tier based rate limits. See the docs for more information.