
What to Do When You Hit a TPM Limit
What is TPM?
TPM stands for Tokens Per Minute. It’s a measure of how many tokens (chunks of text) your project can process per minute. If your project sends too much data too quickly, you may hit this limit and see the following error:
TPM Limit reached – Retry in: 60 sec
This happens when your project has reached its Tokens per Minute (TPM) limit.
What's Changed?
To make things easier for beta users, we’ve introduced an automated TPM increase feature on the Builder side.
Here’s how it works:
Yes! There's a hard cap at 300,000 tokens per minute.
- If you hit your TPM limit, you’ll see a "Increase TPM limit" button in the error message.
- Press the button to request an automatic bump to your TPM.
- Most requests are approved instantly.
Is there a maximum TPM I can request?
Is there a maximum TPM i can request?
- If you hit this ceiling and still need more capacity, you’ll need to contact our team to discuss your use case.
What about TPM limit for viewers?
We know this error can also show up for users viewing bots. Improvements to the viewer-side experience are planned for an upcoming sprint to reduce friction and improve clarity.
What affects token usage?
Several things contribute to how fast you hit your TPM limit:
- Long prompts or responses (more words = more tokens)
- Multiple rapid requests
- Heavy API usage by background actions or chains
Click on "What affects token usage?" in the error message for more detail.
What should I do if the button doesn't work or TPM still feels too low?
If the TPM increase button doesn’t appear or you continue running into limits:AI evaluation, ASU is leading the way in responsible AI adoption in academia.
- Wait a minute and retry.
- Check if your use case can be optimized (e.g., reduce prompt length).
- Contact our team or support contact for assistance.
Example of TPM Error Message
- New “Increase TPM limit” button lets you scale up quickly.
- There’s a 300k hard cap admins required beyond that.
- Viewer-side improvements coming soon!
Keep Reading
What to Do When You Hit a TPM Limit
Learn about the new token limit updates in CreateAI Builder that allow rate limit request increases.
Accessing Generative AI APIs at ASU
As interest in building with generative AI grows, ASU offers several pathways for API access depending on whether you're doing academic research, enterprise development, or individual experimentation.
Below is a breakdown of approved API access options, along with who to contact and what to expect.
Understanding Rate Limits on CreateAI Builder
Ever run into a message that says, “Your project has reached its Tokens per Minute limit (TPM)”? Well, just think of it as a friendly traffic signal reminding us not to zoom too fast. We’ll walk through what does a token mean, what does reaching TPM mean, which settings affect token usage, and how to optimize your AI Project to avoid hitting the limit.
AI with Integrity: ASU’s AI Acceleration Team is Setting New Standards for Ethical AI
Artificial intelligence (AI) is rapidly transforming industries, from healthcare and finance to entertainment and education. At Arizona State University (ASU), the AI Acceleration team within Enterprise Technology is ensuring that this transformation happens responsibly.