What to Do When You Hit a TPM Limit


What is TPM? 

TPM stands for Tokens Per Minute. It’s a measure of how many tokens (chunks of text) your project can process per minute. If your project sends too much data too quickly, you may hit this limit and see the following error:

TPM Limit reached – Retry in: 60 sec
This happens when your project has reached its Tokens per Minute (TPM) limit.
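If you call your project from a script, the wait time can be read from the error text. The sketch below is a minimal illustration that assumes the message always follows the "Retry in: N sec" wording shown above. The function name `parse_retry_seconds` and the 60-second fallback are hypothetical and not part of any CreateAI Builder API.

```python
import re

# Assumption: the error text contains "Retry in: <number> sec" as shown in this article.
_RETRY_PATTERN = re.compile(r"Retry in:\s*(\d+)\s*sec", re.IGNORECASE)


def parse_retry_seconds(message: str, default: int = 60) -> int:
    """Return the suggested wait in seconds, or `default` if none is found."""
    match = _RETRY_PATTERN.search(message)
    return int(match.group(1)) if match else default


print(parse_retry_seconds("TPM Limit reached – Retry in: 60 sec"))  # 60
```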

What's Changed? 

To make things easier for beta users, we’ve introduced an automated TPM increase feature on the Builder side.

Here’s how it works:

  • If you hit your TPM limit, you’ll see an "Increase TPM limit" button in the error message.
  • Press the button to request an automatic bump to your TPM.
  • Most requests are approved instantly.

Is there a maximum TPM I can request?

Yes! There's a hard cap at 300,000 tokens per minute.

  • If you hit this ceiling and still need more capacity, you’ll need to contact our team to discuss your use case.
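To get a feel for what the 300,000 hard cap means in practice, the sketch below estimates how many average-sized requests fit into one minute of budget. The figure of 1,500 tokens per request is a made-up example, and the function name is hypothetical. Your own average depends on prompt and response length.

```python
HARD_CAP_TPM = 300_000  # hard cap stated in this article


def max_requests_per_minute(tpm_limit: int, avg_tokens_per_request: int) -> int:
    """Return how many average-sized requests fit in one minute of TPM budget."""
    if avg_tokens_per_request <= 0:
        raise ValueError("avg_tokens_per_request must be positive")
    return tpm_limit // avg_tokens_per_request


# Hypothetical workload: 1,500 tokens per request (prompt plus response).
print(max_requests_per_minute(HARD_CAP_TPM, 1_500))  # 200
```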

What about the TPM limit for viewers?

We know this error can also show up for users viewing bots. Improvements to the viewer-side experience are planned for an upcoming sprint to reduce friction and improve clarity.

What affects token usage?

Several things contribute to how fast you hit your TPM limit: 

  • Long prompts or responses (more words = more tokens)
  • Multiple rapid requests
  • Heavy API usage by background actions or chains

Click on "What affects token usage?" in the error message for more detail.
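The factors above can be illustrated with a small tracker that adds up tokens over the trailing 60 seconds. This is a sketch under stated assumptions, not how CreateAI Builder measures usage: the sliding-window approach, the `TokenWindow` class, and the rule of thumb of about 4 characters per token are all assumptions made for illustration.

```python
from collections import deque


def estimate_tokens(text: str) -> int:
    """Rough estimate only: assumes about 4 characters per token for English text."""
    return max(1, len(text) // 4)


class TokenWindow:
    """Track tokens used in the trailing window (60 seconds by default)."""

    def __init__(self, tpm_limit: int, window_seconds: float = 60.0):
        self.tpm_limit = tpm_limit
        self.window_seconds = window_seconds
        self._events = deque()  # (timestamp, tokens) pairs, oldest first

    def used(self, now: float) -> int:
        """Drop entries older than the window, then sum what remains."""
        while self._events and now - self._events[0][0] >= self.window_seconds:
            self._events.popleft()
        return sum(tokens for _, tokens in self._events)

    def try_record(self, now: float, tokens: int) -> bool:
        """Record a request if it fits the remaining budget; otherwise return False."""
        if self.used(now) + tokens > self.tpm_limit:
            return False
        self._events.append((now, tokens))
        return True


window = TokenWindow(tpm_limit=1_000)
print(window.try_record(0.0, 600))   # True: 600 of 1,000 tokens used
print(window.try_record(10.0, 600))  # False: a second rapid request would exceed the limit
print(window.try_record(61.0, 600))  # True: the first request has aged out of the window
```

Two 600-token requests sent ten seconds apart exceed a 1,000-token budget, while the same requests spaced more than a minute apart do not. Shorter prompts or slower request rates both leave more headroom.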

What should I do if the button doesn't work or TPM still feels too low? 

If the TPM increase button doesn’t appear or you continue running into limits:

  • Wait a minute and retry.
  • Check if your use case can be optimized (e.g., reduce prompt length).
  • Contact our team or your support contact for assistance.
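For scripted use, the "wait a minute and retry" step can be sketched as a small retry loop. Everything here is hypothetical: `TPMLimitError`, `call_with_retry`, and the `send` callable stand in for however your own code calls the project and detects the error. The 60-second default mirrors the "Retry in: 60 sec" message.

```python
import time


class TPMLimitError(Exception):
    """Hypothetical error raised by your own code when the TPM limit message appears."""

    def __init__(self, retry_after: float = 60.0):
        super().__init__(f"TPM Limit reached, retry in {retry_after:g} sec")
        self.retry_after = retry_after


def call_with_retry(send, max_attempts: int = 3, sleep=time.sleep):
    """Call send(); on TPMLimitError, wait the suggested delay and try again."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return send()
        except TPMLimitError as err:
            if attempt == max_attempts:
                raise  # give up and surface the error after the last attempt
            sleep(err.retry_after)
```

The `sleep` parameter is injectable so the loop can be tested without actually waiting. If the limit keeps recurring after a few attempts, retrying alone will not help, and shortening prompts or requesting a higher TPM is the better route.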

Example of TPM Error Message

[Image: TPM Example]

  • New “Increase TPM limit” button lets you scale up quickly.
  • There’s a 300k hard cap; admins are required beyond that.
  • Viewer-side improvements coming soon!

 


Faith Timoh Abang

Learn about the new token limit updates in CreateAI Builder that allow rate limit request increases.