AI Rate Limit Tracker

| Provider | Plan | Price | RPM | TPM | Daily | Notes |
|---|---|---|---|---|---|---|
| Anthropic | Tier 1 ($5 funded) | $5+ | 50 | 20k | | Day-1 cap; raises with usage |
| Anthropic | Tier 4 (sustained) | $200+ | 4000 | 1M | | Granted after ~30 days of usage |
| Anthropic | Claude Pro | $20/mo | | | 5x usage | Hourly + weekly caps |
| Anthropic | Claude Max 5x | $100/mo | | | 20x usage | Higher weekly cap |
| OpenAI | Tier 1 (after $5) | $5+ | 500 | 200k | 10k | GPT-5 = 30k TPM here |
| OpenAI | Tier 5 (sustained) | $1000+ | 10000 | 30M | | Prod-tier |
| OpenAI | ChatGPT Plus | $20/mo | | | GPT-5: 200/3h | Plus throttles when busy |
| OpenAI | ChatGPT Pro | $200/mo | | | Higher | Pro uses o-pro reasoning |
| Google | Gemini API Free | $0 | 5 | 1M | 25 | Hard rate-limit |
| Google | Gemini API Tier 1 | Pay | 1000 | 4M | | Most apps land here |
| Google | Gemini Advanced | $20/mo | | | Generous | 1-day cooldown if hammered |
| DeepSeek | API | Pay | | | | No published rate limit |
| Perplexity | Pro | $20/mo | | | 300/day Pro Search | Plus unlimited quick search |
| xAI | Grok API | Pay | 60 | 10k | | Day-1 default |
Hitting limits early? All providers raise tiers based on cumulative spend plus the number of days since the first payment. Anthropic upgrades automatically; OpenAI upgrades automatically after 7 days at each tier. To avoid 429 errors in production, add retries with exponential backoff to your client and use the streaming response API (it bills only for what you actually generate).
Data transparency: rate limits were verified against provider documentation on 2026-04-30. Tier thresholds change without notice; confirm directly in your provider console before planning architecture around specific numbers. See the sources & transparency page for the full bibliography.

AI provider rate limits affect both consumer plans (ChatGPT Plus, Claude Pro, Gemini Advanced have message-per-window caps) and API workloads (RPM = requests per minute, TPM = tokens per minute, daily / monthly quotas). Hitting limits at the wrong moment — production traffic spike, demo to investors, scheduled batch run — can break product reliability. Understanding your current tier and the path to higher tiers is essential for anyone building on top of frontier models. Most providers tier limits based on cumulative spend: Tier 1 ($5+ paid), Tier 2 ($50+), Tier 3 ($100+), Tier 4 ($250+), and so on with rapidly increasing limits. Hitting tier 4 takes weeks to months of consistent usage — plan ahead if you anticipate burst demand.

The tracker covers current rate limits across major providers in both consumer and API contexts:

  • Anthropic Claude API: Sonnet, Haiku, and Opus tiers; usage-tier progression from $5 to $400+ thresholds.
  • Anthropic Claude Pro/Max (consumer): Pro at $20/mo with a 5x usage cap; Max at $100/mo with 5-20x usage.
  • OpenAI GPT-5 / o3 API: Tier 1 to Tier 5+ progression.
  • ChatGPT Plus ($20/mo) and ChatGPT Pro ($200/mo).
  • Google Gemini API: substantial free tier; paid tiers based on cumulative usage.
  • Gemini Advanced (consumer, $20/mo).
  • DeepSeek API: very generous limits at low cost.
  • Perplexity API and consumer Pro.
  • xAI Grok API and Grok consumer access.

It also covers the weekly and daily caps that some providers enforce on top of per-minute limits.

Strategies for managing rate limits:

  1. Implement exponential backoff retries on 429 (rate limit exceeded) responses. Most SDKs do this automatically; you just need to enable it.
  2. Distribute load across multiple providers and models (for example, Sonnet for most traffic, Haiku for batch jobs, GPT-5 for specific tasks). Load balancing reduces single-provider risk.
  3. Use Batch APIs for work that is not time-sensitive: 50% discount and much higher rate limits, with results returned within 24 hours.
  4. For consumer usage that hits limits early: the Claude Pro 5x cap is shared across web, desktop, mobile, and Claude Code. If you keep hitting it, consider Max ($100/mo) or move the workload to the API, which is metered separately.
  5. For high-volume API workloads, contact provider sales for custom limits before you hit tier ceilings.
  6. Monitor response headers. Providers return X-RateLimit-Remaining and similar headers showing remaining quota; build dashboards or logs around these.
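
As a rough sketch of the header-monitoring strategy above, the helper below pulls remaining-quota values out of a response's headers so they can be logged or charted. The header names are assumptions to check against each provider's documentation (OpenAI documents `x-ratelimit-remaining-requests` and `x-ratelimit-remaining-tokens`; Anthropic documents `anthropic-ratelimit-requests-remaining` and `anthropic-ratelimit-tokens-remaining`), and the function name `remaining_quota` is hypothetical.

```python
from typing import Dict, Mapping, Optional

# Header names assumed from provider docs; confirm them before relying on this.
REMAINING_HEADERS = {
    "requests": (
        "x-ratelimit-remaining-requests",
        "anthropic-ratelimit-requests-remaining",
    ),
    "tokens": (
        "x-ratelimit-remaining-tokens",
        "anthropic-ratelimit-tokens-remaining",
    ),
}


def remaining_quota(headers: Mapping[str, str]) -> Dict[str, Optional[int]]:
    """Return remaining requests/tokens found in the headers (None if absent)."""
    # HTTP header names are case-insensitive, so normalise before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    result: Dict[str, Optional[int]] = {}
    for kind, candidates in REMAINING_HEADERS.items():
        result[kind] = None
        for name in candidates:
            value = lowered.get(name)
            if value is not None and value.strip().isdigit():
                result[kind] = int(value)
                break
    return result
```

Feeding the returned dictionary into whatever logging or metrics pipeline you already run is usually enough to see a tier ceiling approaching before 429s start.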

How to Use

  1. Filter by provider (Anthropic, OpenAI, Google, DeepSeek, Perplexity, xAI).
  2. Read the tier-by-tier RPM, TPM, and daily quota limits.
  3. Identify your current tier (typically based on cumulative paid usage).
  4. Plan a path to higher tiers if needed (consistent usage to hit spend thresholds).
  5. For consumer plans, see weekly/daily message caps.

When to Use

  • Architecting an LLM-powered product — sizing infrastructure to expected traffic.
  • Hitting rate limits in production — confirming you understand the path to higher tiers.
  • Comparing consumer plans (Claude Pro vs Max, ChatGPT Plus vs Pro).
  • Choosing between providers for a specific workload (TPM matters for long-context jobs).
  • Quarterly capacity planning — reviewing whether current tier is sufficient for next quarter's growth.

When Not to Use

  • Real-time current quota — providers return X-RateLimit headers in API responses; check those for actual remaining quota.
  • Custom enterprise contracts — those have negotiated limits beyond published tiers.
  • Rate limits for specific feature endpoints (search, embeddings, fine-tuning) — those have their own quotas not always covered.
  • Older deprecated models — limits may differ from current model lineup.

Common Use Cases

  • Verifying a number or output before passing it on
  • Quick use during a typical workday
  • Pre-decision sanity-check on inputs and outputs
  • Educational use — demonstrating the underlying concept

Frequently Asked Questions

What's RPM vs TPM?

RPM = Requests Per Minute (each API call counts as 1 request). TPM = Tokens Per Minute (input + output tokens summed across all requests). For chat use, RPM is usually the binding constraint (lots of small requests). For long-context work (summarizing books, processing transcripts), TPM is binding (few requests but many tokens each). Both apply simultaneously: you are throttled by whichever limit you reach first.
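
To make the "whichever limit first" point concrete, here is a minimal sketch that works out which limit binds for a given average request size. The function name `binding_limit` is hypothetical, and the figures in the usage note are the Anthropic Tier 1 values from the table on this page (50 RPM, 20k TPM).

```python
from typing import Tuple


def binding_limit(rpm: int, tpm: int, tokens_per_request: int) -> Tuple[str, float]:
    """Return which limit binds first and the sustainable requests per minute."""
    if tokens_per_request <= 0:
        raise ValueError("tokens_per_request must be positive")
    # Requests per minute the token budget alone would allow.
    by_tokens = tpm / tokens_per_request
    if by_tokens < rpm:
        return "TPM", by_tokens
    return "RPM", float(rpm)
```

With 50 RPM and 20k TPM, chat-sized requests of 300 tokens give `("RPM", 50.0)`, while 10,000-token long-context requests give `("TPM", 2.0)`: the same tier supports 50 small calls per minute but only 2 large ones.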

How do I move up tiers?

Most providers move you automatically based on cumulative paid spend over a rolling period (typically the past 30-90 days). Anthropic: Tier 1 ($5+ paid), Tier 2 ($50), Tier 3 ($100), Tier 4 ($400). OpenAI: Tier 1 ($5+ paid), through Tier 5 ($1000+). The progression typically takes weeks. For larger limits, contact sales — most providers offer enterprise tiers with negotiated limits.
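
The threshold ladder can be expressed as a simple lookup. The sketch below uses the Anthropic figures quoted in this answer ($5, $50, $100, $400) purely as illustrative data; it ignores any waiting period or rolling window a provider may apply, and the names `current_tier` and `spend_to_next_tier` are hypothetical.

```python
from bisect import bisect_right
from typing import Optional, Sequence

# Cumulative paid spend (USD) needed for Tiers 1-4, as quoted in the answer above.
# Thresholds change without notice; treat these as placeholders.
ANTHROPIC_THRESHOLDS = (5, 50, 100, 400)


def current_tier(spend_usd: float, thresholds: Sequence[float] = ANTHROPIC_THRESHOLDS) -> int:
    """Return the tier reached for a given spend (0 means no paid tier yet)."""
    return bisect_right(thresholds, spend_usd)


def spend_to_next_tier(
    spend_usd: float, thresholds: Sequence[float] = ANTHROPIC_THRESHOLDS
) -> Optional[float]:
    """Return the additional spend needed for the next tier, or None at the top."""
    tier = current_tier(spend_usd, thresholds)
    if tier >= len(thresholds):
        return None
    return thresholds[tier] - spend_usd
```

For example, `current_tier(120)` is 3 and `spend_to_next_tier(120)` is 280 under these assumed thresholds.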

Is Claude Pro enough?

It depends on usage. Claude Pro ($20/mo) offers 5x the usage of the free plan, which is sufficient for most casual use but easily exhausted by heavy users (large documents, long conversations, multiple projects per day). Heavy users should consider Claude Max ($100/mo for 5-20x Pro's limit), or supplement with API access for specific workloads. Pro's “5x” allowance is shared across all interfaces (web, desktop, mobile, Claude Code).

What about ChatGPT Plus vs Pro?

ChatGPT Plus ($20/mo): 80 messages per 3 hours on GPT-5; advanced features. ChatGPT Pro ($200/mo): 10x Plus message limit; access to o3-pro model; longer task limits. Most users find Plus sufficient. Pro is for power users hitting Plus limits, agentic workflows requiring extended runtime, or workflows specifically benefiting from o3-pro's deeper reasoning.

How do I handle 429 errors?

Implement exponential backoff: on a 429 response, wait 1s and retry, then 2s, 4s, 8s, and so on. Most SDKs (Anthropic Python, OpenAI Python) do this automatically; enable retries in the client config. For burst traffic patterns, queue requests and rate-limit them on your own side to stay under provider limits. Don't rely on provider 429s as a flow-control mechanism; doing so adds latency.
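
A minimal sketch of both ideas, for clients that do not already retry for you. `RateLimited` is a hypothetical stand-in for whatever exception your SDK raises on HTTP 429, and `call_with_backoff` and `Pacer` are illustrative names, not part of any provider's library. The `sleep` and `clock` parameters exist only so the behaviour can be tested without real waiting.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical stand-in for the 429 error your SDK raises."""


def call_with_backoff(
    fn: Callable[[], T],
    max_retries: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on RateLimited with delays of 1s, 2s, 4s, 8s, ..."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except RateLimited:
            if attempt == max_retries:
                raise  # retries exhausted; surface the error to the caller
            sleep(base_delay * (2 ** attempt))
    raise AssertionError("unreachable for max_retries >= 0")


class Pacer:
    """Client-side throttle: spaces calls at least 60/rpm seconds apart."""

    def __init__(
        self,
        rpm: float,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.interval = 60.0 / rpm
        self._clock = clock
        self._sleep = sleep
        self._next_ok = float("-inf")  # first call never waits

    def wait(self) -> None:
        """Block until the next request is allowed, then reserve the slot."""
        now = self._clock()
        if now < self._next_ok:
            self._sleep(self._next_ok - now)
            now = self._next_ok
        self._next_ok = now + self.interval
```

A typical pattern is to call `pacer.wait()` before each request and wrap the request itself in `call_with_backoff`. Production code often adds random jitter to the delay so that many clients do not retry in lockstep. The official Anthropic and OpenAI Python clients accept a `max_retries` argument at construction, so a hand-rolled loop like this is mainly useful with other HTTP clients.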

Can I get higher limits than published tiers?

Yes: contact provider sales. Anthropic, OpenAI, Google, and DeepSeek all have enterprise programs with negotiated limits beyond published tiers. This typically requires demonstrating sustained substantial usage (e.g., $5K+/month consistently) and explaining your use case. Custom limits often come with custom pricing, dedicated support, and security reviews. For startups under $10K/month spend, just rely on the standard tier progression.