To maintain system stability and equitable access, our API enforces rate limiting, which controls how frequently an organization can send requests within a given time window. API rate limits are defined in two ways:
- TPM (Tokens per Minute) for LLM models
- RPH (Requests per Hour) for video models
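
To stay under a TPM quota, a client can track its own token usage before sending each request. The sketch below is a minimal client-side sliding-window counter. It assumes a 60-second rolling window, which may not match how the server actually measures usage. The `TokenBudget` class and its `clock` parameter are hypothetical names used only for illustration and are not part of the API.

```python
import time
from collections import deque


class TokenBudget:
    """Client-side sliding-window tracker for a tokens-per-minute (TPM) quota.

    Assumption: the quota is measured over a rolling 60-second window.
    """

    def __init__(self, tpm_limit, window_seconds=60.0, clock=time.monotonic):
        self.tpm_limit = tpm_limit
        self.window_seconds = window_seconds
        self.clock = clock            # injectable so the window can be tested
        self._events = deque()        # (timestamp, tokens) pairs, oldest first
        self._used = 0                # tokens currently inside the window

    def _expire(self, now):
        # Drop usage records that have aged out of the window.
        while self._events and now - self._events[0][0] >= self.window_seconds:
            _, tokens = self._events.popleft()
            self._used -= tokens

    def try_consume(self, tokens):
        """Record `tokens` and return True if they fit in the current window."""
        now = self.clock()
        self._expire(now)
        if self._used + tokens > self.tpm_limit:
            return False
        self._events.append((now, tokens))
        self._used += tokens
        return True
```

A caller would check `budget.try_consume(estimated_tokens)` before each request and wait or queue the request when it returns `False`.
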
Usage Tiers and Auto Upgrades
Rate limits vary by usage tier, and each tier sets a different quota for each model. By default, organizations are assigned to Tier 1. As you purchase credit on our platform, we automatically upgrade you to the next usage tier according to the table below. For example, once your total purchases reach $50, you are upgraded to Tier 2 within 24 hours. Voucher redemptions do not count toward the total purchase amount.

| Tier Name | Total Purchase Amount | Upgrade Time After Purchase |
|---|---|---|
| Tier 1 | $0 | Immediately |
| Tier 2 | $50 | 24 hours |
| Tier 3 | $500 | 24 hours |
| Tier 4 | $2,000 | 24 hours |
| Tier 5 | $10,000 | 24 hours |
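
The tier table can be read as a simple threshold lookup. The sketch below assumes the thresholds are inclusive (consistent with the $50 example above) and that the amount passed in already excludes voucher redemptions. The function name `tier_for_purchase_total` is hypothetical. Per the table, the upgrade itself may take up to 24 hours after the purchase.

```python
# (minimum total purchase in USD, tier name), highest tier first.
TIER_THRESHOLDS = [
    (10_000, "Tier 5"),
    (2_000, "Tier 4"),
    (500, "Tier 3"),
    (50, "Tier 2"),
    (0, "Tier 1"),
]


def tier_for_purchase_total(total_usd):
    """Return the usage tier for a cumulative purchase amount in USD.

    Assumption: thresholds are inclusive and vouchers are already excluded.
    """
    if total_usd < 0:
        raise ValueError("total_usd must be non-negative")
    for threshold, tier in TIER_THRESHOLDS:
        if total_usd >= threshold:
            return tier
```
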
Rate Limit Table
| Model Name | Tier 1 TPM | Tier 2 TPM | Tier 3 TPM | Tier 4 TPM | Tier 5 TPM |
|---|---|---|---|---|---|
| anthropic/claude-3.7-sonnet | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-haiku-4.5 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-opus-4 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-opus-4.1 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-opus-4.5 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-opus-4.6 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-opus-4.7 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-sonnet-4 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-sonnet-4.5 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| anthropic/claude-sonnet-4.6 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| bytedance/seed-2.0-mini | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-Prover-V2-671B | 100,000 | 450,000 | 800,000 | 2,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-R1 | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-0528 | 150,000,000 | | | | |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-8B | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R1-Zero | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-R2 | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-V3 | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-ai/DeepSeek-V3-0324 | 100,000 | 450,000 | 800,000 | 2,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-V3.1 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-V3.1-Terminus | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-V3.2 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-V3.2-Exp | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-V3.2-Speciale | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| deepseek-ai/DeepSeek-V3-Base | 100,000 | 450,000 | 800,000 | 2,000,000 | 30,000,000 |
| deepseek-v3 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3.1-flash-lite-preview | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3.1-pro | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3.1-pro-preview | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3-flash | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3-flash-preview | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3-pro | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemini-3-pro-preview | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemma-4-26b-a4b-it | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| google/gemma-4-31b-it | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| gpt-5.4 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| kwaipilot/kat-coder-pro-v2 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| meta-llama/Llama-3.1-8B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| meta-llama/Llama-3.3-70B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| meta-llama/Llama-4-Scout-17B-16E-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| MiniMaxAI/MiniMax-M2 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| MiniMaxAI/MiniMax-M2.1 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| MiniMaxAI/MiniMax-M2.5 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| MiniMaxAI/MiniMax-M2.7 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| moonshotai/Kimi-K2.5 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| moonshotai/Kimi-K2-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| moonshotai/Kimi-K2-Instruct-0905 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| moonshotai/Kimi-K2-Thinking | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-4o | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-4o-mini | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.1 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.1-chat | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.2 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.2-chat | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.2-codex | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.3-codex | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.4 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.4-mini | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.4-nano | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-5.4-pro | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| openai/gpt-oss-120b | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-0.5B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-14B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-14B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-14B-Instruct-AWQ | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-14B-Instruct-GGUF | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-1.5B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-32B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-32B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-7B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-7B-Instruct-1M | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-7B-Instruct-AWQ | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-7B-Instruct-GGUF | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-7B-Instruct-GPTQ-Int4 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-Coder-14B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-Coder-32B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-Coder-7B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-Math-7B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen2.5-VL-7B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-235B-A22B-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-235B-A22B-Thinking-2507-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-30B-A3B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-32B-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.5-122B-A10B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.5-27B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.5-35B-A3B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.5-397B-A17B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.6-Max-Preview | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.6-Plus | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3.6-Plus-2026-04-02 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-Next-80B-A3B-Instruct | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/Qwen3-Next-80B-A3B-Thinking | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/QwQ-32B | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/QwQ-32B-AWQ | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/QwQ-32B-GGUF | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| Qwen/QwQ-32B-Preview | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| zai-org/GLM-4.5-Air-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| zai-org/GLM-4.5-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| zai-org/GLM-4.6 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| zai-org/GLM-4.7-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| zai-org/GLM-5.1-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
| zai-org/GLM-5-FP8 | 100,000 | 2,000,000 | 4,000,000 | 10,000,000 | 150,000,000 |
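
As a rough planning aid, a TPM limit can be converted into an approximate request rate by dividing it by the expected tokens per request. The sketch below copies two rows from the table above into a dictionary. The function names are hypothetical, and the table does not state whether the limit counts input tokens, output tokens, or both, so `tokens_per_request` here is an assumed combined estimate.

```python
# Excerpt of the rate limit table: model -> TPM for Tier 1 through Tier 5.
TPM_LIMITS = {
    "deepseek-ai/DeepSeek-R1": (100_000, 450_000, 800_000, 2_000_000, 30_000_000),
    "deepseek-ai/DeepSeek-V3.1": (100_000, 2_000_000, 4_000_000, 10_000_000, 150_000_000),
}


def tpm_limit(model, tier):
    """Look up the tokens-per-minute limit for a model at a tier (1 to 5)."""
    if not 1 <= tier <= 5:
        raise ValueError("tier must be between 1 and 5")
    return TPM_LIMITS[model][tier - 1]


def max_requests_per_minute(model, tier, tokens_per_request):
    """Estimate how many requests of a given size fit in one minute."""
    if tokens_per_request <= 0:
        raise ValueError("tokens_per_request must be positive")
    return tpm_limit(model, tier) // tokens_per_request
```

For example, at Tier 1 a workload averaging 2,000 tokens per request on `deepseek-ai/DeepSeek-V3.1` fits about 50 requests per minute (100,000 / 2,000).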