模型库¶
本页根据 API 数据自动生成,展示所有大模型提供商与模型的综合信息。
统计
提供商数量: 66 模型数量: 1411 最后更新: 2025/11/23 02:33:51
能力图例: 🧠 推理 🔧 工具 📎 附件 🌡️ 温度
AIHubMix¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | Input: $0.1 Output: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| Qwen3 235B A22B Instruct 2507 | qwen3-235b-a22b-instruct-2507 | 262.1K | 262.1K | Input: $0.28 Output: $1.12 | Model: 0.140 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
| Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | Input: $16.5 Output: $82.5 Cache Read: $1.5 Cache Write: $18.75 | Model: 8.250 Completion: 5.000 Cache: 0.091 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| GPT-5.1 Codex | gpt-5.1-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 🌡️ | 2025-11 | In: text, image Out: text | Released: 2025-11-15 |
| Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | Input: $1.1 Output: $5.5 Cache Read: $0.11 Cache Write: $1.25 | Model: 0.550 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Gemini 3 Pro Preview | gemini-3-pro-preview | 1M | 65K | Input: $2 Output: $12 Cache Read: $0.5 | Model: 1.000 Completion: 6.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-11 | In: text, image, audio, video Out: text | Released: 2025-11-19 |
| Gemini 2.5 Flash | gemini-2.5-flash | 1M | 65K | Input: $0.075 Output: $0.3 Cache Read: $0.02 | Model: 0.037 Completion: 4.000 Cache: 0.267 | 📎 🔧 🌡️ | 2025-04 | In: text, image, audio, video Out: text | Released: 2025-09-15 |
| GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 200K | 64K | Input: $3.3 Output: $16.5 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.650 Completion: 5.000 Cache: 0.091 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| DeepSeek-V3.2-Exp | DeepSeek-V3.2-Exp | 163K | 163K | Input: $0.27 Output: $0.41 | Model: 0.135 Completion: 1.519 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-29 |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 🌡️ | 2025-11 | In: text, image Out: text | Released: 2025-11-15 |
| Qwen3 235B A22B Thinking 2507 | qwen3-235b-a22b-thinking-2507 | 262.1K | 262.1K | Input: $0.28 Output: $2.8 | Model: 0.140 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
| GPT-5.1 | gpt-5.1 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-11 | In: text, image Out: text | Released: 2025-11-15 |
| GPT-5-Nano | gpt-5-nano | 128K | 16.4K | Input: $0.5 Output: $2 Cache Read: $0.25 | Model: 0.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-5-Codex | gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-4o | gpt-4o | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 Updated: 2024-08-06 |
| GPT-4.1 | gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GLM-4.6 | glm-4.6 | 204.8K | 204.8K | Input: $0.27 Output: $1.1 Cache Read: $0.11 Cache Write: $0 | Model: 0.135 Completion: 4.074 Cache: 0.407 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| o4-mini | o4-mini | 200K | 65.5K | Input: $1.5 Output: $6 Cache Read: $0.75 | Model: 0.750 Completion: 4.000 Cache: 0.500 | 🧠 | 2024-09 | In: text Out: text | Released: 2025-09-15 |
| GPT-5-Mini | gpt-5-mini | 200K | 64K | Input: $1.5 Output: $6 Cache Read: $0.75 | Model: 0.750 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| Gemini 2.5 Pro | gemini-2.5-pro | 2M | 65K | Input: $1.25 Output: $5 Cache Read: $0.31 | Model: 0.625 Completion: 4.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, audio, video Out: text | Released: 2025-09-15 |
| DeepSeek-V3.2-Exp-Think | DeepSeek-V3.2-Exp-Think | 131K | 64K | Input: $0.27 Output: $0.41 | Model: 0.135 Completion: 1.519 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-29 |
| GPT-4o (2024-11-20) | gpt-4o-2024-11-20 | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-11-20 |
| Qwen3 Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | 262.1K | 131K | Input: $0.82 Output: $3.29 | Model: 0.410 Completion: 4.012 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
| GPT-5 | gpt-5 | 400K | 128K | Input: $5 Output: $20 Cache Read: $2.5 | Model: 2.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| Kimi K2 0905 | Kimi-K2-0905 | 262.1K | 262.1K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| GPT-5-Pro | gpt-5-pro | 400K | 128K | Input: $7 Output: $28 Cache Read: $3.5 | Model: 3.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
Alibaba¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Qwen3-LiveTranslate Flash Realtime | qwen3-livetranslate-flash-realtime | 53.2K | 4.1K | Input: $10 Output: $10 Input Audio: $10 Output Audio: $38 | Model: 5.000 Completion: 3.800 | 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-22 |
| Qwen3-ASR Flash | qwen3-asr-flash | 53.2K | 4.1K | Input: $0.035 Output: $0.035 | Model: 0.018 Completion: 1.000 | - | 2024-04 | In: audio Out: text | Released: 2025-09-08 |
| Qwen-Omni Turbo | qwen-omni-turbo | 32.8K | 2K | Input: $0.07 Output: $0.27 Input Audio: $4.44 Output Audio: $8.89 | Model: 2.220 Completion: 2.002 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-01-19 Updated: 2025-03-26 |
| Qwen-VL Max | qwen-vl-max | 131.1K | 8.2K | Input: $0.8 Output: $3.2 | Model: 0.400 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-04-08 Updated: 2025-08-13 |
| Qwen3-Next 80B-A3B Instruct | qwen3-next-80b-a3b-instruct | 131.1K | 32.8K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
| Qwen Turbo | qwen-turbo | 1M | 16.4K | Input: $0.05 Output: $0.2 Reasoning: $0.5 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-11-01 Updated: 2025-04-28 |
| Qwen3-VL 235B-A22B | qwen3-vl-235b-a22b | 131.1K | 32.8K | Input: $0.7 Output: $2.8 Reasoning: $8.4 | Model: 0.350 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
| Qwen3 Coder Flash | qwen3-coder-flash | 1M | 65.5K | Input: $0.3 Output: $1.5 | Model: 0.150 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-28 |
| Qwen3-VL 30B-A3B | qwen3-vl-30b-a3b | 131.1K | 32.8K | Input: $0.2 Output: $0.8 Reasoning: $2.4 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
| Qwen3 14B | qwen3-14b | 131.1K | 8.2K | Input: $0.35 Output: $1.4 Reasoning: $4.2 | Model: 0.175 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| QVQ Max | qvq-max | 131.1K | 8.2K | Input: $1.2 Output: $4.8 | Model: 0.600 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-03-25 |
| Qwen Plus Character (Japanese) | qwen-plus-character-ja | 8.2K | 512 | Input: $0.5 Output: $1.4 | Model: 0.250 Completion: 2.800 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
| Qwen2.5 14B Instruct | qwen2-5-14b-instruct | 131.1K | 8.2K | Input: $0.35 Output: $1.4 | Model: 0.175 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| QwQ Plus | qwq-plus | 131.1K | 8.2K | Input: $0.8 Output: $2.4 | Model: 0.400 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-03-05 |
| Qwen3-Coder 30B-A3B Instruct | qwen3-coder-30b-a3b-instruct | 262.1K | 65.5K | Input: $0.45 Output: $2.25 | Model: 0.225 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen-VL OCR | qwen-vl-ocr | 34.1K | 4.1K | Input: $0.72 Output: $0.72 | Model: 0.360 Completion: 1.000 | 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-28 Updated: 2025-04-13 |
| Qwen2.5 72B Instruct | qwen2-5-72b-instruct | 131.1K | 8.2K | Input: $1.4 Output: $5.6 | Model: 0.700 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen3-Omni Flash | qwen3-omni-flash | 65.5K | 16.4K | Input: $0.43 Output: $1.66 Input Audio: $3.81 Output Audio: $15.11 | Model: 1.905 Completion: 3.966 | 🧠 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-15 |
| Qwen Flash | qwen-flash | 1M | 32.8K | Input: $0.05 Output: $0.4 | Model: 0.025 Completion: 8.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-07-28 |
| Qwen3 8B | qwen3-8b | 131.1K | 8.2K | Input: $0.18 Output: $0.7 Reasoning: $2.1 | Model: 0.090 Completion: 3.889 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen3-Omni Flash Realtime | qwen3-omni-flash-realtime | 65.5K | 16.4K | Input: $0.52 Output: $1.99 Input Audio: $4.57 Output Audio: $18.13 | Model: 2.285 Completion: 3.967 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-15 |
| Qwen2.5-VL 72B Instruct | qwen2-5-vl-72b-instruct | 131.1K | 8.2K | Input: $2.8 Output: $8.4 | Model: 1.400 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
| Qwen3-VL Plus | qwen3-vl-plus | 262.1K | 32.8K | Input: $0.2 Output: $1.6 Reasoning: $4.8 | Model: 0.100 Completion: 8.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Released: 2025-09-23 |
| Qwen Plus | qwen-plus | 1M | 32.8K | Input: $0.4 Output: $1.2 Reasoning: $4 | Model: 0.200 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01-25 Updated: 2025-09-11 |
| Qwen2.5 32B Instruct | qwen2-5-32b-instruct | 131.1K | 8.2K | Input: $0.7 Output: $2.8 | Model: 0.350 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen2.5-Omni 7B | qwen2-5-omni-7b | 32.8K | 2K | Input: $0.1 Output: $0.4 Input Audio: $6.76 | Model: 3.380 Completion: 0.059 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Open Weights Released: 2024-12 |
| Qwen Max | qwen-max | 32.8K | 8.2K | Input: $1.6 Output: $6.4 | Model: 0.800 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-04-03 Updated: 2025-01-25 |
| Qwen2.5 7B Instruct | qwen2-5-7b-instruct | 131.1K | 8.2K | Input: $0.175 Output: $0.7 | Model: 0.087 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen2.5-VL 7B Instruct | qwen2-5-vl-7b-instruct | 131.1K | 8.2K | Input: $0.35 Output: $1.05 | Model: 0.175 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
| Qwen3 235B-A22B | qwen3-235b-a22b | 131.1K | 16.4K | Input: $0.7 Output: $2.8 Reasoning: $8.4 | Model: 0.350 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen-Omni Turbo Realtime | qwen-omni-turbo-realtime | 32.8K | 2K | Input: $0.27 Output: $1.07 Input Audio: $4.44 Output Audio: $8.89 | Model: 2.220 Completion: 2.002 | 🔧 🌡️ | 2024-04 | In: text, image, audio Out: text, audio | Released: 2025-05-08 |
| Qwen-MT Turbo | qwen-mt-turbo | 16.4K | 8.2K | Input: $0.16 Output: $0.49 | Model: 0.080 Completion: 3.063 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
| Qwen3-Coder 480B-A35B Instruct | qwen3-coder-480b-a35b-instruct | 262.1K | 65.5K | Input: $1.5 Output: $7.5 | Model: 0.750 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen-MT Plus | qwen-mt-plus | 16.4K | 8.2K | Input: $2.46 Output: $7.37 | Model: 1.230 Completion: 2.996 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
| Qwen3 Max | qwen3-max | 262.1K | 65.5K | Input: $1.2 Output: $6 | Model: 0.600 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-09-23 |
| Qwen3 Coder Plus | qwen3-coder-plus | 1M | 65.5K | Input: $1 Output: $5 | Model: 0.500 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3-Next 80B-A3B (Thinking) | qwen3-next-80b-a3b-thinking | 131.1K | 32.8K | Input: $0.5 Output: $6 | Model: 0.250 Completion: 12.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
| Qwen3 32B | qwen3-32b | 131.1K | 16.4K | Input: $0.7 Output: $2.8 Reasoning: $8.4 | Model: 0.350 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen-VL Plus | qwen-vl-plus | 131.1K | 8.2K | Input: $0.21 Output: $0.63 | Model: 0.105 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-01-25 Updated: 2025-08-15 |
| DeepSeek R1 | deepseek-r1 | 128K | - | Input: $4 Output: $16 | Model: 2.000 Completion: 4.000 | - | - | In: text Out: text | - |
Alibaba (China)¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| DeepSeek R1 Distill Qwen 7B | deepseek-r1-distill-qwen-7b | 32.8K | 16.4K | Input: $0.072 Output: $0.144 | Model: 0.036 Completion: 2.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen3-ASR Flash | qwen3-asr-flash | 53.2K | 4.1K | Input: $0.032 Output: $0.032 | Model: 0.016 Completion: 1.000 | - | 2024-04 | In: audio Out: text | Released: 2025-09-08 |
| DeepSeek R1 0528 | deepseek-r1-0528 | 131.1K | 16.4K | Input: $0.574 Output: $2.294 | Model: 0.287 Completion: 3.997 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-05-28 |
| DeepSeek V3 | deepseek-v3 | 65.5K | 8.2K | Input: $0.287 Output: $1.147 | Model: 0.143 Completion: 3.997 | 🔧 🌡️ | - | In: text Out: text | Released: 2024-12-01 |
| Qwen-Omni Turbo | qwen-omni-turbo | 32.8K | 2K | Input: $0.058 Output: $0.23 Input Audio: $3.584 Output Audio: $7.168 | Model: 1.792 Completion: 2.000 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-01-19 Updated: 2025-03-26 |
| Qwen-VL Max | qwen-vl-max | 131.1K | 8.2K | Input: $0.23 Output: $0.574 | Model: 0.115 Completion: 2.496 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-04-08 Updated: 2025-08-13 |
| DeepSeek V3.2 Exp | deepseek-v3-2-exp | 131.1K | 65.5K | Input: $0.287 Output: $0.431 | Model: 0.143 Completion: 1.502 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen3-Next 80B-A3B Instruct | qwen3-next-80b-a3b-instruct | 131.1K | 32.8K | Input: $0.144 Output: $0.574 | Model: 0.072 Completion: 3.986 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
| DeepSeek R1 | deepseek-r1 | 131.1K | 16.4K | Input: $0.574 Output: $2.294 | Model: 0.287 Completion: 3.997 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen Turbo | qwen-turbo | 1M | 16.4K | Input: $0.044 Output: $0.087 Reasoning: $0.431 | Model: 0.022 Completion: 1.977 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-11-01 Updated: 2025-07-15 |
| Qwen3-VL 235B-A22B | qwen3-vl-235b-a22b | 131.1K | 32.8K | Input: $0.286705 Output: $1.14682 Reasoning: $2.867051 | Model: 0.143 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
| Qwen3 Coder Flash | qwen3-coder-flash | 1M | 65.5K | Input: $0.144 Output: $0.574 | Model: 0.072 Completion: 3.986 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-28 |
| Qwen3-VL 30B-A3B | qwen3-vl-30b-a3b | 131.1K | 32.8K | Input: $0.108 Output: $0.431 Reasoning: $1.076 | Model: 0.054 Completion: 3.991 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-04 |
| Qwen3 14B | qwen3-14b | 131.1K | 8.2K | Input: $0.144 Output: $0.574 Reasoning: $1.434 | Model: 0.072 Completion: 3.986 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| QVQ Max | qvq-max | 131.1K | 8.2K | Input: $1.147 Output: $4.588 | Model: 0.574 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-03-25 |
| DeepSeek R1 Distill Qwen 32B | deepseek-r1-distill-qwen-32b | 32.8K | 16.4K | Input: $0.287 Output: $0.861 | Model: 0.143 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen Plus Character | qwen-plus-character | 32.8K | 4.1K | Input: $0.115 Output: $0.287 | Model: 0.058 Completion: 2.496 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
| Qwen2.5 14B Instruct | qwen2-5-14b-instruct | 131.1K | 8.2K | Input: $0.144 Output: $0.431 | Model: 0.072 Completion: 2.993 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| QwQ Plus | qwq-plus | 131.1K | 8.2K | Input: $0.23 Output: $0.574 | Model: 0.115 Completion: 2.496 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-03-05 |
| Qwen2.5-Coder 32B Instruct | qwen2-5-coder-32b-instruct | 131.1K | 8.2K | Input: $0.287 Output: $0.861 | Model: 0.143 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-11 |
| Qwen3-Coder 30B-A3B Instruct | qwen3-coder-30b-a3b-instruct | 262.1K | 65.5K | Input: $0.216 Output: $0.861 | Model: 0.108 Completion: 3.986 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen Math Plus | qwen-math-plus | 4.1K | 3.1K | Input: $0.574 Output: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-08-16 Updated: 2024-09-19 |
| Qwen-VL OCR | qwen-vl-ocr | 34.1K | 4.1K | Input: $0.717 Output: $0.717 | Model: 0.358 Completion: 1.000 | 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-28 Updated: 2025-04-13 |
| Qwen Doc Turbo | qwen-doc-turbo | 131.1K | 8.2K | Input: $0.087 Output: $0.144 | Model: 0.043 Completion: 1.655 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
| Qwen Deep Research | qwen-deep-research | 1M | 32.8K | Input: $7.742 Output: $23.367 | Model: 3.871 Completion: 3.018 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
| Qwen2.5 72B Instruct | qwen2-5-72b-instruct | 131.1K | 8.2K | Input: $0.574 Output: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen3-Omni Flash | qwen3-omni-flash | 65.5K | 16.4K | Input: $0.058 Output: $0.23 Input Audio: $3.584 Output Audio: $7.168 | Model: 1.792 Completion: 2.000 | 🧠 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Released: 2025-09-15 |
| Qwen Flash | qwen-flash | 1M | 32.8K | Input: $0.022 Output: $0.216 | Model: 0.011 Completion: 9.818 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-07-28 |
| Qwen3 8B | qwen3-8b | 131.1K | 8.2K | Input: $0.072 Output: $0.287 Reasoning: $0.717 | Model: 0.036 Completion: 3.986 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen3-Omni Flash Realtime | qwen3-omni-flash-realtime | 65.5K | 16.4K | Input: $0.23 Output: $0.918 Input Audio: $3.584 Output Audio: $7.168 | Model: 1.792 Completion: 2.000 | 🔧 🌡️ | 2024-04 | In: text, image, audio Out: text, audio | Released: 2025-09-15 |
| Qwen2.5-VL 72B Instruct | qwen2-5-vl-72b-instruct | 131.1K | 8.2K | Input: $2.294 Output: $6.881 | Model: 1.147 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
| Qwen3-VL Plus | qwen3-vl-plus | 262.1K | 32.8K | Input: $0.143353 Output: $1.433525 Reasoning: $4.300576 | Model: 0.072 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text, image Out: text | Released: 2025-09-23 |
| Qwen Plus | qwen-plus | 1M | 32.8K | Input: $0.115 Output: $0.287 Reasoning: $1.147 | Model: 0.058 Completion: 2.496 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01-25 Updated: 2025-09-11 |
| Qwen2.5 32B Instruct | qwen2-5-32b-instruct | 131.1K | 8.2K | Input: $0.287 Output: $0.861 | Model: 0.143 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen2.5-Omni 7B | qwen2-5-omni-7b | 32.8K | 2K | Input: $0.087 Output: $0.345 Input Audio: $5.448 | Model: 2.724 Completion: 0.063 | 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text, audio | Open Weights Released: 2024-12 |
| Qwen Max | qwen-max | 131.1K | 8.2K | Input: $0.345 Output: $1.377 | Model: 0.172 Completion: 3.991 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-04-03 Updated: 2025-01-25 |
| Qwen Long | qwen-long | 10M | 8.2K | Input: $0.072 Output: $0.287 | Model: 0.036 Completion: 3.986 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01-25 |
| Qwen2.5-Math 72B Instruct | qwen2-5-math-72b-instruct | 4.1K | 3.1K | Input: $0.574 Output: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Moonshot Kimi K2 Instruct | moonshot-kimi-k2-instruct | 131.1K | 131.1K | Input: $0.574 Output: $2.294 | Model: 0.287 Completion: 3.997 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Tongyi Intent Detect V3 | tongyi-intent-detect-v3 | 8.2K | 1K | Input: $0.058 Output: $0.144 | Model: 0.029 Completion: 2.483 | 🌡️ | 2024-04 | In: text Out: text | Released: 2024-01 |
| Qwen2.5 7B Instruct | qwen2-5-7b-instruct | 131.1K | 8.2K | Input: $0.072 Output: $0.144 | Model: 0.036 Completion: 2.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen2.5-VL 7B Instruct | qwen2-5-vl-7b-instruct | 131.1K | 8.2K | Input: $0.287 Output: $0.717 | Model: 0.143 Completion: 2.498 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Open Weights Released: 2024-09 |
| DeepSeek V3.1 | deepseek-v3-1 | 131.1K | 65.5K | Input: $0.574 Output: $1.721 | Model: 0.287 Completion: 2.998 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 32.8K | 16.4K | Input: $0.287 Output: $0.861 | Model: 0.143 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen3 235B-A22B | qwen3-235b-a22b | 131.1K | 16.4K | Input: $0.287 Output: $1.147 Reasoning: $2.868 | Model: 0.143 Completion: 3.997 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen2.5-Coder 7B Instruct | qwen2-5-coder-7b-instruct | 131.1K | 8.2K | Input: $0.144 Output: $0.287 | Model: 0.072 Completion: 1.993 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-11 |
| DeepSeek R1 Distill Qwen 14B | deepseek-r1-distill-qwen-14b | 32.8K | 16.4K | Input: $0.144 Output: $0.431 | Model: 0.072 Completion: 2.993 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen-Omni Turbo Realtime | qwen-omni-turbo-realtime | 32.8K | 2K | Input: $0.23 Output: $0.918 Input Audio: $3.584 Output Audio: $7.168 | Model: 1.792 Completion: 2.000 | 🔧 🌡️ | 2024-04 | In: text, image, audio Out: text, audio | Released: 2025-05-08 |
| Qwen Math Turbo | qwen-math-turbo | 4.1K | 3.1K | Input: $0.287 Output: $0.861 | Model: 0.143 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-09-19 |
| Qwen-MT Turbo | qwen-mt-turbo | 16.4K | 8.2K | Input: $0.101 Output: $0.28 | Model: 0.051 Completion: 2.772 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
| DeepSeek R1 Distill Llama 8B | deepseek-r1-distill-llama-8b | 32.8K | 16.4K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen3-Coder 480B-A35B Instruct | qwen3-coder-480b-a35b-instruct | 262.1K | 65.5K | Input: $0.861 Output: $3.441 | Model: 0.430 Completion: 3.997 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen-MT Plus | qwen-mt-plus | 16.4K | 8.2K | Input: $0.259 Output: $0.775 | Model: 0.130 Completion: 2.992 | 🌡️ | 2024-04 | In: text Out: text | Released: 2025-01 |
| Qwen3 Max | qwen3-max | 262.1K | 65.5K | Input: $0.861 Output: $3.441 | Model: 0.430 Completion: 3.997 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-09-23 |
| QwQ 32B | qwq-32b | 131.1K | 8.2K | Input: $0.287 Output: $0.861 | Model: 0.143 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-12 |
| Qwen2.5-Math 7B Instruct | qwen2-5-math-7b-instruct | 4.1K | 3.1K | Input: $0.144 Output: $0.287 | Model: 0.072 Completion: 1.993 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-09 |
| Qwen3-Next 80B-A3B (Thinking) | qwen3-next-80b-a3b-thinking | 131.1K | 32.8K | Input: $0.144 Output: $1.434 | Model: 0.072 Completion: 9.958 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09 |
| DeepSeek R1 Distill Qwen 1.5B | deepseek-r1-distill-qwen-1-5b | 32.8K | 16.4K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-01-01 |
| Qwen3 32B | qwen3-32b | 131.1K | 16.4K | Input: $0.287 Output: $1.147 Reasoning: $2.868 | Model: 0.143 Completion: 3.997 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Qwen-VL Plus | qwen-vl-plus | 131.1K | 8.2K | Input: $0.115 Output: $0.287 | Model: 0.058 Completion: 2.496 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-01-25 Updated: 2025-08-15 |
| Qwen3 Coder Plus | qwen3-coder-plus | 1M | 65.5K | Input: $1 Output: $5 | Model: 0.500 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
aliyun-bailian¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| animate-anyone-gen2 | animate-anyone-gen2 | - | - | Per Second Standard: ¥0.08 | Model: 0.080 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| animate-anyone-template-gen2 | animate-anyone-template-gen2 | - | - | Per Second Standard: ¥0.08 | Model: 0.080 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| cosyvoice-v1 | cosyvoice-v1 | - | - | ¥2/10K chars | Model: 2.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| cosyvoice-v2 | cosyvoice-v2 | - | - | ¥2/10K chars | Model: 2.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| cosyvoice-v3-plus | cosyvoice-v3-plus | - | - | ¥2/10K chars | Model: 2.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| cosyvoice-v3 | cosyvoice-v3 | - | - | ¥0.4/10K chars | Model: 0.400 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| fun-asr-2025-08-25 | fun-asr-2025-08-25 | - | - | ¥0.00022/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| fun-asr-mtl-2025-08-25 | fun-asr-mtl-2025-08-25 | - | - | ¥0.00022/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| fun-asr-mtl | fun-asr-mtl | - | - | ¥0.00022/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| fun-asr-realtime-2025-09-15 | fun-asr-realtime-2025-09-15 | - | - | ¥0.00033/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| fun-asr-realtime | fun-asr-realtime | - | - | ¥0.00033/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| fun-asr | fun-asr | - | - | ¥0.00022/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| gte-rerank-v2 | gte-rerank-v2 | - | - | Input: ¥0.8 Output: - | Model: 0.400 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| image-out-painting | image-out-painting | - | - | ¥0.18/img | Model: 0.180 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| multimodal-embedding-v1 | multimodal-embedding-v1 | - | - | Text: ¥0.7/1K Image: ¥0.9/1K | - | - | - | In: text Out: text | - |
| paraformer-8k-v2 | paraformer-8k-v2 | - | - | ¥0.00008/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| paraformer-realtime-8k-v2 | paraformer-realtime-8k-v2 | - | - | ¥0.00024/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| paraformer-realtime-v2 | paraformer-realtime-v2 | - | - | ¥0.00024/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| paraformer-v2 | paraformer-v2 | - | - | ¥0.00008/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qvq-max-2025-03-25 | qvq-max-2025-03-25 | - | - | Input: ¥8 Output: ¥32 | Model: 4.000 Completion: 4.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-coder-turbo-2024-09-19 | qwen-coder-turbo-2024-09-19 | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-coder-turbo-latest | qwen-coder-turbo-latest | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-flash-2025-07-28 | qwen-flash-2025-07-28 | - | - | Input: ¥0.15 Output: ¥1.5 Input 128k 256k: ¥0.6 Input 256k 1m: ¥1.2 Output 128k 256k: ¥6 Output 256k 1m: ¥12 | Model: 0.600 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-flash | qwen-flash | - | - | Input: ¥0.15 Output: ¥1.5 Cache Read: ¥0.015 Cache Read 128k 256k: ¥0.06 Cache Read 256k 1m: ¥0.12 Cache Write: ¥0.188 Cache Write 128k 256k: ¥0.75 Cache Write 256k 1m: ¥1.5 Input 128k 256k: ¥0.6 Input 256k 1m: ¥1.2 Output 128k 256k: ¥6 Output 256k 1m: ¥12 | Model: 0.600 Completion: 10.000 Cache: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-image-edit | qwen-image-edit | - | - | ¥0.3/img | Model: 0.300 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-image-plus | qwen-image-plus | - | - | ¥0.2/img | Model: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-image | qwen-image | - | - | ¥0.25/img | Model: 0.250 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-long-latest | qwen-long-latest | - | - | Input: ¥0.5 Output: ¥2 | Model: 0.250 Completion: 4.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-long | qwen-long | - | - | Input: ¥0.5 Output: ¥2 | Model: 0.250 Completion: 4.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-max-latest | qwen-max-latest | - | - | Input: ¥2.4 Output: ¥9.6 | Model: 1.200 Completion: 4.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-max | qwen-max | - | - | Input: ¥2.4 Output: ¥9.6 Cache Read: ¥0.48 | Model: 1.200 Completion: 4.000 Cache: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-mt-image | qwen-mt-image | - | - | ¥0.003/img | Model: 0.003 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-mt-plus | qwen-mt-plus | - | - | Input: ¥1.8 Output: ¥5.4 | Model: 0.900 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-mt-turbo | qwen-mt-turbo | - | - | Input: ¥0.7 Output: ¥1.95 | Model: 0.350 Completion: 2.786 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-omni-turbo-latest | qwen-omni-turbo-latest | - | - | Text Input: ¥0.4 Vision Input: ¥1.5 Audio Input: ¥25 Output: ¥50 Multi Output: ¥50 Multiin Text Output: ¥4.5 Purein Text Output: ¥1.6 | - | - | - | In: text Out: text | - |
| qwen-omni-turbo-realtime-latest | qwen-omni-turbo-realtime-latest | - | - | Text Input: ¥1.6 Vision Input: ¥6 Audio Input: ¥25 Output: ¥50 Multi Output: ¥50 Multiin Text Output: ¥18 Purein Text Output: ¥6.4 | - | - | - | In: text Out: text | - |
| qwen-omni-turbo-realtime | qwen-omni-turbo-realtime | - | - | Text Input: ¥1.6 Vision Input: ¥6 Audio Input: ¥25 Output: ¥50 Multi Output: ¥50 Multiin Text Output: ¥18 Purein Text Output: ¥6.4 | - | - | - | In: text Out: text | - |
| qwen-omni-turbo | qwen-omni-turbo | - | - | Text Input: ¥0.4 Vision Input: ¥1.5 Audio Input: ¥25 Output: ¥50 Audio Input Cache: ¥5 Multi Output: ¥50 Multiin Text Output: ¥4.5 Purein Text Output: ¥1.6 Text Input Cache: ¥0.08 Vision Input Cache: ¥0.3 | - | - | - | In: text Out: text | - |
| qwen-plus-2024-09-19 | qwen-plus-2024-09-19 | - | - | Input: ¥0.8 Output: ¥2 | Model: 0.400 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-plus-latest | qwen-plus-latest | - | - | Input: ¥0.8 Output: ¥2 Input 128k 256k: ¥2.4 Input 256k 1m: ¥4.8 Output 128k 256k: ¥20 Output 256k 1m: ¥48 Thinking Input: ¥0.8 Thinking Input 128k 256k: ¥2.4 Thinking Input 256k 1m: ¥4.8 Thinking Output: ¥8 Thinking Output 128k 256k: ¥24 Thinking Output 256k 1m: ¥64 | Model: 2.400 Completion: 13.333 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-plus | qwen-plus | - | - | Input: ¥0.8 Output: ¥2 Cache Read: ¥0.08 Cache Read 128k 256k: ¥0.24 Cache Read 256k 1m: ¥0.48 Cache Write: ¥1 Cache Write 128k 256k: ¥3 Cache Write 256k 1m: ¥6 Input 128k 256k: ¥2.4 Input 256k 1m: ¥4.8 Output 128k 256k: ¥20 Output 256k 1m: ¥48 Thinking Cache Read: ¥0.08 Thinking Cache Read 128k 256k: ¥0.24 Thinking Cache Read 256k 1m: ¥0.48 Thinking Cache Write: ¥1 Thinking Cache Write 128k 256k: ¥3 Thinking Cache Write 256k 1m: ¥6 Thinking Input: ¥0.8 Thinking Input 128k 256k: ¥2.4 Thinking Input 256k 1m: ¥4.8 Thinking Output: ¥8 Thinking Output 128k 256k: ¥24 Thinking Output 256k 1m: ¥64 | Model: 2.400 Completion: 13.333 Cache: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-turbo-latest | qwen-turbo-latest | - | - | Input: ¥0.3 Output: ¥0.6 Thinking Input: ¥0.3 Thinking Output: ¥3 | Model: 0.150 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-turbo | qwen-turbo | - | - | Input: ¥0.3 Output: ¥0.6 Cache Read: ¥0.06 Thinking Cache Read: ¥0.06 Thinking Input: ¥0.3 Thinking Output: ¥3 | Model: 0.150 Completion: 10.000 Cache: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-vl-max-latest | qwen-vl-max-latest | - | - | Input: ¥1.6 Output: ¥4 | Model: 0.800 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-vl-max | qwen-vl-max | - | - | Input: ¥1.6 Output: ¥4 Cache Read: ¥0.32 | Model: 0.800 Completion: 2.500 Cache: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-vl-ocr-latest | qwen-vl-ocr-latest | - | - | VL: ¥5/1K | - | - | - | In: text Out: text | - |
| qwen-vl-ocr | qwen-vl-ocr | - | - | VL: ¥5/1K | - | - | - | In: text Out: text | - |
| qwen-vl-plus-latest | qwen-vl-plus-latest | - | - | Input: ¥0.8 Output: ¥2 | Model: 0.400 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen-vl-plus | qwen-vl-plus | - | - | Input: ¥0.8 Output: ¥2 Cache Read: ¥0.16 | Model: 0.400 Completion: 2.500 Cache: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-14b-instruct-1m | qwen2.5-14b-instruct-1m | - | - | Input: ¥1 Output: ¥3 | Model: 0.500 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-14b-instruct | qwen2.5-14b-instruct | - | - | Input: ¥1 Output: ¥3 | Model: 0.500 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-32b-instruct | qwen2.5-32b-instruct | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-3b-instruct | qwen2.5-3b-instruct | - | - | Input: ¥0.3 Output: ¥0.9 | Model: 0.150 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-72b-instruct | qwen2.5-72b-instruct | - | - | Input: ¥4 Output: ¥12 | Model: 2.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-7b-instruct-1m | qwen2.5-7b-instruct-1m | - | - | Input: ¥0.5 Output: ¥1 | Model: 0.250 Completion: 2.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-7b-instruct | qwen2.5-7b-instruct | - | - | Input: ¥0.5 Output: ¥1 | Model: 0.250 Completion: 2.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-coder-14b-instruct | qwen2.5-coder-14b-instruct | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-coder-32b-instruct | qwen2.5-coder-32b-instruct | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-coder-7b-instruct | qwen2.5-coder-7b-instruct | - | - | Input: ¥1 Output: ¥2 | Model: 0.500 Completion: 2.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-omni-7b | qwen2.5-omni-7b | - | - | Text Input: ¥0.6 Vision Input: ¥2 Audio Input: ¥38 Output: ¥76 Multi Output: ¥76 Multiin Text Output: ¥6 Purein Text Output: ¥2.4 | - | - | - | In: text Out: text | - |
| qwen2.5-vl-32b-instruct | qwen2.5-vl-32b-instruct | - | - | Input: ¥8 Output: ¥24 | Model: 4.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-vl-3b-instruct | qwen2.5-vl-3b-instruct | - | - | Input: ¥1.2 Output: ¥3.6 | Model: 0.600 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-vl-72b-instruct | qwen2.5-vl-72b-instruct | - | - | Input: ¥16 Output: ¥48 | Model: 8.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen2.5-vl-7b-instruct | qwen2.5-vl-7b-instruct | - | - | Input: ¥2 Output: ¥5 | Model: 1.000 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-0.6b | qwen3-0.6b | - | - | Input: ¥0.3 Output: ¥1.2 Thinking Input: ¥0.3 Thinking Output: ¥3 | Model: 0.150 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-1.7b | qwen3-1.7b | - | - | Input: ¥0.3 Output: ¥1.2 Thinking Input: ¥0.3 Thinking Output: ¥3 | Model: 0.150 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-14b | qwen3-14b | - | - | Input: ¥1 Output: ¥4 Thinking Input: ¥1 Thinking Output: ¥10 | Model: 0.500 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-235b-a22b-instruct-2507 | qwen3-235b-a22b-instruct-2507 | - | - | Input: ¥2 Output: ¥8 | Model: 1.000 Completion: 4.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-235b-a22b-thinking-2507 | qwen3-235b-a22b-thinking-2507 | - | - | Thinking Input: ¥2 Thinking Output: ¥20 | Model: 1.000 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-235b-a22b | qwen3-235b-a22b | - | - | Input: ¥2 Output: ¥8 Thinking Input: ¥2 Thinking Output: ¥20 | Model: 1.000 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-30b-a3b | qwen3-30b-a3b | - | - | Input: ¥0.75 Output: ¥3 Thinking Input: ¥0.75 Thinking Output: ¥7.5 | Model: 0.375 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-32b | qwen3-32b | - | - | Input: ¥2 Output: ¥8 Thinking Input: ¥2 Thinking Output: ¥20 | Model: 1.000 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-4b | qwen3-4b | - | - | Input: ¥0.3 Output: ¥1.2 Thinking Input: ¥0.3 Thinking Output: ¥3 | Model: 0.150 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-8b | qwen3-8b | - | - | Input: ¥0.5 Output: ¥2 Thinking Input: ¥0.5 Thinking Output: ¥5 | Model: 0.250 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-asr-flash-2025-09-08 | qwen3-asr-flash-2025-09-08 | - | - | ¥0.00022/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-asr-flash | qwen3-asr-flash | - | - | ¥0.00022/s | Model: 0.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-coder-30b-a3b-instruct | qwen3-coder-30b-a3b-instruct | - | - | Input: ¥1.5 Output: ¥6 Input 128k 256k: ¥3.75 Input 256k 1m: ¥7.5 Input 32k 128k: ¥2.25 Output 128k 256k: ¥15 Output 256k 1m: ¥37.5 Output 32k 128k: ¥9 | Model: 3.750 Completion: 5.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-coder-480b-a35b-instruct | qwen3-coder-480b-a35b-instruct | - | - | Input: ¥6 Output: ¥24 Input 128k 256k: ¥15 Input 256k 1m: ¥30 Input 32k 128k: ¥9 Output 128k 256k: ¥60 Output 256k 1m: ¥300 Output 32k 128k: ¥36 | Model: 15.000 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-coder-flash | qwen3-coder-flash | - | - | Input: ¥1 Output: ¥4 Cache Read: ¥0.1 Cache Read 128k 256k: ¥0.25 Cache Read 256k 1m: ¥0.5 Cache Read 32k 128k: ¥0.15 Cache Write: ¥1.25 Cache Write 128k 256k: ¥3.125 Cache Write 256k 1m: ¥6.25 Cache Write 32k 128k: ¥1.875 Input 128k 256k: ¥2.5 Input 256k 1m: ¥5 Input 32k 128k: ¥1.5 Output 128k 256k: ¥10 Output 256k 1m: ¥25 Output 32k 128k: ¥6 | Model: 2.500 Completion: 5.000 Cache: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-coder-plus-2025-07-22 | qwen3-coder-plus-2025-07-22 | - | - | Input: ¥4 Output: ¥16 Input 128k 256k: ¥10 Input 256k 1m: ¥20 Input 32k 128k: ¥6 Output 128k 256k: ¥40 Output 256k 1m: ¥200 Output 32k 128k: ¥24 | Model: 10.000 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-coder-plus-2025-09-23 | qwen3-coder-plus-2025-09-23 | - | - | Input: ¥4 Output: ¥16 Input 128k 256k: ¥10 Input 256k 1m: ¥20 Input 32k 128k: ¥6 Output 128k 256k: ¥40 Output 256k 1m: ¥200 Output 32k 128k: ¥24 | Model: 10.000 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-coder-plus | qwen3-coder-plus | - | - | Input: ¥4 Output: ¥16 Cache Read: ¥0.4 Cache Read 128k 256k: ¥1 Cache Read 256k 1m: ¥2 Cache Read 32k 128k: ¥0.6 Cache Write: ¥5 Cache Write 128k 256k: ¥12.5 Cache Write 256k 1m: ¥25 Cache Write 32k 128k: ¥7.5 Input 128k 256k: ¥10 Input 256k 1m: ¥20 Input 32k 128k: ¥6 Output 128k 256k: ¥40 Output 256k 1m: ¥200 Output 32k 128k: ¥24 | Model: 10.000 Completion: 10.000 Cache: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-max-2025-09-23 | qwen3-max-2025-09-23 | - | - | Input: ¥6 Output: ¥24 Input 128k 256k: ¥15 Input 32k 128k: ¥10 Output 128k 256k: ¥60 Output 32k 128k: ¥40 | Model: 7.500 Completion: 4.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-max-preview | qwen3-max-preview | - | - | Input: ¥6 Output: ¥24 Cache Read: ¥1.2 Cache Read 128k 256k: ¥3 Cache Read 32k 128k: ¥2 Input 128k 256k: ¥15 Input 32k 128k: ¥10 Output 128k 256k: ¥60 Output 32k 128k: ¥40 | Model: 7.500 Completion: 4.000 Cache: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-max | qwen3-max | - | - | Input: ¥6 Output: ¥24 Cache Read: ¥0.6 Cache Read 128k 256k: ¥1.5 Cache Read 32k 128k: ¥1 Cache Write: ¥7.5 Cache Write 128k 256k: ¥18.75 Cache Write 32k 128k: ¥12.5 Input 128k 256k: ¥15 Input 32k 128k: ¥10 Output 128k 256k: ¥60 Output 32k 128k: ¥40 | Model: 7.500 Completion: 4.000 Cache: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-omni-30b-a3b-captioner | qwen3-omni-30b-a3b-captioner | - | - | Audio Input: ¥15.8 Multi Output: ¥12.7 Multiin Text Output: ¥12.7 | - | - | - | In: text Out: text | - |
| qwen3-omni-flash-2025-09-15 | qwen3-omni-flash-2025-09-15 | - | - | Text Input: ¥1.8 Vision Input: ¥3.3 Audio Input: ¥15.8 Output: ¥62.6 Multi Output: ¥62.6 Multiin Text Output: ¥12.7 Purein Text Output: ¥6.9 Thinking Audio Input: ¥15.8 Thinking Multiin Text Output: ¥12.7 Thinking Purein Text Output: ¥6.9 Thinking Text Input: ¥1.8 Thinking Vision Input: ¥3.3 | - | - | - | In: text Out: text | - |
| qwen3-omni-flash-realtime-2025-09-15 | qwen3-omni-flash-realtime-2025-09-15 | - | - | Text Input: ¥2.2 Vision Input: ¥3.9 Audio Input: ¥18.9 Output: ¥75.1 Multi Output: ¥75.1 Multiin Text Output: ¥15.2 Purein Text Output: ¥8.3 | - | - | - | In: text Out: text | - |
| qwen3-omni-flash-realtime | qwen3-omni-flash-realtime | - | - | Text Input: ¥2.2 Vision Input: ¥3.9 Audio Input: ¥18.9 Output: ¥75.1 Multi Output: ¥75.1 Multiin Text Output: ¥15.2 Purein Text Output: ¥8.3 | - | - | - | In: text Out: text | - |
| qwen3-omni-flash | qwen3-omni-flash | - | - | Text Input: ¥1.8 Vision Input: ¥3.3 Audio Input: ¥15.8 Output: ¥62.6 Multi Output: ¥62.6 Multiin Text Output: ¥12.7 Purein Text Output: ¥6.9 Thinking Audio Input: ¥15.8 Thinking Multiin Text Output: ¥12.7 Thinking Purein Text Output: ¥6.9 Thinking Text Input: ¥1.8 Thinking Vision Input: ¥3.3 | - | - | - | In: text Out: text | - |
| qwen3-tts-flash-2025-09-18 | qwen3-tts-flash-2025-09-18 | - | - | ¥0.8/10K chars | Model: 0.800 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-tts-flash-realtime-2025-09-18 | qwen3-tts-flash-realtime-2025-09-18 | - | - | ¥1/10K chars | Model: 1.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-tts-flash-realtime | qwen3-tts-flash-realtime | - | - | ¥1/10K chars | Model: 1.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-tts-flash | qwen3-tts-flash | - | - | ¥0.8/10K chars | Model: 0.800 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-vl-plus-2025-09-23 | qwen3-vl-plus-2025-09-23 | - | - | Input: ¥1 Output: ¥10 Input 128k 256k: ¥3 Input 32k 128k: ¥1.5 Output 128k 256k: ¥30 Output 32k 128k: ¥15 | Model: 1.500 Completion: 10.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwen3-vl-plus | qwen3-vl-plus | - | - | Input: ¥1 Output: ¥10 Cache Read: ¥0.2 Cache Read 128k 256k: ¥0.6 Cache Read 32k 128k: ¥0.3 Input 128k 256k: ¥3 Input 32k 128k: ¥1.5 Output 128k 256k: ¥30 Output 32k 128k: ¥15 | Model: 1.500 Completion: 10.000 Cache: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwq-32b-preview | qwq-32b-preview | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwq-32b | qwq-32b | - | - | Input: ¥2 Output: ¥6 | Model: 1.000 Completion: 3.000 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwq-plus-latest | qwq-plus-latest | - | - | Input: ¥1.6 Output: ¥4 | Model: 0.800 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| qwq-plus | qwq-plus | - | - | Input: ¥1.6 Output: ¥4 | Model: 0.800 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| text-embedding-async-v2 | text-embedding-async-v2 | - | - | Input: ¥0.7 Output: - | Model: 0.350 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| text-embedding-v1 | text-embedding-v1 | - | - | Input: ¥0.7 Output: - | Model: 0.350 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| text-embedding-v2 | text-embedding-v2 | - | - | Input: ¥0.7 Output: - | Model: 0.350 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| text-embedding-v3 | text-embedding-v3 | - | - | Input: ¥0.5 Output: - | Model: 0.250 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| text-embedding-v4 | text-embedding-v4 | - | - | Input: ¥0.5 Output: - | Model: 0.250 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| tongyi-embedding-vision-flash | tongyi-embedding-vision-flash | - | - | Text: ¥0.2/1K Image: ¥0.5/1K | - | - | - | In: text Out: text | - |
| tongyi-embedding-vision-plus | tongyi-embedding-vision-plus | - | - | Text: ¥0.5/1K Image: ¥0.5/1K | - | - | - | In: text Out: text | - |
| tongyi-intent-detect-v3 | tongyi-intent-detect-v3 | - | - | Input: ¥0.4 Output: ¥1 | Model: 0.200 Completion: 2.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-animate-mix | wan2.2-animate-mix | - | - | Per Second Pro: ¥0.9 Per Second Standard: ¥0.6 | Model: 0.600 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-animate-move | wan2.2-animate-move | - | - | Per Second Pro: ¥0.6 Per Second Standard: ¥0.4 | Model: 0.400 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-i2v-flash | wan2.2-i2v-flash | - | - | Per Second 1080p: ¥0.48 Per Second 480p: ¥0.1 Per Second 720p: ¥0.2 | Model: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-i2v-plus | wan2.2-i2v-plus | - | - | Per Second 1080p: ¥0.7 Per Second 480p: ¥0.14 | Model: 0.140 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-kf2v-flash | wan2.2-kf2v-flash | - | - | Per Second 1080p: ¥0.48 Per Second 480p: ¥0.1 Per Second 720p: ¥0.2 | Model: 0.100 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-s2v | wan2.2-s2v | - | - | Per Second 480p: ¥0.5 Per Second 720p: ¥0.9 | Model: 0.500 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-t2i-flash | wan2.2-t2i-flash | - | - | ¥0.14/img | Model: 0.140 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-t2i-plus | wan2.2-t2i-plus | - | - | ¥0.2/img | Model: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.2-t2v-plus | wan2.2-t2v-plus | - | - | Per Second 1080x1920: ¥0.7 Per Second 1248x1632: ¥0.7 Per Second 1440x1440: ¥0.7 Per Second 1632x1248: ¥0.7 Per Second 1920x1080: ¥0.7 Per Second 480x832: ¥0.14 Per Second 624x624: ¥0.14 Per Second 832x480: ¥0.14 | Model: 0.140 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.5-i2i-preview | wan2.5-i2i-preview | - | - | ¥0.2/img | Model: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.5-i2v-preview | wan2.5-i2v-preview | - | - | Per Second 1080p: ¥1 Per Second 480p: ¥0.3 Per Second 720p: ¥0.6 | Model: 0.300 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.5-t2i-preview | wan2.5-t2i-preview | - | - | ¥0.2/img | Model: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wan2.5-t2v-preview | wan2.5-t2v-preview | - | - | Per Second 1080x1920: ¥1 Per Second 1088x832: ¥0.6 Per Second 1248x1632: ¥1 Per Second 1280x720: ¥0.6 Per Second 1440x1440: ¥1 Per Second 1632x1248: ¥1 Per Second 1920x1080: ¥1 Per Second 480x832: ¥0.3 Per Second 624x624: ¥0.3 Per Second 720x1280: ¥0.6 Per Second 832x1088: ¥0.6 Per Second 832x480: ¥0.3 Per Second 960x960: ¥0.6 | Model: 0.300 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx-background-generation-v2 | wanx-background-generation-v2 | - | - | ¥0.08/img | Model: 0.080 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx-sketch-to-image-lite | wanx-sketch-to-image-lite | - | - | ¥0.06/img | Model: 0.060 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx-style-repaint-v1 | wanx-style-repaint-v1 | - | - | ¥0.12/img | Model: 0.120 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx-v1 | wanx-v1 | - | - | ¥0.16/img | Model: 0.160 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.0-t2i-turbo | wanx2.0-t2i-turbo | - | - | ¥0.04/img | Model: 0.040 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-i2v-plus | wanx2.1-i2v-plus | - | - | Per Second Standard: ¥0.7 | Model: 0.700 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-i2v-turbo | wanx2.1-i2v-turbo | - | - | Per Second Standard: ¥0.24 | Model: 0.240 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-imageedit | wanx2.1-imageedit | - | - | ¥0.14/img | Model: 0.140 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-kf2v-plus | wanx2.1-kf2v-plus | - | - | Per Second Standard: ¥0.7 | Model: 0.700 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-t2i-plus | wanx2.1-t2i-plus | - | - | ¥0.2/img | Model: 0.200 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-t2i-turbo | wanx2.1-t2i-turbo | - | - | ¥0.14/img | Model: 0.140 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-t2v-plus | wanx2.1-t2v-plus | - | - | Per Second 1088x832: ¥0.7 Per Second 1280x720: ¥0.7 Per Second 720x1280: ¥0.7 Per Second 832x1088: ¥0.7 Per Second 960x960: ¥0.7 | Model: 0.700 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-t2v-turbo | wanx2.1-t2v-turbo | - | - | Per Second 1088x832: ¥0.24 Per Second 1280x720: ¥0.24 Per Second 480x832: ¥0.24 Per Second 624x624: ¥0.24 Per Second 720x1280: ¥0.24 Per Second 832x1088: ¥0.24 Per Second 832x480: ¥0.24 Per Second 960x960: ¥0.24 | Model: 0.240 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
| wanx2.1-vace-plus | wanx2.1-vace-plus | - | - | Per Second Standard: ¥0.7 | Model: 0.700 (CNY pricing, multiply by USD/CNY rate for NewAPI) | - | - | In: text Out: text | - |
Amazon Bedrock¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Command R+ | cohere.command-r-plus-v1:0 | 128K | 4.1K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-04-04 |
| Claude 2 | anthropic.claude-v2 | 100K | 4.1K | Input: $8 Output: $24 | Model: 4.000 Completion: 3.000 | 🌡️ | 2023-08 | In: text Out: text | Released: 2023-07-11 |
| Claude Sonnet 3.7 | anthropic.claude-3-7-sonnet-20250219-v1:0 | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Sonnet 4 | anthropic.claude-sonnet-4-20250514-v1:0 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-05-22 |
| Qwen3 Coder 30B A3B Instruct | qwen.qwen3-coder-30b-a3b-v1:0 | 262.1K | 131.1K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2025-09-18 |
| Llama 3.2 11B Instruct | meta.llama3-2-11b-instruct-v1:0 | 128K | 4.1K | Input: $0.16 Output: $0.16 | Model: 0.080 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2024-09-25 |
| Claude Haiku 3 | anthropic.claude-3-haiku-20240307-v1:0 | 200K | 4.1K | Input: $0.25 Output: $1.25 | Model: 0.125 Completion: 5.000 | 📎 🔧 🌡️ | 2024-02 | In: text, image Out: text | Released: 2024-03-13 |
| Llama 3.2 90B Instruct | meta.llama3-2-90b-instruct-v1:0 | 128K | 4.1K | Input: $0.72 Output: $0.72 | Model: 0.360 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2024-09-25 |
| Llama 3.2 1B Instruct | meta.llama3-2-1b-instruct-v1:0 | 131K | 4.1K | Input: $0.1 Output: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-09-25 |
| Claude 2.1 | anthropic.claude-v2:1 | 200K | 4.1K | Input: $8 Output: $24 | Model: 4.000 Completion: 3.000 | 🌡️ | 2023-08 | In: text Out: text | Released: 2023-11-21 |
| DeepSeek-V3.1 | deepseek.v3-v1:0 | 163.8K | 81.9K | Input: $0.58 Output: $1.68 | Model: 0.290 Completion: 2.897 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-09-18 |
| Command Light | cohere.command-light-text-v14 | 4.1K | 4.1K | Input: $0.3 Output: $0.6 | Model: 0.150 Completion: 2.000 | 🌡️ | 2023-08 | In: text Out: text | Open Weights Released: 2023-11-01 |
| Jamba 1.5 Large | ai21.jamba-1-5-large-v1:0 | 256K | 4.1K | Input: $2 Output: $8 | Model: 1.000 Completion: 4.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2024-08-15 |
| Llama 3.3 70B Instruct | meta.llama3-3-70b-instruct-v1:0 | 128K | 4.1K | Input: $0.72 Output: $0.72 | Model: 0.360 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Claude Opus 3 | anthropic.claude-3-opus-20240229-v1:0 | 200K | 4.1K | Input: $15 Output: $75 | Model: 7.500 Completion: 5.000 | 📎 🔧 🌡️ | 2023-08 | In: text, image Out: text | Released: 2024-02-29 |
| Nova Pro | amazon.nova-pro-v1:0 | 300K | 8.2K | Input: $0.8 Output: $3.2 Cache Read: $0.2 | Model: 0.400 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
| Llama 3.1 8B Instruct | meta.llama3-1-8b-instruct-v1:0 | 128K | 4.1K | Input: $0.22 Output: $0.22 | Model: 0.110 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Qwen3 32B (dense) | qwen.qwen3-32b-v1:0 | 16.4K | 16.4K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2025-09-18 |
| Claude Sonnet 3.5 | anthropic.claude-3-5-sonnet-20240620-v1:0 | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-06-20 |
| Claude Haiku 4.5 | anthropic.claude-haiku-4-5-20251001-v1:0 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-28 | In: text, image Out: text | Released: 2025-10-15 |
| Command R | cohere.command-r-v1:0 | 128K | 4.1K | Input: $0.5 Output: $1.5 | Model: 0.250 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-03-11 |
| Nova Micro | amazon.nova-micro-v1:0 | 128K | 8.2K | Input: $0.035 Output: $0.14 Cache Read: $0.00875 | Model: 0.018 Completion: 4.000 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-03 |
| Llama 3.1 70B Instruct | meta.llama3-1-70b-instruct-v1:0 | 128K | 4.1K | Input: $0.72 Output: $0.72 | Model: 0.360 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Llama 3 70B Instruct | meta.llama3-70b-instruct-v1:0 | 8.2K | 2K | Input: $2.65 Output: $3.5 | Model: 1.325 Completion: 1.321 | 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| DeepSeek-R1 | deepseek.r1-v1:0 | 128K | 32.8K | Input: $1.35 Output: $5.4 | Model: 0.675 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-01-20 Updated: 2025-05-29 |
| Claude Sonnet 3.5 v2 | anthropic.claude-3-5-sonnet-20241022-v2:0 | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-22 |
| Command | cohere.command-text-v14 | 4.1K | 4.1K | Input: $1.5 Output: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2023-08 | In: text Out: text | Open Weights Released: 2023-11-01 |
| Claude Opus 4 | anthropic.claude-opus-4-20250514-v1:0 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-05-22 |
| Qwen3 Coder 480B A35B Instruct | qwen.qwen3-coder-480b-a35b-v1:0 | 131.1K | 65.5K | Input: $0.22 Output: $1.8 | Model: 0.110 Completion: 8.182 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2025-09-18 |
| Claude Sonnet 4.5 | anthropic.claude-sonnet-4-5-20250929-v1:0 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Llama 3.2 3B Instruct | meta.llama3-2-3b-instruct-v1:0 | 131K | 4.1K | Input: $0.15 Output: $0.15 | Model: 0.075 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-09-25 |
| Claude Instant | anthropic.claude-instant-v1 | 100K | 4.1K | Input: $0.8 Output: $2.4 | Model: 0.400 Completion: 3.000 | 🌡️ | 2023-08 | In: text Out: text | Released: 2023-03-01 |
| Nova Premier | amazon.nova-premier-v1:0 | 1M | 16.4K | Input: $2.5 Output: $12.5 | Model: 1.250 Completion: 5.000 | 📎 🧠 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
| Claude Opus 4.1 | anthropic.claude-opus-4-1-20250805-v1:0 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Llama 4 Scout 17B Instruct | meta.llama4-scout-17b-instruct-v1:0 | 3.5M | 16.4K | Input: $0.17 Output: $0.66 | Model: 0.085 Completion: 3.882 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Jamba 1.5 Mini | ai21.jamba-1-5-mini-v1:0 | 256K | 4.1K | Input: $0.2 Output: $0.4 | Model: 0.100 Completion: 2.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2024-08-15 |
| Llama 3 8B Instruct | meta.llama3-8b-instruct-v1:0 | 8.2K | 2K | Input: $0.3 Output: $0.6 | Model: 0.150 Completion: 2.000 | 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Claude Sonnet 3 | anthropic.claude-3-sonnet-20240229-v1:0 | 200K | 4.1K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 📎 🔧 🌡️ | 2023-08 | In: text, image Out: text | Released: 2024-03-04 |
| Llama 4 Maverick 17B Instruct | meta.llama4-maverick-17b-instruct-v1:0 | 1M | 16.4K | Input: $0.24 Output: $0.97 | Model: 0.120 Completion: 4.042 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Qwen3 235B A22B 2507 | qwen.qwen3-235b-a22b-2507-v1:0 | 262.1K | 131.1K | Input: $0.22 Output: $0.88 | Model: 0.110 Completion: 4.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2025-09-18 |
| Nova Lite | amazon.nova-lite-v1:0 | 300K | 8.2K | Input: $0.06 Output: $0.24 Cache Read: $0.015 | Model: 0.030 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
| Claude Haiku 3.5 | anthropic.claude-3-5-haiku-20241022-v1:0 | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-10-22 |
Anthropic¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Claude Opus 4 (latest) | claude-opus-4-0 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Sonnet 3.5 v2 | claude-3-5-sonnet-20241022 | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Opus 4.1 (latest) | claude-opus-4-1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Haiku 4.5 (latest) | claude-haiku-4-5 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Claude Sonnet 3.5 | claude-3-5-sonnet-20240620 | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-06-20 |
| Claude Haiku 3.5 (latest) | claude-3-5-haiku-latest | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Opus 3 | claude-3-opus-20240229 | 200K | 4.1K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-02-29 |
| Claude Sonnet 4.5 (latest) | claude-sonnet-4-5 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Claude Sonnet 4.5 | claude-sonnet-4-5-20250929 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Claude Sonnet 4 | claude-sonnet-4-20250514 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Opus 4 | claude-opus-4-20250514 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Haiku 3.5 | claude-3-5-haiku-20241022 | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Haiku 3 | claude-3-haiku-20240307 | 200K | 4.1K | Input: $0.25 Output: $1.25 Cache Read: $0.03 Cache Write: $0.3 | Model: 0.125 Completion: 5.000 Cache: 0.120 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-03-13 |
| Claude Sonnet 3.7 | claude-3-7-sonnet-20250219 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Sonnet 3.7 (latest) | claude-3-7-sonnet-latest | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Sonnet 4 (latest) | claude-sonnet-4-0 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Opus 4.1 | claude-opus-4-1-20250805 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Sonnet 3 | claude-3-sonnet-20240229 | 200K | 4.1K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $0.3 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-03-04 |
| Claude Haiku 4.5 | claude-haiku-4-5-20251001 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
Azure¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | Input: $0.1 Output: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-4 | gpt-4 | 8.2K | 8.2K | Input: $60 Output: $120 | Model: 30.000 Completion: 2.000 | 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-03-14 |
| Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-11-18 |
| GPT-4 32K | gpt-4-32k | 32.8K | 32.8K | Input: $60 Output: $120 | Model: 30.000 Completion: 2.000 | 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-03-14 |
| GPT-5.1 Codex | gpt-5.1-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-11-18 |
| GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-5 Chat | gpt-5-chat | 128K | 16.4K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 | 2024-10-24 | In: text, image Out: text | Released: 2025-08-07 |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-11-18 |
| GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 16.4K | 16.4K | Input: $0.5 Output: $1.5 | Model: 0.250 Completion: 3.000 | 🌡️ | 2021-08 | In: text Out: text | Released: 2024-01-25 |
| GPT-4 Turbo | gpt-4-turbo | 128K | 4.1K | Input: $10 Output: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-11 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 16.4K | 16.4K | Input: $3 Output: $4 | Model: 1.500 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-06-13 |
| o1-preview | o1-preview | 128K | 32.8K | Input: $16.5 Output: $66 Cache Read: $8.25 | Model: 8.250 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.025 | Model: 0.125 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| o3-mini | o3-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
| GPT-5.1 | gpt-5.1 | 272K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| GPT-5 Nano | gpt-5-nano | 272K | 128K | Input: $0.05 Output: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5-Codex | gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-4o | gpt-4o | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
| GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 4.1K | 4.1K | Input: $1.5 Output: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-03-01 |
| GPT-4.1 | gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
| o4-mini | o4-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| o1 | o1 | 200K | 100K | Input: $15 Output: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
| GPT-5.1 Chat | gpt-5.1-chat | 128K | 16.4K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| GPT-5 Mini | gpt-5-mini | 272K | 128K | Input: $0.25 Output: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| o1-mini | o1-mini | 128K | 65.5K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
| GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 4.1K | 4.1K | Input: $1.5 Output: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-09-21 |
| o3 | o3 | 200K | 100K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| Codex Mini | codex-mini | 200K | 100K | Input: $1.5 Output: $6 Cache Read: $0.375 | Model: 0.750 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-04 | In: text Out: text | Released: 2025-05-16 |
| GPT-4 Turbo Vision | gpt-4-turbo-vision | 128K | 4.1K | Input: $10 Output: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-11 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| GPT-4o mini | gpt-4o-mini | 128K | 16.4K | Input: $0.15 Output: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
| GPT-5 | gpt-5 | 272K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 16.4K | 16.4K | Input: $1 Output: $2 | Model: 0.500 Completion: 2.000 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-11-06 |
Azure Cognitive Services¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 16.4K | 16.4K | Input: $1 Output: $2 | Model: 0.500 Completion: 2.000 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-11-06 |
| GPT-5 | gpt-5 | 272K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-4o mini | gpt-4o-mini | 128K | 16.4K | Input: $0.15 Output: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
| GPT-4 Turbo Vision | gpt-4-turbo-vision | 128K | 4.1K | Input: $10 Output: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-11 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| Codex Mini | codex-mini | 200K | 100K | Input: $1.5 Output: $6 Cache Read: $0.375 | Model: 0.750 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-04 | In: text Out: text | Released: 2025-05-16 |
| o3 | o3 | 200K | 100K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 4.1K | 4.1K | Input: $1.5 Output: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-09-21 |
| o1-mini | o1-mini | 128K | 65.5K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
| GPT-5 Mini | gpt-5-mini | 272K | 128K | Input: $0.25 Output: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5.1 Chat | gpt-5.1-chat | 128K | 16.4K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| o1 | o1 | 200K | 100K | Input: $15 Output: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
| o4-mini | o4-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| GPT-4.1 | gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 4.1K | 4.1K | Input: $1.5 Output: $2 | Model: 0.750 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-03-01 |
| GPT-4o | gpt-4o | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
| GPT-5-Codex | gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-5 Nano | gpt-5-nano | 272K | 128K | Input: $0.05 Output: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5.1 | gpt-5.1 | 272K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| o3-mini | o3-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.025 | Model: 0.125 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| o1-preview | o1-preview | 128K | 32.8K | Input: $16.5 Output: $66 Cache Read: $8.25 | Model: 8.250 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
| GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 16.4K | 16.4K | Input: $3 Output: $4 | Model: 1.500 Completion: 1.333 | 🌡️ | 2021-08 | In: text Out: text | Released: 2023-06-13 |
| GPT-4 Turbo | gpt-4-turbo | 128K | 4.1K | Input: $10 Output: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-11 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 16.4K | 16.4K | Input: $0.5 Output: $1.5 | Model: 0.250 Completion: 3.000 | 🌡️ | 2021-08 | In: text Out: text | Released: 2024-01-25 |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-11-18 |
| GPT-5 Chat | gpt-5-chat | 128K | 16.4K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 | 2024-10-24 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
| Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-11-18 |
| GPT-5.1 Codex | gpt-5.1-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image, audio Out: text, image, audio | Released: 2025-11-14 |
| GPT-4 32K | gpt-4-32k | 32.8K | 32.8K | Input: $60 Output: $120 | Model: 30.000 Completion: 2.000 | 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-03-14 |
| Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-11-18 |
| GPT-4 | gpt-4 | 8.2K | 8.2K | Input: $60 Output: $120 | Model: 30.000 Completion: 2.000 | 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-03-14 |
| GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | Input: $0.1 Output: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-05 | In: text, image Out: text | Released: 2025-04-14 |
Baseten¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 Instruct 0905 | moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 262.1K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2025-08 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 Thinking | moonshotai/Kimi-K2-Thinking | 262.1K | 262.1K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2025-11-06 |
| Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | Input: $0.38 Output: $1.53 | Model: 0.190 Completion: 4.026 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| GLM 4.6 | zai-org/GLM-4.6 | 200K | 200K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🔧 🌡️ | 2025-08 | In: text Out: text | Open Weights Released: 2025-09-16 |
Cerebras¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Qwen 3 235B Instruct | qwen-3-235b-a22b-instruct-2507 | 131K | 32K | Input: $0.6 Output: $1.2 | Model: 0.300 Completion: 2.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-22 |
| Z.AI GLM-4.6 | zai-glm-4.6 | 131.1K | 41K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-05 |
| Qwen 3 Coder 480B | qwen-3-coder-480b | 131K | 32K | Input: $2 Output: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| GPT OSS 120B | gpt-oss-120b | 131.1K | 32.8K | Input: $0.25 Output: $0.69 | Model: 0.125 Completion: 2.760 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Chutes¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Hermes 4 70B | NousResearch/Hermes-4-70B | 131.1K | 131.1K | Input: $0.11 Output: $0.38 | Model: 0.055 Completion: 3.455 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Hermes 4 14B | NousResearch/Hermes-4-14B | 41K | 41K | Input: $0.03 Output: $0.11 | Model: 0.015 Completion: 3.667 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Hermes 4 405B FP8 | NousResearch/Hermes-4-405B-FP8 | 131.1K | 131.1K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| DeepHermes 3 Mistral 24B Preview | NousResearch/DeepHermes-3-Mistral-24B-Preview | 32.8K | 32.8K | Input: $0.15 Output: $0.59 | Model: 0.075 Completion: 3.933 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Dots.Ocr | rednote-hilab/dots.ocr | 131.1K | 131.1K | Input: $0.01 Output: $0.01 | Model: 0.005 Completion: 1.000 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Kimi K2 Instruct 0905 | moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 262.1K | Input: $0.39 Output: $1.9 | Model: 0.195 Completion: 4.872 | 🔧 🌡️ | - | In: text Out: text | Released: 2024-09-05 Updated: 2025-11-08 |
| Kimi K2 Thinking | moonshotai/Kimi-K2-Thinking | 262.1K | 16.4K | Input: $0.55 Output: $2.25 | Model: 0.275 Completion: 4.091 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 Updated: 2025-11-13 |
| MiniMax M2 | MiniMaxAI/MiniMax-M2 | 196.6K | 196.6K | Input: $0.26 Output: $1.02 | Model: 0.130 Completion: 3.923 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-10-27 Updated: 2025-11-08 |
| QwQ 32B ArliAI RpR V1 | ArliAI/QwQ-32B-ArliAI-RpR-v1 | 32.8K | 32.8K | Input: $0.03 Output: $0.11 | Model: 0.015 Completion: 3.667 | 🧠 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| LongCat Flash Chat FP8 | meituan-longcat/LongCat-Flash-Chat-FP8 | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-10 Updated: 2025-11-08 |
| DeepSeek R1T Chimera | tngtech/DeepSeek-R1T-Chimera | 163.8K | 163.8K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-26 Updated: 2025-11-08 |
| DeepSeek TNG R1T2 Chimera | tngtech/DeepSeek-TNG-R1T2-Chimera | 163.8K | 163.8K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-07-08 Updated: 2025-11-08 |
| InternVL3 78B | OpenGVLab/InternVL3-78B | 32.8K | 32.8K | Input: $0.07 Output: $0.26 | Model: 0.035 Completion: 3.714 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| MAI DS R1 FP8 | microsoft/MAI-DS-R1-FP8 | 163.8K | 163.8K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Gpt Oss 20b | openai/gpt-oss-20b | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 131.1K | Input: $0.04 Output: $0.4 | Model: 0.020 Completion: 10.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 Updated: 2025-11-08 |
| Mistral Small 3.1 24B Instruct 2503 | chutesai/Mistral-Small-3.1-24B-Instruct-2503 | 131.1K | 131.1K | Input: $0.05 Output: $0.22 | Model: 0.025 Completion: 4.400 | 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Mistral Small 3.2 24B Instruct (2506) | chutesai/Mistral-Small-3.2-24B-Instruct-2506 | 131.1K | 131.1K | Input: $0.06 Output: $0.18 | Model: 0.030 Completion: 3.000 | 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-06-20 Updated: 2025-11-08 |
| Tongyi DeepResearch 30B A3B | Alibaba-NLP/Tongyi-DeepResearch-30B-A3B | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Mistral Nemo Instruct 2407 | unsloth/Mistral-Nemo-Instruct-2407 | 131.1K | 131.1K | Input: $0.03 Output: $0.11 | Model: 0.015 Completion: 3.667 | 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Gemma 3 4b It | unsloth/gemma-3-4b-it | 96K | 96K | Input: $0 Output: $0 | - | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Mistral Small 24B Instruct 2501 | unsloth/Mistral-Small-24B-Instruct-2501 | 32.8K | 32.8K | Input: $0.05 Output: $0.22 | Model: 0.025 Completion: 4.400 | 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Gemma 3 12b It | unsloth/gemma-3-12b-it | 131.1K | 131.1K | Input: $0.03 Output: $0.1 | Model: 0.015 Completion: 3.333 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Gemma 3 27b It | unsloth/gemma-3-27b-it | 96K | 96K | Input: $0.13 Output: $0.52 | Model: 0.065 Completion: 4.000 | 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 30B A3B | Qwen/Qwen3-30B-A3B | 41K | 41K | Input: $0.06 Output: $0.22 | Model: 0.030 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-11-08 |
| Qwen3 14B | Qwen/Qwen3-14B | 41K | 41K | Input: $0.05 Output: $0.22 | Model: 0.025 Completion: 4.400 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Qwen2.5 VL 32B Instruct | Qwen/Qwen2.5-VL-32B-Instruct | 16.4K | 16.4K | Input: $0.05 Output: $0.22 | Model: 0.025 Completion: 4.400 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 262.1K | Input: $0.08 Output: $0.55 | Model: 0.040 Completion: 6.875 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-11-08 |
| Qwen2.5 Coder 32B Instruct | Qwen/Qwen2.5-Coder-32B-Instruct | 32.8K | 32.8K | Input: $0.04 Output: $0.16 | Model: 0.020 Completion: 4.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Qwen2.5 72B Instruct | Qwen/Qwen2.5-72B-Instruct | 32.8K | 32.8K | Input: $0.07 Output: $0.26 | Model: 0.035 Completion: 3.714 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 Coder 30B A3B Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | 262.1K | 262.1K | Input: $0.06 Output: $0.25 | Model: 0.030 Completion: 4.167 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 Updated: 2025-11-08 |
| Qwen3 235B A22B | Qwen/Qwen3-235B-A22B | 41K | 41K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Qwen2.5 VL 72B Instruct | Qwen/Qwen2.5-VL-72B-Instruct | 32.8K | 32.8K | Input: $0.08 Output: $0.33 | Model: 0.040 Completion: 4.125 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 32B | Qwen/Qwen3-32B | 41K | 41K | Input: $0.05 Output: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 Coder 480B A35B Instruct (FP8) | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262.1K | 262.1K | Input: $0.22 Output: $0.95 | Model: 0.110 Completion: 4.318 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 Updated: 2025-11-08 |
| Qwen3 VL 235B A22B Instruct | Qwen/Qwen3-VL-235B-A22B-Instruct | 262.1K | 262.1K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 VL 235B A22B Thinking | Qwen/Qwen3-VL-235B-A22B-Thinking | 262.1K | 262.1K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-11-08 |
| Qwen3 30B A3B Instruct 2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | 262.1K | 262.1K | Input: $0.08 Output: $0.33 | Model: 0.040 Completion: 4.125 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 Updated: 2025-11-08 |
| Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 262.1K | Input: $0.11 Output: $0.6 | Model: 0.055 Completion: 5.455 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 Updated: 2025-11-08 |
| Qwen3 Next 80B A3B Instruct | Qwen/Qwen3-Next-80B-A3B-Instruct | 262.1K | 262.1K | Input: $0.1 Output: $0.8 | Model: 0.050 Completion: 8.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
| GLM 4.5 | zai-org/GLM-4.5 | 131.1K | 131.1K | Input: $0.35 Output: $1.55 | Model: 0.175 Completion: 4.429 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-10-30 |
| GLM 4.6 | zai-org/GLM-4.6 | 202.8K | 202.8K | Input: $0.4 Output: $1.75 | Model: 0.200 Completion: 4.375 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-10-30 Updated: 2025-11-08 |
| GLM 4.5 Air | zai-org/GLM-4.5-Air | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 Updated: 2025-11-08 |
| DeepSeek R1 | deepseek-ai/DeepSeek-R1 | 163.8K | 163.8K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| DeepSeek R1 0528 Qwen3 8B | deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | 32.8K | 32.8K | Input: $0.02 Output: $0.1 | Model: 0.010 Completion: 5.000 | 🧠 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-29 Updated: 2025-11-08 |
| DeepSeek R1 (0528) | deepseek-ai/DeepSeek-R1-0528 | 163.8K | 163.8K | Input: $0.4 Output: $1.75 | Model: 0.200 Completion: 4.375 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 Updated: 2025-11-08 |
| DeepSeek V3.2 Exp | deepseek-ai/DeepSeek-V3.2-Exp | 163.8K | 163.8K | Input: $0.25 Output: $0.35 | Model: 0.125 Completion: 1.400 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-29 Updated: 2025-11-08 |
| DeepSeek V3.1 Terminus | deepseek-ai/DeepSeek-V3.1-Terminus | 163.8K | 163.8K | Input: $0.23 Output: $0.9 | Model: 0.115 Completion: 3.913 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 Updated: 2025-11-08 |
| DeepSeek V3 | deepseek-ai/DeepSeek-V3 | 163.8K | 163.8K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-08 |
| DeepSeek R1 Distill Llama 70B | deepseek-ai/DeepSeek-R1-Distill-Llama-70B | 131.1K | 131.1K | Input: $0.03 Output: $0.13 | Model: 0.015 Completion: 4.333 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-23 Updated: 2025-11-08 |
| DeepSeek V3.1 | deepseek-ai/DeepSeek-V3.1 | 163.8K | 163.8K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-21 Updated: 2025-11-08 |
| DeepSeek V3 (0324) | deepseek-ai/DeepSeek-V3-0324 | 163.8K | 163.8K | Input: $0.24 Output: $0.84 | Model: 0.120 Completion: 3.500 | 🌡️ | - | In: text Out: text | Released: 2025-08-01 Updated: 2025-11-08 |
Cloudflare Workers AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| hf/thebloke/mistral-7b-instruct-v0.1-awq | mistral-7b-instruct-v0.1-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-27 Updated: 2023-11-09 |
| cf/deepgram/aura-1 | aura-1 | - | - | Input: $0.015 Output: $0.015 | Model: 0.007 Completion: 1.000 | - | - | In: text Out: audio | Open Weights Released: 2025-08-27 Updated: 2025-07-07 |
| hf/mistral/mistral-7b-instruct-v0.2 | mistral-7b-instruct-v0.2 | 3.1K | 3.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-11 Updated: 2025-07-24 |
| cf/tinyllama/tinyllama-1.1b-chat-v1.0 | tinyllama-1.1b-chat-v1.0 | 2K | 2K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-30 Updated: 2024-03-17 |
| cf/qwen/qwen1.5-0.5b-chat | qwen1.5-0.5b-chat | 32K | 32K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-31 Updated: 2024-04-30 |
| cf/meta/llama-3.2-11b-vision-instruct | llama-3.2-11b-vision-instruct | 128K | 128K | Input: $0.049 Output: $0.68 | Model: 0.025 Completion: 13.878 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-09-18 Updated: 2024-12-04 |
| hf/thebloke/llama-2-13b-chat-awq | llama-2-13b-chat-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-19 Updated: 2023-11-09 |
| cf/meta/llama-3.1-8b-instruct-fp8 | llama-3.1-8b-instruct-fp8 | 32K | 32K | Input: $0.15 Output: $0.29 | Model: 0.075 Completion: 1.933 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-25 |
| cf/openai/whisper | whisper | - | - | Input: $0.00045 Output: $0.00045 | Model: 0.000 Completion: 1.000 | - | - | In: audio Out: text | Open Weights Released: 2023-11-07 Updated: 2024-08-12 |
| cf/stabilityai/stable-diffusion-xl-base-1.0 | stable-diffusion-xl-base-1.0 | - | - | Input: $0 Output: $0 | - | - | - | In: text Out: image | Open Weights Released: 2023-07-25 Updated: 2023-10-30 |
| cf/meta/llama-2-7b-chat-fp16 | llama-2-7b-chat-fp16 | 4.1K | 4.1K | Input: $0.56 Output: $6.67 | Model: 0.280 Completion: 11.911 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-07-26 |
| cf/microsoft/resnet-50 | resnet-50 | - | - | Input: $0.0000025 Output: $0 | Model: 0.000 | - | - | In: image Out: text | Open Weights Released: 2022-03-16 Updated: 2024-02-13 |
| cf/runwayml/stable-diffusion-v1-5-inpainting | stable-diffusion-v1-5-inpainting | - | - | Input: $0 Output: $0 | - | - | - | In: text Out: image | Open Weights Released: 2024-02-27 |
| cf/defog/sqlcoder-7b-2 | sqlcoder-7b-2 | 10K | 10K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-05 Updated: 2024-02-12 |
| cf/meta/llama-3-8b-instruct | llama-3-8b-instruct | 8K | 8K | Input: $0.28 Output: $0.83 | Model: 0.140 Completion: 2.964 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-17 Updated: 2025-06-19 |
| cf/meta-llama/llama-2-7b-chat-hf-lora | llama-2-7b-chat-hf-lora | 8.2K | 8.2K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-07-13 Updated: 2024-04-17 |
| cf/meta/llama-3.1-8b-instruct | llama-3.1-8b-instruct | 8K | 8K | Input: $0.28 Output: $0.83 | Model: 0.140 Completion: 2.964 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-18 Updated: 2024-09-25 |
| cf/openchat/openchat-3.5-0106 | openchat-3.5-0106 | 8.2K | 8.2K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-07 Updated: 2024-05-18 |
| hf/thebloke/openhermes-2.5-mistral-7b-awq | openhermes-2.5-mistral-7b-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-02 Updated: 2023-11-09 |
| cf/leonardo/lucid-origin | lucid-origin | - | - | Input: $0.007 Output: $0.007 | Model: 0.004 Completion: 1.000 | - | - | In: text Out: image | Released: 2025-08-25 Updated: 2025-08-05 |
| cf/facebook/bart-large-cnn | bart-large-cnn | - | - | Input: $0 Output: $0 | - | - | - | In: text Out: text | Open Weights Released: 2022-03-02 Updated: 2024-02-13 |
| cf/black-forest-labs/flux-1-schnell | flux-1-schnell | 2K | - | Input: $0.000053 Output: $0.00011 | Model: 0.000 Completion: 2.075 | - | - | In: text Out: image | Open Weights Released: 2024-07-31 Updated: 2024-08-16 |
| cf/deepseek-ai/deepseek-r1-distill-qwen-32b | deepseek-r1-distill-qwen-32b | 80K | 80K | Input: $0.5 Output: $4.88 | Model: 0.250 Completion: 9.760 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-01-20 Updated: 2025-02-24 |
| cf/google/gemma-2b-it-lora | gemma-2b-it-lora | 8.2K | 8.2K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-02 |
| cf/fblgit/una-cybertron-7b-v2-bf16 | una-cybertron-7b-v2-bf16 | 15K | 15K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-02 Updated: 2024-03-08 |
| cf/meta/m2m100-1.2b | m2m100-1.2b | - | - | Input: $0.34 Output: $0.34 | Model: 0.170 Completion: 1.000 | - | - | In: text Out: text | Open Weights Released: 2022-03-02 Updated: 2023-11-16 |
| cf/meta/llama-3.2-3b-instruct | llama-3.2-3b-instruct | 128K | 128K | Input: $0.051 Output: $0.34 | Model: 0.025 Completion: 6.667 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-09-18 Updated: 2024-10-24 |
| cf/qwen/qwen2.5-coder-32b-instruct | qwen2.5-coder-32b-instruct | 32.8K | 32.8K | Input: $0.66 Output: $1 | Model: 0.330 Completion: 1.515 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-11-06 Updated: 2025-01-12 |
| cf/runwayml/stable-diffusion-v1-5-img2img | stable-diffusion-v1-5-img2img | - | - | Input: $0 Output: $0 | - | - | - | In: text Out: image | Open Weights Released: 2024-02-27 |
| cf/google/gemma-7b-it-lora | gemma-7b-it-lora | 3.5K | 3.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-02 |
| cf/qwen/qwen1.5-14b-chat-awq | qwen1.5-14b-chat-awq | 7.5K | 7.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-03 Updated: 2024-04-30 |
| cf/qwen/qwen1.5-1.8b-chat | qwen1.5-1.8b-chat | 32K | 32K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-30 Updated: 2024-04-30 |
| cf/mistralai/mistral-small-3.1-24b-instruct | mistral-small-3.1-24b-instruct | 128K | 128K | Input: $0.35 Output: $0.56 | Model: 0.175 Completion: 1.600 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-11 Updated: 2025-07-28 |
| hf/google/gemma-7b-it | gemma-7b-it | 8.2K | 8.2K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-13 Updated: 2024-08-14 |
| hf/thebloke/llamaguard-7b-awq | llamaguard-7b-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-11 |
| hf/nousresearch/hermes-2-pro-mistral-7b | hermes-2-pro-mistral-7b | 24K | 24K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-03-11 Updated: 2024-09-08 |
| cf/tiiuae/falcon-7b-instruct | falcon-7b-instruct | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-04-25 Updated: 2024-10-12 |
| cf/meta/llama-3.3-70b-instruct-fp8-fast | llama-3.3-70b-instruct-fp8-fast | 24K | 24K | Input: $0.29 Output: $2.25 | Model: 0.145 Completion: 7.759 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-12-06 |
| cf/meta/llama-3-8b-instruct-awq | llama-3-8b-instruct-awq | 8.2K | 8.2K | Input: $0.12 Output: $0.27 | Model: 0.060 Completion: 2.250 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-05-09 |
| cf/leonardo/phoenix-1.0 | phoenix-1.0 | - | - | Input: $0.0058 Output: $0.0058 | Model: 0.003 Completion: 1.000 | - | - | In: text Out: image | Released: 2025-08-25 |
| cf/microsoft/phi-2 | phi-2 | 2K | 2K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-12-13 Updated: 2024-04-29 |
| cf/lykon/dreamshaper-8-lcm | dreamshaper-8-lcm | - | - | Input: $0 Output: $0 | - | 📎 | - | In: text Out: image | Open Weights Released: 2023-12-06 Updated: 2023-12-07 |
| cf/thebloke/discolm-german-7b-v1-awq | discolm-german-7b-v1-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-18 Updated: 2024-01-24 |
| cf/meta/llama-2-7b-chat-int8 | llama-2-7b-chat-int8 | 8.2K | 8.2K | Input: $0.556 Output: $6.667 | Model: 0.278 Completion: 11.991 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-25 |
| cf/meta/llama-3.2-1b-instruct | llama-3.2-1b-instruct | 60K | 60K | Input: $0.027 Output: $0.2 | Model: 0.013 Completion: 7.407 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-09-18 Updated: 2024-10-24 |
| cf/openai/whisper-large-v3-turbo | whisper-large-v3-turbo | - | - | Input: $0.00051 Output: $0.00051 | Model: 0.000 Completion: 1.000 | - | - | In: audio Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
| cf/meta/llama-4-scout-17b-16e-instruct | llama-4-scout-17b-16e-instruct | 131K | 131K | Input: $0.27 Output: $0.85 | Model: 0.135 Completion: 3.148 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-04-02 Updated: 2025-05-23 |
| hf/nexusflow/starling-lm-7b-beta | starling-lm-7b-beta | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-03-19 Updated: 2024-04-03 |
| hf/thebloke/deepseek-coder-6.7b-base-awq | deepseek-coder-6.7b-base-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-05 Updated: 2023-11-09 |
| cf/google/gemma-3-12b-it | gemma-3-12b-it | 80K | 80K | Input: $0.35 Output: $0.56 | Model: 0.175 Completion: 1.600 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-01 Updated: 2025-03-21 |
| cf/meta/llama-guard-3-8b | llama-guard-3-8b | - | - | Input: $0.48 Output: $0.03 | Model: 0.240 Completion: 0.063 | 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-22 Updated: 2024-10-11 |
| hf/thebloke/neural-chat-7b-v3-1-awq | neural-chat-7b-v3-1-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-15 Updated: 2023-11-17 |
| cf/openai/whisper-tiny-en | whisper-tiny-en | - | - | Input: $0 Output: $0 | - | - | - | In: audio Out: text | Open Weights Released: 2022-09-26 Updated: 2024-01-22 |
| cf/bytedance/stable-diffusion-xl-lightning | stable-diffusion-xl-lightning | - | - | Input: $0 Output: $0 | - | - | - | In: text Out: image | Open Weights Released: 2024-02-20 Updated: 2024-04-03 |
| cf/mistral/mistral-7b-instruct-v0.1 | mistral-7b-instruct-v0.1 | 2.8K | 2.8K | Input: $0.11 Output: $0.19 | Model: 0.055 Completion: 1.727 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-09-27 Updated: 2025-07-24 |
| cf/llava-hf/llava-1.5-7b-hf | llava-1.5-7b-hf | - | - | Input: $0 Output: $0 | - | 📎 🌡️ | - | In: image, text Out: text | Open Weights Released: 2023-12-05 Updated: 2025-06-06 |
| cf/openai/gpt-oss-20b | gpt-oss-20b | 128K | 128K | Input: $0.2 Output: $0.3 | Model: 0.100 Completion: 1.500 | - | - | In: text Out: text | Open Weights Released: 2025-08-04 Updated: 2025-08-14 |
| cf/deepseek-ai/deepseek-math-7b-instruct | deepseek-math-7b-instruct | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-05 Updated: 2024-02-06 |
| cf/openai/gpt-oss-120b | gpt-oss-120b | 128K | 128K | Input: $0.35 Output: $0.75 | Model: 0.175 Completion: 2.143 | - | - | In: text Out: text | Open Weights Released: 2025-08-04 Updated: 2025-08-14 |
| cf/myshell-ai/melotts | melotts | - | - | Input: $0.0002 Output: $0 | Model: 0.000 | 📎 | - | In: text Out: audio | Open Weights Released: 2024-07-19 |
| cf/qwen/qwen1.5-7b-chat-awq | qwen1.5-7b-chat-awq | 20K | 20K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-02-03 Updated: 2024-04-30 |
| cf/meta/llama-3.1-8b-instruct-fast | llama-3.1-8b-instruct-fast | 128K | 128K | Input: $0.045 Output: $0.384 | Model: 0.022 Completion: 8.533 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-18 Updated: 2024-09-25 |
| cf/deepgram/nova-3 | nova-3 | - | - | Input: $0.0052 Output: $0.0052 | Model: 0.003 Completion: 1.000 | - | - | In: audio Out: text | Open Weights Released: 2025-06-05 Updated: 2025-07-08 |
| cf/meta/llama-3.1-70b-instruct | llama-3.1-70b-instruct | 24K | 24K | Input: $0.293 Output: $2.253 | Model: 0.146 Completion: 7.689 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-16 Updated: 2024-12-15 |
| cf/qwen/qwq-32b | qwq-32b | 24K | 24K | Input: $0.66 Output: $1 | Model: 0.330 Completion: 1.515 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-05 Updated: 2025-03-11 |
| hf/thebloke/zephyr-7b-beta-awq | zephyr-7b-beta-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-10-27 Updated: 2023-11-09 |
| hf/thebloke/deepseek-coder-6.7b-instruct-awq | deepseek-coder-6.7b-instruct-awq | 4.1K | 4.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2023-11-05 Updated: 2023-11-13 |
| cf/meta/llama-3.1-8b-instruct-awq | llama-3.1-8b-instruct-awq | 8.2K | 8.2K | Input: $0.12 Output: $0.27 | Model: 0.060 Completion: 2.250 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-25 |
| cf/mistral/mistral-7b-instruct-v0.2-lora | mistral-7b-instruct-v0.2-lora | 15K | 15K | Input: $0 Output: $0 | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-04-01 |
| cf/unum/uform-gen2-qwen-500m | uform-gen2-qwen-500m | - | - | Input: $0 Output: $0 | - | - | - | In: image, text Out: text | Open Weights Released: 2024-02-15 Updated: 2024-04-24 |
Cohere¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Command A Translate | command-a-translate-08-2025 | 8K | 8K | Input: $2.5 Output: $10 | Model: 1.250 Completion: 4.000 | 🔧 🌡️ | 2024-06-01 | In: text Out: text | Open Weights Released: 2025-08-28 |
| Command A | command-a-03-2025 | 256K | 8K | Input: $2.5 Output: $10 | Model: 1.250 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-06-01 | In: text Out: text | Open Weights Released: 2025-03-13 |
| Command R | command-r-08-2024 | 128K | 4K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-06-01 | In: text Out: text | Open Weights Released: 2024-08-30 |
| Command R+ | command-r-plus-08-2024 | 128K | 4K | Input: $2.5 Output: $10 | Model: 1.250 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-06-01 | In: text Out: text | Open Weights Released: 2024-08-30 |
| Command R7B | command-r7b-12-2024 | 128K | 4K | Input: $0.0375 Output: $0.15 | Model: 0.019 Completion: 4.000 | 🔧 🌡️ | 2024-06-01 | In: text Out: text | Open Weights Released: 2024-02-27 |
| Command A Reasoning | command-a-reasoning-08-2025 | 256K | 32K | Input: $2.5 Output: $10 | Model: 1.250 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-06-01 | In: text Out: text | Open Weights Released: 2025-08-21 |
| Command A Vision | command-a-vision-07-2025 | 128K | 8K | Input: $2.5 Output: $10 | Model: 1.250 Completion: 4.000 | 🌡️ | 2024-06-01 | In: text, image Out: text | Open Weights Released: 2025-07-31 |
Cortecs¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Nova Pro 1.0 | nova-pro-v1 | 300K | 5K | Input: $1.016 Output: $4.061 | Model: 0.508 Completion: 3.997 | 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-12-03 |
| Claude 4.5 Sonnet | claude-4-5-sonnet | 200K | 200K | Input: $3.259 Output: $16.296 | Model: 1.629 Completion: 5.000 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| DeepSeek V3 0324 | deepseek-v3-0324 | 128K | 128K | Input: $0.551 Output: $1.654 | Model: 0.276 Completion: 3.002 | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-03-24 |
| Kimi K2 Instruct | kimi-k2-instruct | 131K | 131K | Input: $0.551 Output: $2.646 | Model: 0.276 Completion: 4.802 | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-07-11 Updated: 2025-09-05 |
| GPT 4.1 | gpt-4.1 | 1M | 32.8K | Input: $2.354 Output: $9.417 | Model: 1.177 Completion: 4.000 | 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-04-14 |
| Gemini 2.5 Pro | gemini-2.5-pro | 1M | 65.5K | Input: $1.654 Output: $11.024 | Model: 0.827 Completion: 6.665 | 🔧 🌡️ | 2025-01 | In: text, image Out: text | Released: 2025-03-20 Updated: 2025-06-17 |
| GPT Oss 120b | gpt-oss-120b | 128K | 128K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-01 | In: text Out: text | Open Weights Released: 2025-08-05 |
| Qwen3 Coder 480B A35B Instruct | qwen3-coder-480b-a35b-instruct | 262K | 262K | Input: $0.441 Output: $1.984 | Model: 0.221 Completion: 4.499 | 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-07-25 |
| Claude Sonnet 4 | claude-sonnet-4 | 200K | 64K | Input: $3.307 Output: $16.536 | Model: 1.653 Completion: 5.000 | 🔧 🌡️ | 2025-03 | In: text, image Out: text | Released: 2025-05-22 |
| Llama 3.1 405B Instruct | llama-3.1-405b-instruct | 128K | 128K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Qwen3 32B | qwen3-32b | 16.4K | 16.4K | Input: $0.099 Output: $0.33 | Model: 0.050 Completion: 3.333 | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-04-29 |
Deep Infra¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 | moonshotai/Kimi-K2-Instruct | 131.1K | 32.8K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 16.4K | Input: $0.03 Output: $0.14 | Model: 0.015 Completion: 4.667 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 16.4K | Input: $0.05 Output: $0.24 | Model: 0.025 Completion: 4.800 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | Input: $0.4 Output: $1.6 | Model: 0.200 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3 Coder 480B A35B Instruct Turbo | Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | 262.1K | 66.5K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| GLM-4.5 | zai-org/GLM-4.5 | 131.1K | 98.3K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
DeepSeek¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| DeepSeek Chat | deepseek-chat | 128K | 8.2K | Input: $0.28 Output: $0.42 Cache Read: $0.028 | Model: 0.140 Completion: 1.500 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-12-26 Updated: 2025-09-29 |
| DeepSeek Reasoner | deepseek-reasoner | 128K | 128K | Input: $0.28 Output: $0.42 Cache Read: $0.028 | Model: 0.140 Completion: 1.500 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-01-20 Updated: 2025-09-29 |
doubao¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| doubao-seed-1-6-flash | doubao-seed-1-6-flash | 256K | 32K | - | - | 🔧 🌡️ | 2023-10 | In: text, image Out: text | Released: 2025-06-11 Updated: 2025-07-15 |
| doubao-seed-1-6-thinking | doubao-seed-1-6-thinking | 256K | 32K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image Out: text | Released: 2025-06-11 Updated: 2025-07-15 |
| doubao-seed-1-6 | doubao-seed-1-6 | 256K | 32K | - | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image Out: text | Released: 2025-06-11 Updated: 2025-06-15 |
ExampleCorp AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Novus 1 | novus-1 | 128K | 4.1K | Input: $5 Output: $15 Cache Read: $0.075 Cache Write: $0.5 | Model: 2.500 Completion: 3.000 Cache: 0.015 | 📎 🧠 🔧 🌡️ | 2024-07 | In: text, image, audio, video, pdf Out: text, image, audio, video, pdf | Released: 2025-01-20 Updated: 2025-08-21 |
FastRouter¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 | moonshotai/kimi-k2 | 131.1K | 32.8K | Input: $0.55 Output: $2.2 | Model: 0.275 Completion: 4.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
| Grok 4 | x-ai/grok-4 | 256K | 64K | Input: $3 Output: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.0375 | Model: 0.150 Completion: 8.333 Cache: 0.125 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, pdf Out: text | Released: 2025-06-17 |
| GPT-5 Nano | openai/gpt-5-nano | 400K | 128K | Input: $0.05 Output: $0.4 Cache Read: $0.005 | Model: 0.025 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-5 Mini | openai/gpt-5-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.025 | Model: 0.125 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 65.5K | Input: $0.05 Output: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT-5 | openai/gpt-5 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| Qwen3 Coder | qwen/qwen3-coder | 262.1K | 66.5K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Claude Opus 4.1 | anthropic/claude-opus-4.1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Sonnet 4 | anthropic/claude-sonnet-4 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| DeepSeek R1 Distill Llama 70B | deepseek-ai/deepseek-r1-distill-llama-70b | 131.1K | 131.1K | Input: $0.03 Output: $0.14 | Model: 0.015 Completion: 4.667 | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-23 |
Fireworks AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Deepseek R1 05/28 | accounts/fireworks/models/deepseek-r1-0528 | 160K | 16.4K | Input: $3 Output: $8 | Model: 1.500 Completion: 2.667 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
| DeepSeek V3.1 | accounts/fireworks/models/deepseek-v3p1 | 163.8K | 163.8K | Input: $0.56 Output: $1.68 | Model: 0.280 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-08-21 |
| MiniMax-M2 | accounts/fireworks/models/minimax-m2 | 128K | 16.4K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Open Weights Released: 2025-10-27 |
| Deepseek V3 03-24 | accounts/fireworks/models/deepseek-v3-0324 | 160K | 16.4K | Input: $0.9 Output: $0.9 | Model: 0.450 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
| Kimi K2 Instruct | accounts/fireworks/models/kimi-k2-instruct | 128K | 16.4K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
| Qwen3 235B-A22B | accounts/fireworks/models/qwen3-235b-a22b | 128K | 16.4K | Input: $0.22 Output: $0.88 | Model: 0.110 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-29 |
| GPT OSS 20B | accounts/fireworks/models/gpt-oss-20b | 131.1K | 32.8K | Input: $0.05 Output: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT OSS 120B | accounts/fireworks/models/gpt-oss-120b | 131.1K | 32.8K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GLM 4.5 Air | accounts/fireworks/models/glm-4p5-air | 131.1K | 131.1K | Input: $0.22 Output: $0.88 | Model: 0.110 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-08-01 |
| Qwen3 Coder 480B A35B Instruct | accounts/fireworks/models/qwen3-coder-480b-a35b-instruct | 256K | 32.8K | Input: $0.45 Output: $1.8 | Model: 0.225 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-22 |
| GLM 4.5 | accounts/fireworks/models/glm-4p5 | 131.1K | 131.1K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-29 |
GitHub Copilot¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Gemini 2.0 Flash | gemini-2.0-flash-001 | 1M | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video Out: text | Released: 2024-12-11 |
| Claude Opus 4 | claude-opus-4 | 80K | 16K | Input: $0 Output: $0 | - | 📎 🧠 | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Grok Code Fast 1 | grok-code-fast-1 | 128K | 64K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-08 | In: text Out: text | Released: 2025-08-27 |
| GPT-5.1-Codex | gpt-5.1-codex | 128K | 128K | Input: $0 Output: $0 | - | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| Claude Haiku 4.5 | claude-haiku-4.5 | 128K | 16K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Gemini 3 Pro Preview | gemini-3-pro-preview | 128K | 64K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video Out: text | Released: 2025-11-18 |
| Raptor Mini (Preview) | oswe-vscode-prime | 200K | 64K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2025-11-10 |
| Claude Sonnet 3.5 | claude-3.5-sonnet | 90K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2024-10-22 |
| GPT-5.1-Codex-mini | gpt-5.1-codex-mini | 128K | 100K | Input: $0 Output: $0 | - | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| o3-mini | o3-mini | 128K | 65.5K | Input: $0 Output: $0 | - | 🧠 | 2024-10 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
| GPT-5.1 | gpt-5.1 | 128K | 128K | Input: $0 Output: $0 | - | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| GPT-5-Codex | gpt-5-codex | 128K | 128K | Input: $0 Output: $0 | - | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-4o | gpt-4o | 64K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
| GPT-4.1 | gpt-4.1 | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| o4-mini (Preview) | o4-mini | 128K | 65.5K | Input: $0 Output: $0 | - | 🧠 | 2024-10 | In: text Out: text | Released: 2025-04-16 |
| Claude Opus 4.1 | claude-opus-41 | 80K | 16K | Input: $0 Output: $0 | - | 📎 🧠 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| GPT-5-mini | gpt-5-mini | 128K | 64K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-08-13 |
| Claude Sonnet 3.7 | claude-3.7-sonnet | 200K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-02-19 |
| Gemini 2.5 Pro | gemini-2.5-pro | 128K | 64K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text, image, audio, video Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| o3 (Preview) | o3 | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| Claude Sonnet 4 | claude-sonnet-4 | 128K | 16K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| GPT-5 | gpt-5 | 128K | 128K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2025-08-07 |
| Claude Sonnet 3.7 Thinking | claude-3.7-sonnet-thought | 200K | 16.4K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Sonnet 4.5 | claude-sonnet-4.5 | 128K | 16K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-09-29 |
GitHub Models¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| JAIS 30b Chat | core42/jais-30b-chat | 8.2K | 2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2023-08-30 |
| Grok 3 | xai/grok-3 | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-09 |
| Grok 3 Mini | xai/grok-3-mini | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-09 |
| Cohere Command R 08-2024 | cohere/cohere-command-r-08-2024 | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-01 |
| Cohere Command A | cohere/cohere-command-a | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-11-01 |
| Cohere Command R+ 08-2024 | cohere/cohere-command-r-plus-08-2024 | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-01 |
| Cohere Command R | cohere/cohere-command-r | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-03-11 Updated: 2024-08-01 |
| Cohere Command R+ | cohere/cohere-command-r-plus | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-04-04 Updated: 2024-08-01 |
| DeepSeek-R1-0528 | deepseek/deepseek-r1-0528 | 65.5K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2025-05-28 |
| DeepSeek-R1 | deepseek/deepseek-r1 | 65.5K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2025-01-20 |
| DeepSeek-V3-0324 | deepseek/deepseek-v3-0324 | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2025-03-24 |
| Mistral Medium 3 (25.05) | mistral-ai/mistral-medium-2505 | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-09 | In: text, image Out: text | Released: 2025-05-01 |
| Ministral 3B | mistral-ai/ministral-3b | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Open Weights Released: 2024-10-22 |
| Mistral Nemo | mistral-ai/mistral-nemo | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Open Weights Released: 2024-07-18 |
| Mistral Large 24.11 | mistral-ai/mistral-large-2411 | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-09 | In: text Out: text | Released: 2024-11-01 |
| Codestral 25.01 | mistral-ai/codestral-2501 | 32K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2025-01-01 |
| Mistral Small 3.1 | mistral-ai/mistral-small-2503 | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-09 | In: text, image Out: text | Released: 2025-03-01 |
| Phi-3-medium instruct (128k) | microsoft/phi-3-medium-128k-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
| Phi-3-mini instruct (4k) | microsoft/phi-3-mini-4k-instruct | 4.1K | 1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
| Phi-3-small instruct (128k) | microsoft/phi-3-small-128k-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
| Phi-3.5-vision instruct (128k) | microsoft/phi-3.5-vision-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image Out: text | Open Weights Released: 2024-08-20 |
| Phi-4 | microsoft/phi-4 | 16K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
| Phi-4-mini-reasoning | microsoft/phi-4-mini-reasoning | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
| Phi-3-small instruct (8k) | microsoft/phi-3-small-8k-instruct | 8.2K | 2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
| Phi-3.5-mini instruct (128k) | microsoft/phi-3.5-mini-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-08-20 |
| Phi-4-multimodal-instruct | microsoft/phi-4-multimodal-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text, image, audio Out: text | Open Weights Released: 2024-12-11 |
| Phi-3-mini instruct (128k) | microsoft/phi-3-mini-128k-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
| Phi-3.5-MoE instruct (128k) | microsoft/phi-3.5-moe-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-08-20 |
| Phi-4-mini-instruct | microsoft/phi-4-mini-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
| Phi-3-medium instruct (4k) | microsoft/phi-3-medium-4k-instruct | 4.1K | 1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-04-23 |
| Phi-4-Reasoning | microsoft/phi-4-reasoning | 128K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
| MAI-DS-R1 | microsoft/mai-ds-r1 | 65.5K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-06 | In: text Out: text | Released: 2025-01-20 |
| GPT-4.1-nano | openai/gpt-4.1-nano | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-4.1-mini | openai/gpt-4.1-mini | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| OpenAI o1-preview | openai/o1-preview | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 | 2023-10 | In: text Out: text | Released: 2024-09-12 |
| OpenAI o3-mini | openai/o3-mini | 200K | 100K | Input: $0 Output: $0 | - | 🧠 | 2024-04 | In: text Out: text | Released: 2025-01-31 |
| GPT-4o | openai/gpt-4o | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2023-10 | In: text, image, audio Out: text | Released: 2024-05-13 |
| GPT-4.1 | openai/gpt-4.1 | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| OpenAI o4-mini | openai/o4-mini | 200K | 100K | Input: $0 Output: $0 | - | 🧠 | 2024-04 | In: text, image Out: text | Released: 2025-01-31 |
| OpenAI o1 | openai/o1 | 200K | 100K | Input: $0 Output: $0 | - | 🧠 | 2023-10 | In: text, image Out: text | Released: 2024-09-12 Updated: 2024-12-17 |
| OpenAI o1-mini | openai/o1-mini | 128K | 65.5K | Input: $0 Output: $0 | - | 🧠 | 2023-10 | In: text Out: text | Released: 2024-09-12 Updated: 2024-12-17 |
| OpenAI o3 | openai/o3 | 200K | 100K | Input: $0 Output: $0 | - | 🧠 | 2024-04 | In: text, image Out: text | Released: 2025-01-31 |
| GPT-4o mini | openai/gpt-4o-mini | 128K | 16.4K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2023-10 | In: text, image, audio Out: text | Released: 2024-07-18 |
| Llama-3.2-11B-Vision-Instruct | meta/llama-3.2-11b-vision-instruct | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text, image, audio Out: text | Open Weights Released: 2024-09-25 |
| Meta-Llama-3.1-405B-Instruct | meta/meta-llama-3.1-405b-instruct | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Llama 4 Maverick 17B 128E Instruct FP8 | meta/llama-4-maverick-17b-128e-instruct-fp8 | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-31 |
| Meta-Llama-3-70B-Instruct | meta/meta-llama-3-70b-instruct | 8.2K | 2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-04-18 |
| Meta-Llama-3.1-70B-Instruct | meta/meta-llama-3.1-70b-instruct | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Llama-3.3-70B-Instruct | meta/llama-3.3-70b-instruct | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama-3.2-90B-Vision-Instruct | meta/llama-3.2-90b-vision-instruct | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text, image, audio Out: text | Open Weights Released: 2024-09-25 |
| Meta-Llama-3-8B-Instruct | meta/meta-llama-3-8b-instruct | 8.2K | 2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-04-18 |
| Llama 4 Scout 17B 16E Instruct | meta/llama-4-scout-17b-16e-instruct | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-31 |
| Meta-Llama-3.1-8B-Instruct | meta/meta-llama-3.1-8b-instruct | 128K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| AI21 Jamba 1.5 Large | ai21-labs/ai21-jamba-1.5-large | 256K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-29 |
| AI21 Jamba 1.5 Mini | ai21-labs/ai21-jamba-1.5-mini | 256K | 4.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-08-29 |
Google¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| gemini-embedding-001 | gemini-embedding-001 | 2K | 3.1K | Input: $0.15 Output: $0 Cache Read: $0 Cache Write: $0 | Model: 0.075 | 🔧 | 2025-06 | In: text Out: text | Released: 2025-06-01 |
| Gemini 2.5 Flash Image | gemini-2.5-flash-image | 32.8K | 32.8K | Input: $0.3 Output: $30 Cache Read: $0.075 | Model: 0.150 Completion: 100.000 Cache: 0.250 | 📎 🧠 🌡️ | 2025-06 | In: text, image Out: text, image | Released: 2025-08-26 |
| Gemini 2.5 Flash Preview 05-20 | gemini-2.5-flash-preview-05-20 | 1M | 65.5K | Input: $0.15 Output: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-20 |
| Gemini Flash-Lite Latest | gemini-flash-lite-latest | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 3 Pro Preview | gemini-3-pro-preview | 1M | 64K | Input: $2 Output: $12 Cache Read: $0.2 | Model: 1.000 Completion: 6.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, video, audio, pdf Out: text | Released: 2025-11-18 |
| Gemini 2.5 Flash | gemini-2.5-flash | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Input Audio: $1 | Model: 0.500 Completion: 2.500 Cache: 0.075 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| Gemini Flash Latest | gemini-flash-latest | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Input Audio: $1 | Model: 0.500 Completion: 2.500 Cache: 0.075 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Pro Preview 05-06 | gemini-2.5-pro-preview-05-06 | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-06 |
| Gemini 2.5 Flash Preview TTS | gemini-2.5-flash-preview-tts | 8K | 16K | Input: $0.5 Output: $10 | Model: 0.250 Completion: 20.000 | - | 2025-01 | In: text Out: audio | Released: 2025-05-01 |
| Gemini 2.0 Flash Lite | gemini-2.0-flash-lite | 1M | 8.2K | Input: $0.075 Output: $0.3 | Model: 0.037 Completion: 4.000 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemini Live 2.5 Flash Preview Native Audio | gemini-live-2.5-flash-preview-native-audio | 131.1K | 65.5K | Input: $0.5 Output: $2 Input Audio: $3 Output Audio: $12 | Model: 1.500 Completion: 4.000 | 🧠 🔧 | 2025-01 | In: text, audio, video Out: text, audio | Released: 2025-06-17 Updated: 2025-09-18 |
| Gemini 2.0 Flash | gemini-2.0-flash | 1M | 8.2K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Pro Preview 06-05 | gemini-2.5-pro-preview-06-05 | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-05 |
| Gemini Live 2.5 Flash | gemini-live-2.5-flash | 128K | 8K | Input: $0.5 Output: $2 Input Audio: $3 Output Audio: $12 | Model: 1.500 Completion: 4.000 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video Out: text, audio | Released: 2025-09-01 |
| Gemini 2.5 Flash Lite Preview 06-17 | gemini-2.5-flash-lite-preview-06-17 | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 Input Audio: $0.3 | Model: 0.150 Completion: 1.333 Cache: 0.083 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Flash Image (Preview) | gemini-2.5-flash-image-preview | 32.8K | 32.8K | Input: $0.3 Output: $30 Cache Read: $0.075 | Model: 0.150 Completion: 100.000 Cache: 0.250 | 📎 🧠 🌡️ | 2025-06 | In: text, image Out: text, image | Released: 2025-08-26 |
| Gemini 2.5 Flash Preview 09-25 | gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Input Audio: $1 | Model: 0.500 Completion: 2.500 Cache: 0.075 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Flash Preview 04-17 | gemini-2.5-flash-preview-04-17 | 1M | 65.5K | Input: $0.15 Output: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-04-17 |
| Gemini 2.5 Pro Preview TTS | gemini-2.5-pro-preview-tts | 8K | 16K | Input: $1 Output: $20 | Model: 0.500 Completion: 20.000 | - | 2025-01 | In: text Out: audio | Released: 2025-05-01 |
| Gemini 2.5 Pro | gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| Gemini 1.5 Flash | gemini-1.5-flash | 1M | 8.2K | Input: $0.075 Output: $0.3 Cache Read: $0.01875 | Model: 0.037 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text | Released: 2024-05-14 |
| Gemini 1.5 Flash-8B | gemini-1.5-flash-8b | 1M | 8.2K | Input: $0.0375 Output: $0.15 Cache Read: $0.01 | Model: 0.019 Completion: 4.000 Cache: 0.267 | 📎 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text | Released: 2024-10-03 |
| Gemini 2.5 Flash Lite Preview 09-25 | gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 1.5 Pro | gemini-1.5-pro | 1M | 8.2K | Input: $1.25 Output: $5 Cache Read: $0.3125 | Model: 0.625 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image, audio, video Out: text | Released: 2024-02-15 |
Vertex¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Gemini Embedding 001 | gemini-embedding-001 | 2K | 3.1K | Input: $0.15 Output: $0 | Model: 0.075 | - | 2025-05 | In: text Out: text | Released: 2025-05-20 |
| Gemini 2.5 Flash Preview 05-20 | gemini-2.5-flash-preview-05-20 | 1M | 65.5K | Input: $0.15 Output: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-20 |
| Gemini Flash-Lite Latest | gemini-flash-lite-latest | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 3 Pro Preview | gemini-3-pro-preview | 1M | 65.5K | Input: $2 Output: $12 Cache Read: $0.2 | Model: 1.000 Completion: 6.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, video, audio, pdf Out: text | Released: 2025-11-18 |
| Gemini 2.5 Flash | gemini-2.5-flash | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini Flash Latest | gemini-flash-latest | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Pro Preview 05-06 | gemini-2.5-pro-preview-05-06 | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-06 |
| Gemini 2.0 Flash Lite | gemini-2.0-flash-lite | 1M | 8.2K | Input: $0.075 Output: $0.3 | Model: 0.037 Completion: 4.000 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemini 2.0 Flash | gemini-2.0-flash | 1M | 8.2K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemini 2.5 Flash Lite | gemini-2.5-flash-lite | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Pro Preview 06-05 | gemini-2.5-pro-preview-06-05 | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-05 |
| Gemini 2.5 Flash Lite Preview 06-17 | gemini-2.5-flash-lite-preview-06-17 | 65.5K | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Flash Preview 09-25 | gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Flash Preview 04-17 | gemini-2.5-flash-preview-04-17 | 1M | 65.5K | Input: $0.15 Output: $0.6 Cache Read: $0.0375 | Model: 0.075 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-04-17 |
| Gemini 2.5 Pro | gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| Gemini 2.5 Flash Lite Preview 09-25 | gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
Vertex (Anthropic)¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 3.5 v2 | claude-3-5-sonnet@20241022 | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Haiku 3.5 | claude-3-5-haiku@20241022 | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Sonnet 4 | claude-sonnet-4@20250514 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Sonnet 4.5 | claude-sonnet-4-5@20250929 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Claude Opus 4.1 | claude-opus-4-1@20250805 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Haiku 4.5 | claude-haiku-4-5@20251001 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Claude Sonnet 3.7 | claude-3-7-sonnet@20250219 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Opus 4 | claude-opus-4@20250514 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Groq¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Llama 3.1 8B Instant | llama-3.1-8b-instant | 131.1K | 8.2K | Input: $0.05 Output: $0.08 | Model: 0.025 Completion: 1.600 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Mistral Saba 24B | mistral-saba-24b | 32.8K | 32.8K | Input: $0.79 Output: $0.79 | Model: 0.395 Completion: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2025-02-06 |
| Llama 3 8B | llama3-8b-8192 | 8.2K | 8.2K | Input: $0.05 Output: $0.08 | Model: 0.025 Completion: 1.600 | 🔧 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2024-04-18 |
| Qwen QwQ 32B | qwen-qwq-32b | 131.1K | 16.4K | Input: $0.29 Output: $0.39 | Model: 0.145 Completion: 1.345 | 🧠 🔧 🌡️ | 2024-09 | In: text Out: text | Open Weights Released: 2024-11-27 |
| Llama 3 70B | llama3-70b-8192 | 8.2K | 8.2K | Input: $0.59 Output: $0.79 | Model: 0.295 Completion: 1.339 | 🔧 🌡️ | 2023-03 | In: text Out: text | Open Weights Released: 2024-04-18 |
| DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 131.1K | 8.2K | Input: $0.75 Output: $0.99 | Model: 0.375 Completion: 1.320 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 |
| Llama Guard 3 8B | llama-guard-3-8b | 8.2K | 8.2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-23 |
| Gemma 2 9B | gemma2-9b-it | 8.2K | 8.2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2024-06-27 |
| Llama 3.3 70B Versatile | llama-3.3-70b-versatile | 131.1K | 32.8K | Input: $0.59 Output: $0.79 | Model: 0.295 Completion: 1.339 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Kimi K2 Instruct 0905 | moonshotai/kimi-k2-instruct-0905 | 262.1K | 16.4K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 Instruct | moonshotai/kimi-k2-instruct | 131.1K | 16.4K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | Input: $0.1 Output: $0.5 | Model: 0.050 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | Input: $0.15 Output: $0.75 | Model: 0.075 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| Qwen3 32B | qwen/qwen3-32b | 131.1K | 16.4K | Input: $0.29 Output: $0.59 | Model: 0.145 Completion: 2.034 | 🧠 🔧 🌡️ | 2024-11-08 | In: text Out: text | Open Weights Released: 2024-12-23 |
| Llama 4 Scout 17B | meta-llama/llama-4-scout-17b-16e-instruct | 131.1K | 8.2K | Input: $0.11 Output: $0.34 | Model: 0.055 Completion: 3.091 | 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Llama 4 Maverick 17B | meta-llama/llama-4-maverick-17b-128e-instruct | 131.1K | 8.2K | Input: $0.2 Output: $0.6 | Model: 0.100 Completion: 3.000 | 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Llama Guard 4 12B | meta-llama/llama-guard-4-12b | 131.1K | 128 | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-04-05 |
Hugging Face¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi-K2-Instruct | moonshotai/Kimi-K2-Instruct | 131.1K | 16.4K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| Kimi-K2-Instruct-0905 | moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 16.4K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-04 |
| MiniMax-M2 | MiniMaxAI/MiniMax-M2 | 204.8K | 204.8K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-10 | In: text Out: text | Open Weights Released: 2025-10-27 |
| Qwen 3 Embedding 4B | Qwen/Qwen3-Embedding-8B | 32K | 4.1K | Input: $0.01 Output: $0 | Model: 0.005 | - | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Qwen 3 Embedding 4B | Qwen/Qwen3-Embedding-4B | 32K | 2K | Input: $0.01 Output: $0 | Model: 0.005 | - | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Qwen3-Coder-480B-A35B-Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | Input: $2 Output: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | Input: $0.3 Output: $3 | Model: 0.150 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
| Qwen3-Next-80B-A3B-Instruct | Qwen/Qwen3-Next-80B-A3B-Instruct | 262.1K | 66.5K | Input: $0.25 Output: $1 | Model: 0.125 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
| Qwen3-Next-80B-A3B-Thinking | Qwen/Qwen3-Next-80B-A3B-Thinking | 262.1K | 131.1K | Input: $0.3 Output: $2 | Model: 0.150 Completion: 6.667 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
| GLM-4.5 | zai-org/GLM-4.5 | 131.1K | 98.3K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.6 | zai-org/GLM-4.6 | 200K | 128K | Input: $0.6 Output: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| GLM-4.5-Air | zai-org/GLM-4.5-Air | 128K | 96K | Input: $0.2 Output: $1.1 | Model: 0.100 Completion: 5.500 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| DeepSeek-V3-0324 | deepseek-ai/Deepseek-V3-0324 | 16.4K | 8.2K | Input: $1.25 Output: $1.25 | Model: 0.625 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
| DeepSeek-R1-0528 | deepseek-ai/DeepSeek-R1-0528 | 163.8K | 163.8K | Input: $3 Output: $5 | Model: 1.500 Completion: 1.667 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
iFlow¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Qwen3-Coder-480B-A35B | qwen3-coder | 256K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-01 |
| DeepSeek-V3-671B | deepseek-v3 | 128K | 32K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-12-26 |
| Kimi-K2 | kimi-k2 | 128K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-01 |
| DeepSeek-R1 | deepseek-r1 | 128K | 32K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-20 |
| DeepSeek-V3.1-Terminus | deepseek-v3.1 | 128K | 64K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| MiniMax M2 | minimax-m2 | 204.8K | 131.1K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-13 |
| Qwen3-235B-A22B | qwen3-235b | 128K | 32K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-12-01 |
| Kimi-K2-Instruct-0905 | kimi-k2-0905 | 256K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Released: 2025-09-05 |
| Qwen3-235B-A22B-Thinking | qwen3-235b-a22b-thinking-2507 | 256K | 64K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-01 |
| Qwen3-VL-Plus | qwen3-vl-plus | 256K | 32K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Released: 2025-01-01 |
| GLM-4.6 | glm-4.6 | 200K | 128K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-01 Updated: 2025-11-13 |
| TStars-2.0 | tstars2.0 | 128K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-01-01 |
| Qwen3-235B-A22B-Instruct | qwen3-235b-a22b-instruct | 256K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-01 |
| Qwen3-Max | qwen3-max | 256K | 32K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Released: 2025-01-01 |
| DeepSeek-V3.2-Exp | deepseek-v3.2 | 128K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Qwen3-Max-Preview | qwen3-max-preview | 256K | 32K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Released: 2025-01-01 |
| Qwen3-Coder-Plus | qwen3-coder-plus | 256K | 64K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-01 |
| Qwen3-32B | qwen3-32b | 128K | 32K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-12-01 |
Inception¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Mercury Coder | mercury-coder | 128K | 16.4K | Input: $0.25 Output: $1 Cache Read: $0.25 Cache Write: $1 | Model: 0.125 Completion: 4.000 Cache: 1.000 | 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-02-26 Updated: 2025-07-31 |
| Mercury | mercury | 128K | 16.4K | Input: $0.25 Output: $1 Cache Read: $0.25 Cache Write: $1 | Model: 0.125 Completion: 4.000 Cache: 1.000 | 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-06-26 Updated: 2025-07-31 |
Inference¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Mistral Nemo 12B Instruct | mistral/mistral-nemo-12b-instruct | 16K | 4.1K | Input: $0.038 Output: $0.1 | Model: 0.019 Completion: 2.632 | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Google Gemma 3 | google/gemma-3 | 125K | 4.1K | Input: $0.15 Output: $0.3 | Model: 0.075 Completion: 2.000 | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-01 |
| Osmosis Structure 0.6B | osmosis/osmosis-structure-0.6b | 4K | 2K | Input: $0.1 Output: $0.5 | Model: 0.050 Completion: 5.000 | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Qwen 3 Embedding 4B | qwen/qwen3-embedding-4b | 32K | 2K | Input: $0.01 Output: $0 | Model: 0.005 | - | 2024-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Qwen 2.5 7B Vision Instruct | qwen/qwen-2.5-7b-vision-instruct | 125K | 4.1K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-01 |
| Llama 3.2 11B Vision Instruct | meta/llama-3.2-11b-vision-instruct | 16K | 4.1K | Input: $0.055 Output: $0.055 | Model: 0.028 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2025-01-01 |
| Llama 3.1 8B Instruct | meta/llama-3.1-8b-instruct | 16K | 4.1K | Input: $0.025 Output: $0.025 | Model: 0.013 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Llama 3.2 3B Instruct | meta/llama-3.2-3b-instruct | 16K | 4.1K | Input: $0.02 Output: $0.02 | Model: 0.010 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Llama 3.2 1B Instruct | meta/llama-3.2-1b-instruct | 16K | 4.1K | Input: $0.01 Output: $0.01 | Model: 0.005 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
Llama¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Llama-3.3-8B-Instruct | llama-3.3-8b-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama-4-Maverick-17B-128E-Instruct-FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama-4-Scout-17B-16E-Instruct-FP8 | llama-4-scout-17b-16e-instruct-fp8 | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Groq-Llama-4-Maverick-17B-128E-Instruct | groq-llama-4-maverick-17b-128e-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-04-05 |
| Cerebras-Llama-4-Scout-17B-16E-Instruct | cerebras-llama-4-scout-17b-16e-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-04-05 |
| Cerebras-Llama-4-Maverick-17B-128E-Instruct | cerebras-llama-4-maverick-17b-128e-instruct | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-04-05 |
LMStudio¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| Qwen3 30B A3B 2507 | qwen/qwen3-30b-a3b-2507 | 262.1K | 16.4K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
| Qwen3 Coder 30B | qwen/qwen3-coder-30b | 262.1K | 65.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
LucidQuery AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| LucidQuery Nexus Coder | lucidquery-nexus-coder | 250K | 60K | Input: $2 Output: $5 | Model: 1.000 Completion: 2.500 | 📎 🧠 🔧 | 2025-08-01 | In: text Out: text | Released: 2025-09-01 |
| LucidNova RF1 100B | lucidnova-rf1-100b | 120K | 8K | Input: $2 Output: $5 | Model: 1.000 Completion: 2.500 | 📎 🧠 🔧 | 2025-09-16 | In: text Out: text | Released: 2024-12-28 Updated: 2025-09-10 |
Minimax¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Minimax-M2 | MiniMax-M2 | 196.6K | 128K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-10-27 |
Mistral¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Devstral Medium | devstral-medium-2507 | 128K | 128K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
| Mixtral 8x22B | open-mixtral-8x22b | 64K | 64K | Input: $2 Output: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-04-17 |
| Ministral 8B | ministral-8b-latest | 128K | 128K | Input: $0.1 Output: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
| Pixtral Large | pixtral-large-latest | 128K | 128K | Input: $2 Output: $6 | Model: 1.000 Completion: 3.000 | 📎 🔧 🌡️ | 2024-11 | In: text, image Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
| Ministral 3B | ministral-3b-latest | 128K | 128K | Input: $0.04 Output: $0.04 | Model: 0.020 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
| Pixtral 12B | pixtral-12b | 128K | 128K | Input: $0.15 Output: $0.15 | Model: 0.075 Completion: 1.000 | 📎 🔧 🌡️ | 2024-09 | In: text, image Out: text | Open Weights Released: 2024-09-01 |
| Mistral Medium 3 | mistral-medium-2505 | 131.1K | 131.1K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-05-07 |
| Devstral Small 2505 | devstral-small-2505 | 128K | 128K | Input: $0.1 Output: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-07 |
| Mistral Medium 3.1 | mistral-medium-2508 | 262.1K | 262.1K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-08-12 |
| Mistral Small | mistral-small-latest | 128K | 16.4K | Input: $0.1 Output: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-03 | In: text, image Out: text | Open Weights Released: 2024-09-01 Updated: 2024-09-04 |
| Magistral Small | magistral-small | 128K | 128K | Input: $0.5 Output: $1.5 | Model: 0.250 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 |
| Devstral Small | devstral-small-2507 | 128K | 128K | Input: $0.1 Output: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
| Codestral | codestral-latest | 256K | 4.1K | Input: $0.3 Output: $0.9 | Model: 0.150 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-05-29 Updated: 2025-01-04 |
| Mixtral 8x7B | open-mixtral-8x7b | 32K | 32K | Input: $0.7 Output: $0.7 | Model: 0.350 Completion: 1.000 | 🔧 🌡️ | 2024-01 | In: text Out: text | Open Weights Released: 2023-12-11 |
| Mistral Nemo | mistral-nemo | 128K | 128K | Input: $0.15 Output: $0.15 | Model: 0.075 Completion: 1.000 | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2024-07-01 |
| Mistral 7B | open-mistral-7b | 8K | 8K | Input: $0.25 Output: $0.25 | Model: 0.125 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2023-09-27 |
| Mistral Large | mistral-large-latest | 131.1K | 16.4K | Input: $2 Output: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-11 | In: text Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
| Mistral Medium | mistral-medium-latest | 128K | 16.4K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 🔧 🌡️ | 2025-05 | In: text, image Out: text | Open Weights Released: 2025-05-07 Updated: 2025-05-10 |
| Magistral Medium | magistral-medium-latest | 128K | 16.4K | Input: $2 Output: $5 | Model: 1.000 Completion: 2.500 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 Updated: 2025-03-20 |
ModelScope¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GLM-4.5 | ZhipuAI/GLM-4.5 | 131.1K | 98.3K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.6 | ZhipuAI/GLM-4.6 | 202.8K | 98.3K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-30 |
| Qwen3 30B A3B Thinking 2507 | Qwen/Qwen3-30B-A3B-Thinking-2507 | 262.1K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
| Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
| Qwen3 Coder 30B A3B Instruct | Qwen/Qwen3-Coder-30B-A3B-Instruct | 262.1K | 65.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-31 |
| Qwen3 30B A3B Instruct 2507 | Qwen/Qwen3-30B-A3B-Instruct-2507 | 262.1K | 16.4K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-30 |
| Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
Moonshot AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 Thinking Turbo | kimi-k2-thinking-turbo | 262.1K | 262.1K | Input: $1.15 Output: $8 Cache Read: $0.15 | Model: 0.575 Completion: 6.957 Cache: 0.130 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2025-11-06 |
| Kimi K2 Turbo | kimi-k2-turbo-preview | 262.1K | 262.1K | Input: $2.4 Output: $10 Cache Read: $0.6 | Model: 1.200 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 0711 | kimi-k2-0711-preview | 131.1K | 16.4K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| Kimi K2 Thinking | kimi-k2-thinking | 262.1K | 262.1K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2025-11-06 |
| Kimi K2 0905 | kimi-k2-0905-preview | 262.1K | 262.1K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Moonshot AI (China)¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 Thinking Turbo | kimi-k2-thinking-turbo | 262.1K | 262.1K | Input: $1.15 Output: $8 Cache Read: $0.15 | Model: 0.575 Completion: 6.957 Cache: 0.130 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2025-11-06 |
| Kimi K2 Thinking | kimi-k2-thinking | 262.1K | 262.1K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2025-11-06 |
| Kimi K2 0905 | kimi-k2-0905-preview | 262.1K | 262.1K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 0711 | kimi-k2-0711-preview | 131.1K | 16.4K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| Kimi K2 Turbo | kimi-k2-turbo-preview | 262.1K | 262.1K | Input: $2.4 Output: $10 Cache Read: $0.6 | Model: 1.200 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
Morph¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Morph v3 Large | morph-v3-large | 32K | 32K | Input: $0.9 Output: $1.9 | Model: 0.450 Completion: 2.111 | - | - | In: text Out: text | Released: 2024-08-15 |
| Auto | auto | 32K | 32K | Input: $0.85 Output: $1.55 | Model: 0.425 Completion: 1.824 | - | - | In: text Out: text | Released: 2024-06-01 |
| Morph v3 Fast | morph-v3-fast | 16K | 16K | Input: $0.8 Output: $1.2 | Model: 0.400 Completion: 1.500 | - | - | In: text Out: text | Released: 2024-08-15 |
Nebius Token Factory¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Hermes 4 70B | NousResearch/hermes-4-70b | 131.1K | 8.2K | Input: $0.13 Output: $0.4 | Model: 0.065 Completion: 3.077 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-08-01 Updated: 2025-10-04 |
| Hermes-4 405B | NousResearch/hermes-4-405b | 131.1K | 8.2K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-08-01 Updated: 2025-10-04 |
| Kimi K2 Instruct | moonshotai/kimi-k2-instruct | 131.1K | 8.2K | Input: $0.5 Output: $2.4 | Model: 0.250 Completion: 4.800 | 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2025-01-01 Updated: 2025-10-04 |
| Llama 3.1 Nemotron Ultra 253B v1 | nvidia/llama-3_1-nemotron-ultra-253b-v1 | 131.1K | 8.2K | Input: $0.6 Output: $1.8 | Model: 0.300 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-07-01 Updated: 2025-10-04 |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 8.2K | Input: $0.05 Output: $0.2 | Model: 0.025 Completion: 4.000 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-10-04 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 8.2K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-10-04 |
| Qwen3 235B A22B Instruct 2507 | qwen/qwen3-235b-a22b-instruct-2507 | 262.1K | 8.2K | Input: $0.2 Output: $0.6 | Model: 0.100 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-25 Updated: 2025-10-04 |
| Qwen3 235B A22B Thinking 2507 | qwen/qwen3-235b-a22b-thinking-2507 | 262.1K | 8.2K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-25 Updated: 2025-10-04 |
| Qwen3 Coder 480B A35B Instruct | qwen/qwen3-coder-480b-a35b-instruct | 262.1K | 66.5K | Input: $0.4 Output: $1.8 | Model: 0.200 Completion: 4.500 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-23 Updated: 2025-10-04 |
| Llama 3.1 405B Instruct | meta-llama/llama-3_1-405b-instruct | 131.1K | 8.2K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-03 | In: text Out: text | Released: 2024-07-23 Updated: 2025-10-04 |
| Llama-3.3-70B-Instruct (Fast) | meta-llama/llama-3.3-70b-instruct-fast | 131.1K | 8.2K | Input: $0.25 Output: $0.75 | Model: 0.125 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-22 Updated: 2025-10-04 |
| Llama-3.3-70B-Instruct (Base) | meta-llama/llama-3.3-70b-instruct-base | 131.1K | 8.2K | Input: $0.13 Output: $0.4 | Model: 0.065 Completion: 3.077 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-22 Updated: 2025-10-04 |
| GLM 4.5 | zai-org/glm-4.5 | 131.1K | 8.2K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2024-05 | In: text Out: text | Released: 2024-06-01 Updated: 2025-10-04 |
| GLM 4.5 Air | zai-org/glm-4.5-air | 131.1K | 8.2K | Input: $0.2 Output: $1.2 | Model: 0.100 Completion: 6.000 | 🧠 🔧 🌡️ | 2024-05 | In: text Out: text | Released: 2024-06-01 Updated: 2025-10-04 |
| DeepSeek V3 | deepseek-ai/deepseek-v3 | 131.1K | 8.2K | Input: $0.5 Output: $1.5 | Model: 0.250 Completion: 3.000 | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Released: 2024-05-07 Updated: 2025-10-04 |
Nvidia¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 0905 | moonshotai/kimi-k2-instruct-0905 | 262.1K | 262.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 Instruct | moonshotai/kimi-k2-instruct | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-01 | In: text Out: text | Released: 2025-01-01 Updated: 2025-09-05 |
| nvidia-nemotron-nano-9b-v2 | nvidia/nvidia-nemotron-nano-9b-v2 | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-09 | In: text Out: text | Open Weights Released: 2025-08-18 |
| Cosmos Nemotron 34B | nvidia/cosmos-nemotron-34b | 131.1K | 8.2K | Input: $0 Output: $0 | - | 🧠 🌡️ | 2024-01 | In: text, image, video Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
| Llama Embed Nemotron 8B | nvidia/llama-embed-nemotron-8b | 32.8K | 2K | Input: $0 Output: $0 | - | - | 2025-03 | In: text Out: text | Released: 2025-03-18 |
| Parakeet TDT 0.6B v2 | nvidia/parakeet-tdt-0.6b-v2 | - | 4.1K | Input: $0 Output: $0 | - | - | 2024-01 | In: audio Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
| NeMo Retriever OCR v1 | nvidia/nemoretriever-ocr-v1 | - | 4.1K | Input: $0 Output: $0 | - | - | 2024-01 | In: image Out: text | Released: 2024-01-01 Updated: 2025-09-05 |
| Llama-3.1-Nemotron-Ultra-253B-v1 | nvidia/llama-3.1-nemotron-ultra-253b-v1 | 131.1K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2024-07-01 Updated: 2025-09-05 |
| MiniMax-M2 | minimaxai/minimax-m2 | 128K | 16.4K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-10-27 Updated: 2025-10-31 |
| Gemma-3-27B-IT | google/gemma-3-27b-it | 131.1K | 8.2K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
| Phi-4-Mini | microsoft/phi-4-mini-instruct | 131.1K | 8.2K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2024-12 | In: text, image, audio Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
| Whisper Large v3 | openai/whisper-large-v3 | - | 4.1K | Input: $0 Output: $0 | - | - | 2023-09 | In: audio Out: text | Open Weights Released: 2023-09-01 Updated: 2025-09-05 |
| GPT-OSS-120B | openai/gpt-oss-120b | 128K | 8.2K | Input: $0 Output: $0 | - | 📎 🧠 🌡️ | 2025-08 | In: text Out: text | Released: 2025-08-04 Updated: 2025-08-14 |
| Qwen3-Next-80B-A3B-Instruct | qwen/qwen3-next-80b-a3b-instruct | 262.1K | 16.4K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
| Qwen3-235B-A22B | qwen/qwen3-235b-a22b | 131.1K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-12 | In: text Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
| Qwen3 Coder 480B A35B Instruct | qwen/qwen3-coder-480b-a35b-instruct | 262.1K | 66.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-07-23 |
| Qwen3-Next-80B-A3B-Thinking | qwen/qwen3-next-80b-a3b-thinking | 262.1K | 16.4K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2024-12-01 Updated: 2025-09-05 |
| DeepSeek V3.1 Terminus | deepseek-ai/deepseek-v3.1-terminus | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Released: 2025-09-22 |
| DeepSeek V3.1 | deepseek-ai/deepseek-v3.1 | 128K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-08-20 Updated: 2025-08-26 |
| FLUX.1-dev | black-forest-labs/flux.1-dev | 4.1K | - | Input: $0 Output: $0 | - | 🌡️ | 2024-08 | In: text Out: image | Released: 2024-08-01 Updated: 2025-09-05 |
Ollama Cloud¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 | kimi-k2 | 256K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-05 |
| Qwen3-VL 235B Instruct | qwen3-vl-235b-instruct | 200K | 8.2K | - | - | 📎 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-09-22 |
| Gemini 3 Pro Preview | gemini-3-pro-preview | 1M | 64K | - | - | 📎 🔧 🌡️ | - | In: text, image, audio, video Out: text | Released: 2025-11-18 |
| MiniMax M2 | minimax-m2 | 200K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-10-27 |
| Kimi K2 Thinking | kimi-k2-thinking | 256K | 8.2K | - | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-06 |
| Qwen3 Coder 480B | qwen3-coder-480b | 200K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-22 |
| GLM-4.6 | glm-4.6 | 200K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-09-29 |
| DeepSeek-V3.1 671B | deepseek-v3.1-671b | 160K | 8.2K | - | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-21 |
| Cogito 2.1 671B | cogito-2.1-671b | 160K | 8.2K | - | - | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-11-19 |
| GPT-OSS 120B | gpt-oss-120b | 200K | 8.2K | - | - | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
OpenAI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GPT-4.1 nano | gpt-4.1-nano | 1M | 32.8K | Input: $0.1 Output: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| TEXT-EMBEDDING-3-SMALL | text-embedding-3-small | 32K | 1K | Input: $4 Output: $8 Cache Read: $0.04 Cache Write: $0.3 | Model: 2.000 Completion: 2.000 Cache: 0.010 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: vector | Released: 2023-11-10 Updated: 2023-10-01 |
| GPT-4 | gpt-4 | 8.2K | 8.2K | Input: $30 Output: $60 | Model: 15.000 Completion: 2.000 | 📎 🔧 🌡️ | 2023-11 | In: text Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| o1-pro | o1-pro | 200K | 100K | Input: $150 Output: $600 | Model: 75.000 Completion: 4.000 | 📎 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2025-03-19 |
| GPT-4o (2024-05-13) | gpt-4o-2024-05-13 | 128K | 4.1K | Input: $5 Output: $15 | Model: 2.500 Completion: 3.000 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 |
| GPT-5.1 Codex | gpt-5.1-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text, image | Released: 2025-11-13 |
| GPT-4o (2024-08-06) | gpt-4o-2024-08-06 | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-08-06 |
| GPT-4.1 mini | gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| o3-deep-research | o3-deep-research | 200K | 100K | Input: $10 Output: $40 Cache Read: $2.5 | Model: 5.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2024-06-26 |
| GPT-3.5-turbo | gpt-3.5-turbo | 16.4K | 4.1K | Input: $0.5 Output: $1.5 Cache Read: $1.25 | Model: 0.250 Completion: 3.000 Cache: 2.500 | 🌡️ | 2021-09-01 | In: text Out: text | Released: 2023-03-01 Updated: 2023-11-06 |
| TEXT-EMBEDDING-3-LARGE | text-embedding-3-large | 64K | 2K | Input: $7 Output: $10 Cache Read: $0.05 Cache Write: $0.4 | Model: 3.500 Completion: 1.429 Cache: 0.007 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: vector | Released: 2023-12-15 Updated: 2023-10-01 |
| GPT-4 Turbo | gpt-4-turbo | 128K | 4.1K | Input: $10 Output: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| o1-preview | o1-preview | 128K | 32.8K | Input: $15 Output: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 🧠 🌡️ | 2023-09 | In: text Out: text | Released: 2024-09-12 |
| GPT-5.1 Codex mini | gpt-5.1-codex-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.025 | Model: 0.125 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text, image | Released: 2025-11-13 |
| o3-mini | o3-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
| GPT-5.1 | gpt-5.1 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| Codex Mini | codex-mini-latest | 200K | 100K | Input: $1.5 Output: $6 Cache Read: $0.375 | Model: 0.750 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-04 | In: text Out: text | Released: 2025-05-16 |
| GPT-5 Nano | gpt-5-nano | 400K | 128K | Input: $0.05 Output: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5-Codex | gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-4o | gpt-4o | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 Updated: 2024-08-06 |
| GPT-4.1 | gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| o4-mini | o4-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| o1 | o1 | 200K | 100K | Input: $15 Output: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
| GPT-5 Mini | gpt-5-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| o1-mini | o1-mini | 128K | 65.5K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 | 2023-09 | In: text Out: text | Released: 2024-09-12 |
| TEXT-EMBEDDING-ADA-002 | text-embedding-ada-002 | 60K | 1.5K | Input: $6 Output: $12 Cache Read: $0.06 Cache Write: $0.45 | Model: 3.000 Completion: 2.000 Cache: 0.010 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: vector | Released: 2023-11-20 Updated: 2023-10-01 |
| o3-pro | o3-pro | 200K | 100K | Input: $20 Output: $80 | Model: 10.000 Completion: 4.000 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-06-10 |
| GPT-4o (2024-11-20) | gpt-4o-2024-11-20 | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-11-20 |
| o3 | o3 | 200K | 100K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| o4-mini-deep-research | o4-mini-deep-research | 200K | 100K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2024-06-26 |
| GPT-5 Chat (latest) | gpt-5-chat-latest | 400K | 128K | Input: $1.25 Output: $10 | Model: 0.625 Completion: 8.000 | 📎 🧠 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-4o mini | gpt-4o-mini | 128K | 16.4K | Input: $0.15 Output: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
| GPT-5 | gpt-5 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5 Pro | gpt-5-pro | 400K | 272K | Input: $15 Output: $120 | Model: 7.500 Completion: 8.000 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-10-06 |
| GPT-5.1 Chat | gpt-5.1-chat-latest | 128K | 16.4K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| DALL-E 2 | dall-e-2 | 1K | 1 | Input: $0.02 Output: $0.1 Cache Read: $0.01 Cache Write: $0.05 | Model: 0.010 Completion: 5.000 Cache: 0.500 | 📎 🔧 | 2021-04 | In: text Out: image | Released: 2022-04-06 Updated: 2022-06-15 |
| DALL-E 3 | dall-e-3 | 2K | 1 | Input: $0.03 Output: $0.15 Cache Read: $0.01 Cache Write: $0.05 | Model: 0.015 Completion: 5.000 Cache: 0.333 | 📎 🔧 | 2024-04 | In: text Out: image | Released: 2024-03-01 Updated: 2024-08-15 |
| GPT-IMAGE-1 | gpt-image-1 | 1K | 512 | Input: $10 Output: $20 Cache Read: $0.1 Cache Write: $0.6 | Model: 5.000 Completion: 2.000 Cache: 0.010 | 📎 🧠 🔧 🌡️ | 2023-10 | In: text Out: image | Open Weights Released: 2024-01-15 Updated: 2024-10-01 |
OpenCode Zen¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Qwen3 Coder | qwen3-coder | 262.1K | 65.5K | Input: $0.45 Output: $1.8 | Model: 0.225 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Claude Opus 4.1 | claude-opus-4-1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 Reasoning: $75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Kimi K2 | kimi-k2 | 262.1K | 262.1K | Input: $0.6 Output: $2.5 Cache Read: $0.36 | Model: 0.300 Completion: 4.167 Cache: 0.600 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| GPT-5.1 Codex | gpt-5.1-codex | 400K | 128K | Input: $1.07 Output: $8.5 Cache Read: $0.107 | Model: 0.535 Completion: 7.944 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text, image | Released: 2025-11-12 |
| Claude Haiku 4.5 | claude-haiku-4-5 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Gemini 3 Pro | gemini-3-pro | 1M | 64K | Input: $2 Output: $12 Cache Read: $0.2 | Model: 1.000 Completion: 6.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, video, audio, pdf Out: text | Released: 2025-11-18 |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 1M | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Code GD4 (alpha) | alpha-gd4 | 200K | 128K | Input: $0.5 Output: $2 Cache Read: $0.15 | Model: 0.250 Completion: 4.000 Cache: 0.300 | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Kimi K2 Thinking (alpha) | alpha-kimi-k2-thinking | 262.1K | 262.1K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| GPT-5.1 | gpt-5.1 | 400K | 128K | Input: $1.07 Output: $8.5 Cache Read: $0.107 | Model: 0.535 Completion: 7.944 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text, image | Released: 2025-11-12 |
| MiniMax M2 (alpha) | alpha-minimax-m2 | 204.8K | 131.1K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-10-27 |
| GPT-5 Nano | gpt-5-nano | 400K | 128K | Input: $0 Output: $0 Cache Read: $0 | - | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5 Codex | gpt-5-codex | 400K | 128K | Input: $1.07 Output: $8.5 Cache Read: $0.107 | Model: 0.535 Completion: 7.944 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| Big Pickle | big-pickle | 200K | 128K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Released: 2025-10-17 |
| Claude Haiku 3.5 | claude-3-5-haiku | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
| GLM-4.6 | glm-4.6 | 204.8K | 131.1K | Input: $0.6 Output: $2.2 Cache Read: $0.1 | Model: 0.300 Completion: 3.667 Cache: 0.167 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| Grok Code Fast 1 | grok-code | 256K | 256K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 📎 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-20 |
| Doubao Seed Code (alpha) | alpha-doubao-seed-code | 256K | 32K | Input: $0 Output: $0 Cache Read: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Open Weights Released: 2025-11-11 |
| Claude Sonnet 4 | claude-sonnet-4 | 1M | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| GPT-5 | gpt-5 | 400K | 128K | Input: $1.07 Output: $8.5 Cache Read: $0.107 | Model: 0.535 Completion: 7.944 Cache: 0.100 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
OpenRouter¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 | moonshotai/kimi-k2 | 131.1K | 32.8K | Input: $0.55 Output: $2.2 | Model: 0.275 Completion: 4.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
| Kimi K2 Instruct 0905 | moonshotai/kimi-k2-0905 | 262.1K | 16.4K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi Dev 72b (free) | moonshotai/kimi-dev-72b:free | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-06-16 |
| Kimi K2 Thinking | moonshotai/kimi-k2-thinking | 262.1K | 262.1K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-08 | In: text Out: text | Open Weights Released: 2025-11-06 |
| Kimi K2 Instruct 0905 (exacto) | moonshotai/kimi-k2-0905:exacto | 262.1K | 16.4K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 (free) | moonshotai/kimi-k2:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-11 |
| GLM Z1 32B (free) | thudm/glm-z1-32b:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-17 |
| Hermes 4 70B | nousresearch/hermes-4-70b | 131.1K | 131.1K | Input: $0.13 Output: $0.4 | Model: 0.065 Completion: 3.077 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-08-25 |
| Hermes 4 405B | nousresearch/hermes-4-405b | 131.1K | 131.1K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-08-25 |
| DeepHermes 3 Llama 3 8B Preview | nousresearch/deephermes-3-llama-3-8b-preview | 131.1K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2025-02-28 |
| nvidia-nemotron-nano-9b-v2 | nvidia/nemotron-nano-9b-v2 | 131.1K | 131.1K | Input: $0.04 Output: $0.16 | Model: 0.020 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-09 | In: text Out: text | Open Weights Released: 2025-08-18 |
| Grok 4 | x-ai/grok-4 | 256K | 64K | Input: $3 Output: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
| Grok Code Fast 1 | x-ai/grok-code-fast-1 | 256K | 10K | Input: $0.2 Output: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2025-08 | In: text Out: text | Released: 2025-08-26 |
| Grok 3 | x-ai/grok-3 | 131.1K | 8.2K | Input: $3 Output: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4 Fast | x-ai/grok-4-fast | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 Cache Write: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text, image Out: text | Released: 2025-08-19 |
| Grok 3 Beta | x-ai/grok-3-beta | 131.1K | 8.2K | Input: $3 Output: $15 Cache Read: $0.75 Cache Write: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 3 Mini Beta | x-ai/grok-3-mini-beta | 131.1K | 8.2K | Input: $0.3 Output: $0.5 Cache Read: $0.075 Cache Write: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 3 Mini | x-ai/grok-3-mini | 131.1K | 8.2K | Input: $0.3 Output: $0.5 Cache Read: $0.075 Cache Write: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4.1 Fast | x-ai/grok-4.1-fast | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 Cache Write: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text, image Out: text | Released: 2025-11-19 |
| Kat Coder Pro (free) | kwaipilot/kat-coder-pro:free | 256K | 65.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-11 | In: text Out: text | Released: 2025-11-10 |
| Dolphin3.0 Mistral 24B | cognitivecomputations/dolphin3.0-mistral-24b | 32.8K | 8.2K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-02-13 |
| Dolphin3.0 R1 Mistral 24B | cognitivecomputations/dolphin3.0-r1-mistral-24b | 32.8K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-02-13 |
| DeepSeek-V3.1 | deepseek/deepseek-chat-v3.1 | 163.8K | 163.8K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-08-21 |
| R1 (free) | deepseek/deepseek-r1:free | 163.8K | 163.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-01-20 |
| DeepSeek V3 Base (free) | deepseek/deepseek-v3-base:free | 163.8K | 163.8K | Input: $0 Output: $0 | - | 🌡️ | 2025-03 | In: text Out: text | Open Weights Released: 2025-03-29 |
| DeepSeek V3.1 Terminus | deepseek/deepseek-v3.1-terminus | 131.1K | 65.5K | Input: $0.27 Output: $1 | Model: 0.135 Completion: 3.704 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 |
| Deepseek R1 0528 Qwen3 8B (free) | deepseek/deepseek-r1-0528-qwen3-8b:free | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-29 |
| DeepSeek V3 0324 | deepseek/deepseek-chat-v3-0324 | 16.4K | 8.2K | Input: $0 Output: $0 | - | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
| R1 0528 (free) | deepseek/deepseek-r1-0528:free | 163.8K | 163.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
| DeepSeek R1 Distill Llama 70B | deepseek/deepseek-r1-distill-llama-70b | 8.2K | 8.2K | Input: $0 Output: $0 | - | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-23 |
| DeepSeek R1 Distill Qwen 14B | deepseek/deepseek-r1-distill-qwen-14b | 64K | 8.2K | Input: $0 Output: $0 | - | 🧠 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-29 |
| DeepSeek V3.1 Terminus (exacto) | deepseek/deepseek-v3.1-terminus:exacto | 131.1K | 65.5K | Input: $0.27 Output: $1 | Model: 0.135 Completion: 3.704 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 |
| Qwerky 72B | featherless/qwerky-72b | 32.8K | 8.2K | Input: $0 Output: $0 | - | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-20 |
| DeepSeek R1T2 Chimera (free) | tngtech/deepseek-r1t2-chimera:free | 163.8K | 163.8K | Input: $0 Output: $0 | - | 🧠 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-07-08 |
| MiniMax M1 | minimax/minimax-m1 | 1M | 40K | Input: $0.4 Output: $2.2 | Model: 0.200 Completion: 5.500 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-06-17 |
| MiniMax M2 | minimax/minimax-m2 | 196.6K | 118K | Input: $0.28 Output: $1.15 Cache Read: $0.28 Cache Write: $1.15 | Model: 0.140 Completion: 4.107 Cache: 1.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-10-23 |
| MiniMax-01 | minimax/minimax-01 | 1M | 1M | Input: $0.2 Output: $1.1 | Model: 0.100 Completion: 5.500 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-01-15 |
| Gemini 2.0 Flash | google/gemini-2.0-flash-001 | 1M | 8.2K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemma 2 9B (free) | google/gemma-2-9b-it:free | 8.2K | 8.2K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-06 | In: text Out: text | Open Weights Released: 2024-06-28 |
| Gemini 3 Pro Preview | google/gemini-3-pro-preview | 1.1M | 66K | Input: $2 Output: $12 | Model: 1.000 Completion: 6.000 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-11-18 Updated: 2025-11 |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.0375 | Model: 0.150 Completion: 8.333 Cache: 0.125 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-07-17 |
| Gemini 2.5 Pro Preview 05-06 | google/gemini-2.5-pro-preview-05-06 | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-05-06 |
| Gemma 3n E4B IT | google/gemma-3n-e4b-it | 8.2K | 8.2K | Input: $0 Output: $0 | - | 📎 🌡️ | 2024-10 | In: text, image, audio Out: text | Open Weights Released: 2025-05-20 |
| Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Pro Preview 06-05 | google/gemini-2.5-pro-preview-06-05 | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-05 |
| Gemini 2.5 Flash Preview 09-25 | google/gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.031 | Model: 0.150 Completion: 8.333 Cache: 0.103 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| Gemma 3 12B IT | google/gemma-3-12b-it | 96K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-03-13 |
| Gemma 3n 4B (free) | google/gemma-3n-e4b-it:free | 8.2K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-05 | In: text, image, audio Out: text | Open Weights Released: 2025-05-20 |
| Gemini 2.5 Flash Lite Preview 09-25 | google/gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.0 Flash Experimental (free) | google/gemini-2.0-flash-exp:free | 1M | 1M | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-12 | In: text, image Out: text | Released: 2024-12-11 |
| Gemma 3 27B IT | google/gemma-3-27b-it | 96K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-03-12 |
| MAI DS R1 (free) | microsoft/mai-ds-r1:free | 163.8K | 163.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-21 |
| GPT OSS Safeguard 20B | openai/gpt-oss-safeguard-20b | 131.1K | 65.5K | Input: $0.075 Output: $0.3 | Model: 0.037 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-10-29 |
| GPT-5.1-Codex | openai/gpt-5.1-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| GPT-4.1 Mini | openai/gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-5 Chat (latest) | openai/gpt-5-chat | 400K | 128K | Input: $1.25 Output: $10 | Model: 0.625 Completion: 8.000 | 📎 🧠 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5.1-Codex-Mini | openai/gpt-5.1-codex-mini | 400K | 100K | Input: $0.25 Output: $2 Cache Read: $0.025 | Model: 0.125 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| GPT-5.1 | openai/gpt-5.1 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| GPT-5 Nano | openai/gpt-5-nano | 400K | 128K | Input: $0.05 Output: $0.4 | Model: 0.025 Completion: 8.000 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5 Codex | openai/gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT OSS 120B (exacto) | openai/gpt-oss-120b:exacto | 131.1K | 32.8K | Input: $0.05 Output: $0.24 | Model: 0.025 Completion: 4.800 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| o4 Mini | openai/o4-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-04-16 |
| GPT-5.1 Chat | openai/gpt-5.1-chat | 128K | 16.4K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-09-30 | In: text, image Out: text | Released: 2025-11-13 |
| GPT-5 Mini | openai/gpt-5-mini | 400K | 128K | Input: $0.25 Output: $2 | Model: 0.125 Completion: 8.000 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5 Image | openai/gpt-5-image | 400K | 128K | Input: $5 Output: $10 Cache Read: $1.25 | Model: 2.500 Completion: 2.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image, pdf Out: text, image | Released: 2025-10-14 |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | Input: $0.05 Output: $0.2 | Model: 0.025 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | Input: $0.072 Output: $0.28 | Model: 0.036 Completion: 3.889 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT-4o-mini | openai/gpt-4o-mini | 128K | 16.4K | Input: $0.15 Output: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2024-07-18 |
| GPT-5 | openai/gpt-5 | 400K | 128K | Input: $1.25 Output: $10 | Model: 0.625 Completion: 8.000 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-5 Pro | openai/gpt-5-pro | 400K | 272K | Input: $15 Output: $120 | Model: 7.500 Completion: 8.000 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-10-06 |
| GLM 4.5 | z-ai/glm-4.5 | 128K | 96K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5 Air | z-ai/glm-4.5-air | 128K | 96K | Input: $0.2 Output: $1.1 | Model: 0.100 Completion: 5.500 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5V | z-ai/glm-4.5v | 64K | 16.4K | Input: $0.6 Output: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
| GLM 4.6 | z-ai/glm-4.6 | 200K | 128K | Input: $0.6 Output: $2.2 Cache Read: $0.11 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-30 |
| GLM 4.6 (exacto) | z-ai/glm-4.6:exacto | 200K | 128K | Input: $0.6 Output: $1.9 Cache Read: $0.11 | Model: 0.300 Completion: 3.167 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-30 |
| GLM 4.5 Air (free) | z-ai/glm-4.5-air:free | 128K | 96K | Input: $0 Output: $0 | - | 🧠 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| Qwen3 Coder | qwen/qwen3-coder | 262.1K | 66.5K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3 32B (free) | qwen/qwen3-32b:free | 41K | 41K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
| Qwen3 Next 80B A3B Instruct | qwen/qwen3-next-80b-a3b-instruct | 262.1K | 262.1K | Input: $0.14 Output: $1.4 | Model: 0.070 Completion: 10.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
| Qwen2.5 Coder 32B Instruct | qwen/qwen-2.5-coder-32b-instruct | 32.8K | 8.2K | Input: $0 Output: $0 | - | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-11-11 |
| Qwen3 235B A22B (free) | qwen/qwen3-235b-a22b:free | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
| QwQ 32B (free) | qwen/qwq-32b:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-03 | In: text Out: text | Open Weights Released: 2025-03-05 |
| Qwen3 30B A3B Thinking 2507 | qwen/qwen3-30b-a3b-thinking-2507 | 262K | 262K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-29 |
| Qwen3 30B A3B (free) | qwen/qwen3-30b-a3b:free | 41K | 41K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
| Qwen2.5 VL 72B Instruct | qwen/qwen2.5-vl-72b-instruct | 32.8K | 8.2K | Input: $0 Output: $0 | - | 📎 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-02-01 |
| Qwen3 14B (free) | qwen/qwen3-14b:free | 41K | 41K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
| Qwen3 30B A3B Instruct 2507 | qwen/qwen3-30b-a3b-instruct-2507 | 262K | 262K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-29 |
| Qwen3 235B A22B Thinking 2507 | qwen/qwen3-235b-a22b-thinking-2507 | 262.1K | 81.9K | Input: $0.078 Output: $0.312 | Model: 0.039 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
| Qwen2.5 VL 32B Instruct (free) | qwen/qwen2.5-vl-32b-instruct:free | 8.2K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-03 | In: text, image, video Out: text | Open Weights Released: 2025-03-24 |
| Qwen2.5 VL 72B Instruct (free) | qwen/qwen2.5-vl-72b-instruct:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-02 | In: text, image Out: text | Open Weights Released: 2025-02-01 |
| Qwen3 235B A22B Instruct 2507 (free) | qwen/qwen3-235b-a22b-07-25:free | 262.1K | 131.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
| Qwen3 Coder 480B A35B Instruct (free) | qwen/qwen3-coder:free | 262.1K | 66.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3 235B A22B Instruct 2507 | qwen/qwen3-235b-a22b-07-25 | 262.1K | 131.1K | Input: $0.15 Output: $0.85 | Model: 0.075 Completion: 5.667 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
| Qwen3 8B (free) | qwen/qwen3-8b:free | 41K | 41K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 |
| Qwen3 Max | qwen/qwen3-max | 262.1K | 32.8K | Input: $1.2 Output: $6 | Model: 0.600 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-09-05 |
| Qwen3 Next 80B A3B Thinking | qwen/qwen3-next-80b-a3b-thinking | 262.1K | 262.1K | Input: $0.14 Output: $1.4 | Model: 0.070 Completion: 10.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-11 |
| Qwen3 Coder (exacto) | qwen/qwen3-coder:exacto | 131.1K | 32.8K | Input: $0.38 Output: $1.53 | Model: 0.190 Completion: 4.026 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Devstral Medium | mistralai/devstral-medium-2507 | 131.1K | 131.1K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
| Codestral 2508 | mistralai/codestral-2508 | 256K | 256K | Input: $0.3 Output: $0.9 | Model: 0.150 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-08-01 |
| Mistral 7B Instruct (free) | mistralai/mistral-7b-instruct:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-05 | In: text Out: text | Open Weights Released: 2024-05-27 |
| Devstral Small | mistralai/devstral-small-2505 | 128K | 128K | Input: $0.06 Output: $0.12 | Model: 0.030 Completion: 2.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-07 |
| Mistral Small 3.2 24B Instruct | mistralai/mistral-small-3.2-24b-instruct | 96K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-06-20 |
| Devstral Small 2505 (free) | mistralai/devstral-small-2505:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-21 |
| Mistral Small 3.2 24B (free) | mistralai/mistral-small-3.2-24b-instruct:free | 96K | 96K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2025-06 | In: text, image Out: text | Open Weights Released: 2025-06-20 |
| Mistral Medium 3 | mistralai/mistral-medium-3 | 131.1K | 131.1K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-05-07 |
| Mistral Small 3.1 24B Instruct | mistralai/mistral-small-3.1-24b-instruct | 128K | 8.2K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Open Weights Released: 2025-03-17 |
| Devstral Small 1.1 | mistralai/devstral-small-2507 | 131.1K | 131.1K | Input: $0.1 Output: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-07-10 |
| Mistral Medium 3.1 | mistralai/mistral-medium-3.1 | 262.1K | 262.1K | Input: $0.4 Output: $2 | Model: 0.200 Completion: 5.000 | 📎 🔧 🌡️ | 2025-05 | In: text, image Out: text | Released: 2025-08-12 |
| Mistral Nemo (free) | mistralai/mistral-nemo:free | 131.1K | 131.1K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2024-07-19 |
| Reka Flash 3 | rekaai/reka-flash-3 | 32.8K | 8.2K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-12 |
| Llama 3.2 11B Vision Instruct | meta-llama/llama-3.2-11b-vision-instruct | 131.1K | 8.2K | Input: $0 Output: $0 | - | 📎 🌡️ | 2023-12 | In: text, image Out: text | Open Weights Released: 2024-09-25 |
| Llama 3.3 70B Instruct (free) | meta-llama/llama-3.3-70b-instruct:free | 65.5K | 65.5K | Input: $0 Output: $0 | - | 🔧 🌡️ | 2024-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama 4 Scout (free) | meta-llama/llama-4-scout:free | 64K | 64K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Claude Opus 4 | anthropic/claude-opus-4 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Claude Opus 4.1 | anthropic/claude-opus-4.1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Sonnet 3.7 | anthropic/claude-3.7-sonnet | 200K | 128K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Haiku 3.5 | anthropic/claude-3.5-haiku | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Sonnet 4 | anthropic/claude-sonnet-4 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Sonnet 4.5 | anthropic/claude-sonnet-4.5 | 1M | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Sarvam-M (free) | sarvamai/sarvam-m:free | 32.8K | 32.8K | Input: $0 Output: $0 | - | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-25 |
OVHcloud AI Endpoints¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Mixtral-8x7B-Instruct-v0.1 | mixtral-8x7b-instruct-v0.1 | 32K | 32K | Input: $0.7 Output: $0.7 | Model: 0.350 Completion: 1.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2025-04-01 |
| Mistral-7B-Instruct-v0.3 | mistral-7b-instruct-v0.3 | 127K | 127K | Input: $0.11 Output: $0.11 | Model: 0.055 Completion: 1.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-04-01 |
| Llama-3.1-8B-Instruct | llama-3.1-8b-instruct | 131K | 131K | Input: $0.11 Output: $0.11 | Model: 0.055 Completion: 1.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-06-11 |
| Qwen2.5-VL-72B-Instruct | qwen2.5-vl-72b-instruct | 32K | 32K | Input: $1.01 Output: $1.01 | Model: 0.505 Completion: 1.000 | 📎 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-03-31 |
| Mistral-Nemo-Instruct-2407 | mistral-nemo-instruct-2407 | 118K | 118K | Input: $0.14 Output: $0.14 | Model: 0.070 Completion: 1.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-11-20 |
| Mistral-Small-3.2-24B-Instruct-2506 | mistral-small-3.2-24b-instruct-2506 | 128K | 128K | Input: $0.1 Output: $0.31 | Model: 0.050 Completion: 3.100 | 📎 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-07-16 |
| Qwen2.5-Coder-32B-Instruct | qwen2.5-coder-32b-instruct | 32K | 32K | Input: $0.96 Output: $0.96 | Model: 0.480 Completion: 1.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2025-03-24 |
| Qwen3-Coder-30B-A3B-Instruct | qwen3-coder-30b-a3b-instruct | 256K | 256K | Input: $0.07 Output: $0.26 | Model: 0.035 Completion: 3.714 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-10-28 |
| llava-next-mistral-7b | llava-next-mistral-7b | 32K | 32K | Input: $0.32 Output: $0.32 | Model: 0.160 Completion: 1.000 | 📎 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-01-08 |
| DeepSeek-R1-Distill-Llama-70B | deepseek-r1-distill-llama-70b | 131K | 131K | Input: $0.74 Output: $0.74 | Model: 0.370 Completion: 1.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-01-30 |
| Meta-Llama-3_1-70B-Instruct | meta-llama-3_1-70b-instruct | 131K | 131K | Input: $0.74 Output: $0.74 | Model: 0.370 Completion: 1.000 | 🌡️ | - | In: text Out: text | Open Weights Released: 2025-04-01 |
| gpt-oss-20b | gpt-oss-20b | 131K | 131K | Input: $0.05 Output: $0.18 | Model: 0.025 Completion: 3.600 | 🧠 🔧 | - | In: text Out: text | Open Weights Released: 2025-08-28 |
| gpt-oss-120b | gpt-oss-120b | 131K | 131K | Input: $0.09 Output: $0.47 | Model: 0.045 Completion: 5.222 | 🧠 🔧 | - | In: text Out: text | Open Weights Released: 2025-08-28 |
| Meta-Llama-3_3-70B-Instruct | meta-llama-3_3-70b-instruct | 131K | 131K | Input: $0.74 Output: $0.74 | Model: 0.370 Completion: 1.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-04-01 |
| Qwen3-32B | qwen3-32b | 32K | 32K | Input: $0.09 Output: $0.25 | Model: 0.045 Completion: 2.778 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-16 |
Perplexity¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Sonar Reasoning | sonar-reasoning | 128K | 4.1K | Input: $1 Output: $5 | Model: 0.500 Completion: 5.000 | 🧠 🌡️ | 2025-09-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
| Sonar | sonar | 128K | 4.1K | Input: $1 Output: $1 | Model: 0.500 Completion: 1.000 | 🌡️ | 2025-09-01 | In: text Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
| Sonar Pro | sonar-pro | 200K | 8.2K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 📎 🌡️ | 2025-09-01 | In: text, image Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
| Sonar Reasoning Pro | sonar-reasoning-pro | 128K | 4.1K | Input: $2 Output: $8 | Model: 1.000 Completion: 4.000 | 📎 🧠 🌡️ | 2025-09-01 | In: text, image Out: text | Released: 2024-01-01 Updated: 2025-09-01 |
Poe¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Llama-3.1-8B | facebook/llama-3.1-8b | 8.2K | - | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-08-08 |
| Llama-3.1-405B | facebook/llama-3.1-405b | 8.2K | - | Input: $3 Output: $3 | Model: 1.500 Completion: 1.000 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-08-08 |
| Llama-3.1-70B | facebook/llama-3.1-70b | 8.2K | - | Input: $0.9 Output: $0.9 | Model: 0.450 Completion: 1.000 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-08-08 |
| Grok-4-Fast-Non-Reasoning | xai/grok-4-fast-non-reasoning | 256K | 128K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 | - | In: text Out: text | Released: 2025-09-16 |
| Grok 4 Fast Reasoning | xai/grok-4-fast-reasoning | 256K | 128K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 | - | In: text Out: text | Released: 2025-09-16 |
| Grok 4 | xai/grok-4 | 256K | 128K | Input: $3 Output: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-07-10 |
| Grok Code Fast 1 | xai/grok-code-fast-1 | 256K | 128K | Input: $0.2 Output: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-08-22 |
| Grok-2 | xai/grok-2 | 131.1K | 8.2K | Input: $2 Output: $10 | Model: 1.000 Completion: 5.000 | 📎 🔧 | - | In: text Out: text | Released: 2025-01-14 |
| Grok 3 | xai/grok-3 | 131.1K | 8.2K | Input: $3 Output: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-04-11 |
| Grok 3 Mini | xai/grok-3-mini | 131.1K | 8.2K | Input: $0.3 Output: $0.5 Cache Read: $0.075 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 📎 🧠 🔧 | - | In: text Out: text | Released: 2025-04-11 |
| Ideogram | ideogramai/ideogram | 150 | - | - | - | 📎 🔧 | - | In: text, image Out: image | Released: 2024-04-03 |
| Ideogram-v2a | ideogramai/ideogram-v2a | 150 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2025-02-27 |
| Ideogram-v2a-Turbo | ideogramai/ideogram-v2a-turbo | 150 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2025-02-27 |
| Ideogram-v2 | ideogramai/ideogram-v2 | 150 | - | - | - | 📎 🔧 | - | In: text, image Out: image | Released: 2024-08-21 |
| Runway | runwayml/runway | 256 | - | - | - | 📎 🔧 | - | In: text, image Out: video | Released: 2024-10-11 |
| Runway-Gen-4-Turbo | runwayml/runway-gen-4-turbo | 256 | - | - | - | 📎 🔧 | - | In: text, image Out: video | Released: 2025-05-09 |
| GPT-4.1-nano | openAi/gpt-4.1-nano | 1M | 32.8K | Input: $0.09 Output: $0.36 Cache Read: $0.023 | Model: 0.045 Completion: 4.000 Cache: 0.256 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-04-15 |
| Sora-2 | openAi/sora-2 | - | - | - | - | 📎 🔧 | - | In: text, image Out: video | Released: 2025-10-06 |
| o1-pro | openAi/o1-pro | 200K | 100K | Input: $140 Output: $540 | Model: 70.000 Completion: 3.857 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-03-19 |
| GPT-3.5-Turbo-Raw | openAi/gpt-3.5-turbo-raw | 4.5K | 2K | Input: $0.45 Output: $1.3 | Model: 0.225 Completion: 2.889 | 📎 🔧 | - | In: text, image Out: text | Released: 2023-09-27 |
| GPT-4-Classic | openAi/gpt-4-classic | 8.2K | 4.1K | Input: $27 Output: $54 | Model: 13.500 Completion: 2.000 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-03-25 |
| GPT-4.1-mini | openAi/gpt-4.1-mini | 1M | 32.8K | Input: $0.36 Output: $1.4 Cache Read: $0.09 | Model: 0.180 Completion: 3.889 Cache: 0.250 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-04-15 |
| GPT-5-Chat | openAi/gpt-5-chat | 128K | 16.4K | Input: $1.1 Output: $9 Cache Read: $0.11 | Model: 0.550 Completion: 8.182 Cache: 0.100 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-08-07 |
| o3-deep-research | openAi/o3-deep-research | 200K | 100K | Input: $9 Output: $36 Cache Read: $2.3 | Model: 4.500 Completion: 4.000 Cache: 0.256 | 📎 🧠 🔧 | - | In: text Out: text | Released: 2025-06-27 |
| GPT-4o-Search | openAi/gpt-4o-search | 128K | 8.2K | Input: $2.3 Output: $9 | Model: 1.150 Completion: 3.913 | 📎 🔧 | - | In: text Out: text | Released: 2025-03-11 |
| GPT-Image-1-Mini | openAi/gpt-image-1-mini | - | - | - | - | 📎 🔧 | - | In: text, image Out: image | Released: 2025-08-26 |
| GPT-3.5-Turbo | openAi/gpt-3.5-turbo | 16.4K | 2K | Input: $0.45 Output: $1.3 | Model: 0.225 Completion: 2.889 | 📎 🔧 | - | In: text, image Out: text | Released: 2023-09-13 |
| o3-mini-high | openAi/o3-mini-high | 200K | 100K | Input: $0.99 Output: $4 | Model: 0.495 Completion: 4.040 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-01-31 |
| ChatGPT-4o-Latest | openAi/chatgpt-4o-latest | 128K | 8.2K | Input: $4.5 Output: $13 | Model: 2.250 Completion: 2.889 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-08-14 |
| GPT-4-Turbo | openAi/gpt-4-turbo | 128K | 4.1K | Input: $9 Output: $27 | Model: 4.500 Completion: 3.000 | 📎 🔧 | - | In: text, image Out: text | Released: 2023-09-13 |
| o3-mini | openAi/o3-mini | 200K | 100K | Input: $0.99 Output: $4 | Model: 0.495 Completion: 4.040 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-01-31 |
| GPT-5-nano | openAi/gpt-5-nano | 400K | 128K | Input: $0.045 Output: $0.36 Cache Read: $0.0045 | Model: 0.022 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-08-05 |
| GPT-5-Codex | openAi/gpt-5-codex | 400K | 128K | Input: $1.1 Output: $9 | Model: 0.550 Completion: 8.182 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-09-23 |
| GPT-4o | openAi/gpt-4o | 128K | 8.2K | - | - | 📎 🔧 | - | In: text, image Out: text | Released: 2024-05-13 |
| GPT-4.1 | openAi/gpt-4.1 | 1M | 32.8K | Input: $1.8 Output: $7.2 Cache Read: $0.45 | Model: 0.900 Completion: 4.000 Cache: 0.250 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-04-14 |
| o4-mini | openAi/o4-mini | 200K | 100K | Input: $0.99 Output: $4 Cache Read: $0.25 | Model: 0.495 Completion: 4.040 Cache: 0.253 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-04-16 |
| o1 | openAi/o1 | 200K | 100K | Input: $13 Output: $54 | Model: 6.500 Completion: 4.154 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2024-12-18 |
| GPT-5-mini | openAi/gpt-5-mini | 400K | 128K | Input: $0.22 Output: $1.8 Cache Read: $0.022 | Model: 0.110 Completion: 8.182 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-06-25 |
| GPT-4o-Aug | openAi/gpt-4o-aug | 128K | 8.2K | Input: $2.3 Output: $9 Cache Read: $1.1 | Model: 1.150 Completion: 3.913 Cache: 0.478 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-11-21 |
| o3-pro | openAi/o3-pro | 200K | 100K | Input: $18 Output: $72 | Model: 9.000 Completion: 4.000 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-06-10 |
| GPT-Image-1 | openAi/gpt-image-1 | 128K | - | - | - | 📎 🔧 | - | In: text, image Out: image | Released: 2025-03-31 |
| GPT-3.5-Turbo-Instruct | openAi/gpt-3.5-turbo-instruct | 3.5K | 1K | Input: $1.3 Output: $1.8 | Model: 0.650 Completion: 1.385 | 📎 🔧 | - | In: text, image Out: text | Released: 2023-09-20 |
| o3 | openAi/o3 | 200K | 100K | Input: $1.8 Output: $7.2 Cache Read: $0.45 | Model: 0.900 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-04-16 |
| o4-mini-deep-research | openAi/o4-mini-deep-research | 200K | 100K | Input: $1.8 Output: $7.2 Cache Read: $0.45 | Model: 0.900 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | - | In: text Out: text | Released: 2025-06-27 |
| GPT-4-Classic-0314 | openAi/gpt-4-classic-0314 | 8.2K | 4.1K | Input: $27 Output: $54 | Model: 13.500 Completion: 2.000 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-08-26 |
| GPT-4o-mini | openAi/gpt-4o-mini | 128K | 4.1K | Input: $0.14 Output: $0.54 Cache Read: $0.068 | Model: 0.070 Completion: 3.857 Cache: 0.486 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-07-18 |
| GPT-5 | openAi/gpt-5 | 400K | 128K | Input: $1.1 Output: $9 Cache Read: $0.11 | Model: 0.550 Completion: 8.182 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-08-05 |
| DALL-E-3 | openAi/dall-e-3 | 800 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2023-11-06 |
| Sora-2-Pro | openAi/sora-2-pro | - | - | - | - | 📎 🔧 | - | In: text, image Out: video | Released: 2025-10-06 |
| GPT-5-Pro | openAi/gpt-5-pro | 400K | 128K | Input: $13 Output: $110 | Model: 6.500 Completion: 8.462 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-10-06 |
| GPT-4o-mini-Search | openAi/gpt-4o-mini-search | 128K | 8.2K | Input: $0.14 Output: $0.54 | Model: 0.070 Completion: 3.857 | 📎 🔧 | - | In: text Out: text | Released: 2025-03-11 |
| ElevenLabs-v3 | elevenlabs/elevenlabs-v3 | 128K | - | - | - | 📎 🔧 | - | In: text Out: audio | Released: 2025-06-05 |
| ElevenLabs-Music | elevenlabs/elevenlabs-music | 2K | - | - | - | 📎 🔧 | - | In: text Out: audio | Released: 2025-08-29 |
| ElevenLabs-v2.5-Turbo | elevenlabs/elevenlabs-v2.5-turbo | 128K | - | - | - | 📎 🔧 | - | In: text Out: audio | Released: 2024-10-28 |
| Nano-Banana | google/nano-banana | 32.8K | - | Input: $0.21 Output: $1.7 | Model: 0.105 Completion: 8.095 | 📎 🔧 | - | In: text, image Out: text, image | Released: 2025-08-21 |
| Imagen-4 | google/imagen-4 | 480 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2025-05-22 |
| Imagen-3 | google/imagen-3 | 480 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2024-10-15 |
| Imagen-4-Ultra | google/imagen-4-ultra | 480 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2025-05-24 |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 1.1M | 65.5K | Input: $0.21 Output: $1.7 Cache Read: $0.052 | Model: 0.105 Completion: 8.095 Cache: 0.248 | 📎 🧠 🔧 | - | In: text, image, video, audio Out: text | Released: 2025-04-26 |
| Gemini-3.0-Pro | google/gemini-3.0-pro | 1M | 64K | Input: $1.6 Output: $9.6 Cache Read: $0.16 | Model: 0.800 Completion: 6.000 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image, video, audio Out: text | Released: 2025-10-22 |
| Gemini-2.0-Flash-Lite | google/gemini-2.0-flash-lite | 990K | 8.2K | Input: $0.052 Output: $0.21 | Model: 0.026 Completion: 4.038 | 📎 🔧 | - | In: text, image, video, audio Out: text | Released: 2025-02-05 |
| Veo-3.1 | google/veo-3.1 | 480 | - | - | - | 📎 🔧 | - | In: text Out: video | Released: 2025-10-15 |
| Imagen-3-Fast | google/imagen-3-fast | 480 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2024-10-17 |
| Lyria | google/lyria | - | - | - | - | 📎 🔧 | - | In: text Out: audio | Released: 2025-06-04 |
| Gemini-2.0-Flash | google/gemini-2.0-flash | 990K | 8.2K | Input: $0.1 Output: $0.42 | Model: 0.050 Completion: 4.200 | 📎 🔧 | - | In: text, image, video, audio Out: text | Released: 2024-12-11 |
| Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 1M | 64K | Input: $0.07 Output: $0.28 | Model: 0.035 Completion: 4.000 | 📎 🧠 🔧 | - | In: text, image, video, audio Out: text | Released: 2025-06-19 |
| Veo-3 | google/veo-3 | 480 | - | - | - | 📎 🔧 | - | In: text Out: video | Released: 2025-05-21 |
| Veo-3-Fast | google/veo-3-fast | 480 | - | - | - | 📎 🔧 | - | In: text Out: video | Released: 2025-10-13 |
| Imagen-4-Fast | google/imagen-4-fast | 480 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2025-06-25 |
| Veo-2 | google/veo-2 | 480 | - | - | - | 📎 🔧 | - | In: text Out: video | Released: 2024-12-02 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 1.1M | 65.5K | Input: $0.87 Output: $7 Cache Read: $0.22 | Model: 0.435 Completion: 8.046 Cache: 0.253 | 📎 🧠 🔧 | - | In: text, image, video, audio Out: text | Released: 2025-02-05 |
| Veo-3.1-Fast | google/veo-3.1-fast | 480 | - | - | - | 📎 🔧 | - | In: text Out: video | Released: 2025-10-15 |
| GPT-5.1-Codex | openai/gpt-5.1-codex | 400K | 128K | Input: $1.1 Output: $9 Cache Read: $0.11 | Model: 0.550 Completion: 8.182 Cache: 0.100 | 📎 🧠 🔧 | - | In: text Out: text | Released: 2025-11-12 |
| GPT-5.1-Codex-Mini | openai/gpt-5.1-codex-mini | 400K | 128K | Input: $0.22 Output: $1.8 Cache Read: $0.022 | Model: 0.110 Completion: 8.182 Cache: 0.100 | 📎 🧠 🔧 | - | In: text Out: text | Released: 2025-11-12 |
| GPT-5.1-Instant | openai/gpt-5.1-instant | 128K | 16.4K | Input: $1.1 Output: $9 Cache Read: $0.11 | Model: 0.550 Completion: 8.182 Cache: 0.100 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-11-12 |
| GPT-5.1 | openai/gpt-5.1 | 400K | 128K | Input: $1.1 Output: $9 Cache Read: $0.11 | Model: 0.550 Completion: 8.182 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-11-12 |
| StableDiffusionXL | stabilityai/stablediffusionxl | 200 | - | - | - | 📎 🔧 | - | In: text, image Out: image | Released: 2023-07-09 |
| TopazLabs | topazlabs-co/topazlabs | 204 | - | - | - | 📎 🔧 | - | In: text Out: image | Released: 2024-12-03 |
| Ray2 | lumalabs/ray2 | 5K | - | - | - | 📎 🔧 | - | In: text, image Out: video | Released: 2025-02-20 |
| Dream-Machine | lumalabs/dream-machine | 5K | - | - | - | 📎 🔧 | - | In: text, image Out: video | Released: 2024-09-18 |
| Claude-Opus-3 | anthropic/claude-opus-3 | 189.1K | 8.2K | Input: $13 Output: $64 Cache Read: $1.3 Cache Write: $16 | Model: 6.500 Completion: 4.923 Cache: 0.100 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-03-04 |
| Claude Opus 4 | anthropic/claude-opus-4 | 192.5K | 32.8K | Input: $13 Output: $64 Cache Read: $1.3 Cache Write: $16 | Model: 6.500 Completion: 4.923 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-05-21 |
| Claude Sonnet 3.7 Reasoning | anthropic/claude-sonnet-3.7-reasoning | 196.6K | 128K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-02-19 |
| Claude Opus 4 Search | anthropic/claude-opus-4-search | 196.6K | 128K | Input: $13 Output: $64 Cache Read: $1.3 Cache Write: $16 | Model: 6.500 Completion: 4.923 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-06-20 |
| Claude Sonnet 3.7 | anthropic/claude-sonnet-3.7 | 196.6K | 32.8K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-02-19 |
| Claude-Haiku-3.5-Search | anthropic/claude-haiku-3.5-search | 189.1K | 8.2K | Input: $0.68 Output: $3.4 Cache Read: $0.068 Cache Write: $0.85 | Model: 0.340 Completion: 5.000 Cache: 0.100 | 📎 🔧 | - | In: text, image Out: text | Released: 2025-05-15 |
| Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 192K | 64K | Input: $0.85 Output: $4.2 Cache Read: $0.085 Cache Write: $1.1 | Model: 0.425 Completion: 4.941 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-10-15 |
| Claude Sonnet 4 Reasoning | anthropic/claude-sonnet-4-reasoning | 983K | 64K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-05-21 |
| Claude-Haiku-3 | anthropic/claude-haiku-3 | 189.1K | 8.2K | Input: $0.21 Output: $1.1 Cache Read: $0.021 Cache Write: $0.26 | Model: 0.105 Completion: 5.238 Cache: 0.100 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-03-09 |
| Claude Opus 4.1 | anthropic/claude-opus-4.1 | 196.6K | 32K | Input: $13 Output: $64 Cache Read: $1.3 Cache Write: $16 | Model: 6.500 Completion: 4.923 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-08-05 |
| Claude Sonnet 3.7 Search | anthropic/claude-sonnet-3.7-search | 196.6K | 128K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-05-15 |
| Claude Opus 4 Reasoning | anthropic/claude-opus-4-reasoning | 196.6K | 32.8K | Input: $13 Output: $64 Cache Read: $1.3 Cache Write: $16 | Model: 6.500 Completion: 4.923 Cache: 0.100 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-05-21 |
| Claude-Sonnet-3.5 | anthropic/claude-sonnet-3.5 | 189.1K | 8.2K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-06-05 |
| Claude Sonnet 4 | anthropic/claude-sonnet-4 | 983K | 32.8K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-05-21 |
| Claude-Haiku-3.5 | anthropic/claude-haiku-3.5 | 189.1K | 8.2K | Input: $0.68 Output: $3.4 Cache Read: $0.068 Cache Write: $0.85 | Model: 0.340 Completion: 5.000 Cache: 0.100 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-10-01 |
| Claude-Sonnet-3.5-June | anthropic/claude-sonnet-3.5-june | 189.1K | 8.2K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🔧 | - | In: text, image Out: text | Released: 2024-11-18 |
| Claude Sonnet 4.5 | anthropic/claude-sonnet-4.5 | 983K | 32.8K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-09-26 |
| Claude Sonnet 4 Search | anthropic/claude-sonnet-4-search | 983K | 128K | Input: $2.6 Output: $13 Cache Read: $0.25 Cache Write: $3.2 | Model: 1.300 Completion: 5.000 Cache: 0.096 | 📎 🧠 🔧 | - | In: text, image Out: text | Released: 2025-06-20 |
| Tako | trytako/tako | 2K | - | - | - | 📎 🔧 | - | In: text Out: text | Released: 2024-08-15 |
| GLM-4.6 | novita/glm-4.6 | - | - | - | - | 📎 🔧 | - | In: text Out: text | Released: 2025-09-30 |
Requesty¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Grok 4 | xai/grok-4 | 256K | 64K | Input: $3 Output: $15 Cache Read: $0.75 Cache Write: $3 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image Out: text | Released: 2025-09-09 |
| Grok 4 Fast | xai/grok-4-fast | 2M | 64K | Input: $0.2 Output: $0.5 Cache Read: $0.05 Cache Write: $0.2 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Released: 2025-09-19 |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Cache Write: $0.55 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 Cache Write: $2.375 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| GPT-4.1 Mini | openai/gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-5 Nano | openai/gpt-5-nano | 16K | 4K | Input: $0.05 Output: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text Out: text | Released: 2025-08-07 |
| GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| o4 Mini | openai/o4-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 🌡️ | 2024-06 | In: text, image Out: text | Released: 2025-04-16 |
| GPT-5 Mini | openai/gpt-5-mini | 128K | 32K | Input: $0.25 Output: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-4o Mini | openai/gpt-4o-mini | 128K | 16.4K | Input: $0.15 Output: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2024-10 | In: text, image Out: text | Released: 2024-07-18 |
| GPT-5 | openai/gpt-5 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, audio, image, video Out: text, audio, image | Released: 2025-08-07 |
| Claude Opus 4 | anthropic/claude-opus-4 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Opus 4.1 | anthropic/claude-opus-4-1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Haiku 4.5 | anthropic/claude-haiku-4-5 | 200K | 62K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-01 | In: text, image Out: text | Released: 2025-10-15 |
| Claude Sonnet 4.5 | anthropic/claude-sonnet-4-5 | 1M | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Claude Sonnet 3.7 | anthropic/claude-3-7-sonnet | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-01 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Sonnet 4 | anthropic/claude-sonnet-4 | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
Scaleway¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Qwen3 235B A22B Instruct 2507 | qwen3-235b-a22b-instruct-2507 | 260K | 8.2K | Input: $0.75 Output: $2.25 | Model: 0.375 Completion: 3.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-01 |
| Pixtral 12B 2409 | pixtral-12b-2409 | 128K | 4.1K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2024-09-25 |
| Llama 3.1 8B Instruct | llama-3.1-8b-instruct | 128K | 16.4K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-01-01 |
| Mistral Nemo Instruct 2407 | mistral-nemo-instruct-2407 | 128K | 8.2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-07-25 |
| Mistral Small 3.2 24B Instruct (2506) | mistral-small-3.2-24b-instruct-2506 | 128K | 8.2K | Input: $0.15 Output: $0.35 | Model: 0.075 Completion: 2.333 | 🔧 🌡️ | - | In: text, image Out: text | Open Weights Released: 2025-06-20 |
| Qwen3-Coder 30B-A3B Instruct | qwen3-coder-30b-a3b-instruct | 128K | 8.2K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04 |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 100K | 4.1K | Input: $0.9 Output: $0.9 | Model: 0.450 Completion: 1.000 | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Whisper Large v3 | whisper-large-v3 | - | 4.1K | Input: $0.003 Output: $0 | Model: 0.002 | - | 2023-09 | In: audio Out: text | Open Weights Released: 2023-09-01 Updated: 2025-09-05 |
| DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 32K | 4.1K | Input: $0.9 Output: $0.9 | Model: 0.450 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 |
| Voxtral Small 24B 2507 | voxtral-small-24b-2507 | 32K | 8.2K | Input: $0.15 Output: $0.35 | Model: 0.075 Completion: 2.333 | 📎 🔧 🌡️ | - | In: text, audio Out: text | Open Weights Released: 2025-07-01 |
| GPT-OSS 120B | gpt-oss-120b | 128K | 8.2K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 📎 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2024-01-01 |
| BGE Multilingual Gemma2 | bge-multilingual-gemma2 | 8.2K | 3.1K | Input: $0.13 Output: $0 | Model: 0.065 | - | - | In: text Out: text | Released: 2024-07-26 Updated: 2025-06-15 |
| Gemma-3-27B-IT | gemma-3-27b-it | 40K | 8.2K | Input: $0.25 Output: $0.5 | Model: 0.125 Completion: 2.000 | 📎 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Released: 2024-12-01 Updated: 2025-09-05 |
submodel¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | Input: $0.1 Output: $0.5 | Model: 0.050 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-23 |
| Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | Input: $0.2 Output: $0.3 | Model: 0.100 Completion: 1.500 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-23 |
| Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262.1K | 262.1K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
| Qwen3 235B A22B Thinking 2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | Input: $0.2 Output: $0.6 | Model: 0.100 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-23 |
| GLM 4.5 FP8 | zai-org/GLM-4.5-FP8 | 131.1K | 131.1K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5 Air | zai-org/GLM-4.5-Air | 131.1K | 131.1K | Input: $0.1 Output: $0.5 | Model: 0.050 Completion: 5.000 | 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-07-28 |
| DeepSeek R1 0528 | deepseek-ai/DeepSeek-R1-0528 | 75K | 163.8K | Input: $0.5 Output: $2.15 | Model: 0.250 Completion: 4.300 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
| DeepSeek V3.1 | deepseek-ai/DeepSeek-V3.1 | 75K | 163.8K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
| DeepSeek V3 0324 | deepseek-ai/DeepSeek-V3-0324 | 75K | 163.8K | Input: $0.2 Output: $0.8 | Model: 0.100 Completion: 4.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-23 |
Synthetic¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Qwen 3 235B Instruct | hf:Qwen/Qwen3-235B-A22B-Instruct-2507 | 256K | 32K | Input: $0.2 Output: $0.6 | Model: 0.100 Completion: 3.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
| Qwen2.5-Coder-32B-Instruct | hf:Qwen/Qwen2.5-Coder-32B-Instruct | 32.8K | 32.8K | Input: $0.8 Output: $0.8 | Model: 0.400 Completion: 1.000 | 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-11-11 |
| Qwen 3 Coder 480B | hf:Qwen/Qwen3-Coder-480B-A35B-Instruct | 256K | 32K | Input: $2 Output: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3 235B A22B Thinking 2507 | hf:Qwen/Qwen3-235B-A22B-Thinking-2507 | 256K | 32K | Input: $0.65 Output: $3 | Model: 0.325 Completion: 4.615 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
| Minimax-M2 | hf:MiniMaxAI/MiniMax-M2 | 196.6K | 131K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-10-27 |
| Llama-3.1-70B-Instruct | hf:meta-llama/Llama-3.1-70B-Instruct | 128K | 32.8K | Input: $0.9 Output: $0.9 | Model: 0.450 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Llama-3.1-8B-Instruct | hf:meta-llama/Llama-3.1-8B-Instruct | 128K | 32.8K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Llama-3.3-70B-Instruct | hf:meta-llama/Llama-3.3-70B-Instruct | 128K | 32.8K | Input: $0.9 Output: $0.9 | Model: 0.450 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama-4-Scout-17B-16E-Instruct | hf:meta-llama/Llama-4-Scout-17B-16E-Instruct | 328K | 4.1K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Llama-4-Maverick-17B-128E-Instruct-FP8 | hf:meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | 524K | 4.1K | Input: $0.22 Output: $0.88 | Model: 0.110 Completion: 4.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Llama-3.1-405B-Instruct | hf:meta-llama/Llama-3.1-405B-Instruct | 128K | 32.8K | Input: $3 Output: $3 | Model: 1.500 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Kimi K2 | hf:moonshotai/Kimi-K2-Instruct | 128K | 32.8K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-11 |
| Kimi K2 0905 | hf:moonshotai/Kimi-K2-Instruct-0905 | 262.1K | 32.8K | Input: $1.2 Output: $1.2 | Model: 0.600 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-09-05 |
| Kimi K2 Thinking | hf:moonshotai/Kimi-K2-Thinking | 262.1K | 262.1K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-11 | In: text Out: text | Open Weights Released: 2025-11-07 |
| GLM 4.5 | hf:zai-org/GLM-4.5 | 128K | 96K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.6 | hf:zai-org/GLM-4.6 | 200K | 64K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| DeepSeek R1 | hf:deepseek-ai/DeepSeek-R1 | 128K | 128K | Input: $0.55 Output: $2.19 | Model: 0.275 Completion: 3.982 | 🧠 🔧 🌡️ | 2025-01 | In: text Out: text | Open Weights Released: 2025-01-20 |
| DeepSeek R1 (0528) | hf:deepseek-ai/DeepSeek-R1-0528 | 128K | 128K | Input: $3 Output: $8 | Model: 1.500 Completion: 2.667 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
| DeepSeek V3.1 Terminus | hf:deepseek-ai/DeepSeek-V3.1-Terminus | 128K | 128K | Input: $1.2 Output: $1.2 | Model: 0.600 Completion: 1.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-09-22 Updated: 2025-09-25 |
| DeepSeek V3 | hf:deepseek-ai/DeepSeek-V3 | 128K | 128K | Input: $1.25 Output: $1.25 | Model: 0.625 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 Updated: 2025-05-29 |
| DeepSeek V3.1 | hf:deepseek-ai/DeepSeek-V3.1 | 128K | 128K | Input: $0.56 Output: $1.68 | Model: 0.280 Completion: 3.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-21 |
| DeepSeek V3 (0324) | hf:deepseek-ai/DeepSeek-V3-0324 | 128K | 128K | Input: $1.2 Output: $1.2 | Model: 0.600 Completion: 1.000 | 🔧 🌡️ | - | In: text Out: text | Released: 2025-08-01 |
| GPT OSS 120B | hf:openai/gpt-oss-120b | 128K | 32.8K | Input: $0.1 Output: $0.1 | Model: 0.050 Completion: 1.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
Together AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 Instruct | moonshotai/Kimi-K2-Instruct | 131.1K | 32.8K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 131.1K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| Llama 3.3 70B | meta-llama/Llama-3.3-70B-Instruct-Turbo | 131.1K | 66.5K | Input: $0.88 Output: $0.88 | Model: 0.440 Completion: 1.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Qwen3 Coder 480B A35B Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262.1K | 66.5K | Input: $2 Output: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| DeepSeek R1 | deepseek-ai/DeepSeek-R1 | 163.8K | 12.3K | Input: $3 Output: $7 | Model: 1.500 Completion: 2.333 | 🧠 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2024-12-26 Updated: 2025-03-24 |
| DeepSeek V3 | deepseek-ai/DeepSeek-V3 | 131.1K | 12.3K | Input: $1.25 Output: $1.25 | Model: 0.625 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 Updated: 2025-05-29 |
Upstage¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| solar-mini | solar-mini | 32.8K | 4.1K | Input: $0.15 Output: $0.15 | Model: 0.075 Completion: 1.000 | 🔧 🌡️ | 2024-09 | In: text Out: text | Released: 2024-06-12 Updated: 2025-04-22 |
| solar-pro2 | solar-pro2 | 65.5K | 8.2K | Input: $0.25 Output: $0.25 | Model: 0.125 Completion: 1.000 | 🧠 🔧 🌡️ | 2025-03 | In: text Out: text | Released: 2025-05-20 |
v0¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| v0-1.5-lg | v0-1.5-lg | 512K | 32K | Input: $15 Output: $75 | Model: 7.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-06-09 |
| v0-1.5-md | v0-1.5-md | 128K | 32K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-06-09 |
| v0-1.0-md | v0-1.0-md | 128K | 32K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-05-22 |
Venice AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Dolphin 72B | dolphin-2.9.2-qwen2-72b | 32.8K | 8.2K | Input: $0.7 Output: $2.8 | Model: 0.350 Completion: 4.000 | 🌡️ | 2021-09 | In: text Out: text | Open Weights Released: 2025-05-21 |
| Venice Medium | mistral-31-24b | 131.1K | 8.2K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2023-10 | In: text, image Out: text | Open Weights Released: 2025-07-15 |
| Venice Uncensored 1.1 | venice-uncensored | 32.8K | 8.2K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-07-15 |
| Qwen 2.5 VL 72B | qwen-2.5-vl | 32.8K | 8.2K | Input: $0.7 Output: $2.8 | Model: 0.350 Completion: 4.000 | 🌡️ | 2023-10 | In: text, image Out: text | Open Weights Released: 2025-06-09 |
| Venice Large | qwen3-235b | 131.1K | 8.2K | Input: $1.5 Output: $6 | Model: 0.750 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-27 |
| Venice Reasoning | qwen-2.5-qwq-32b | 32.8K | 8.2K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🧠 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-07-08 |
| DeepSeek Coder V2 Lite | deepseek-coder-v2-lite | 131.1K | 8.2K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🌡️ | 2021-09 | In: text Out: text | Open Weights Released: 2025-06-22 |
| Venice Small | qwen3-4b | 32.8K | 8.2K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-07-27 |
| Llama 3.3 70B | llama-3.3-70b | 65.5K | 8.2K | Input: $0.7 Output: $2.8 | Model: 0.350 Completion: 4.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-06-09 |
| Qwen 2.5 Coder 32B | qwen-2.5-coder-32b | 32.8K | 8.2K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-06-14 |
| DeepSeek R1 671B | deepseek-r1-671b | 131.1K | 8.2K | Input: $3.5 Output: $14 | Model: 1.750 Completion: 4.000 | 🧠 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2025-06-05 |
| Llama 3.2 3B | llama-3.2-3b | 131.1K | 8.2K | Input: $0.15 Output: $0.6 | Model: 0.075 Completion: 4.000 | 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-05-23 |
| Llama 3.1 405B | llama-3.1-405b | 65.5K | 8.2K | Input: $1.5 Output: $6 | Model: 0.750 Completion: 4.000 | 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2025-06-30 |
| GLM 4.6 | zai-org-glm-4.6 | 202.8K | 8.2K | Input: $0.85 Output: $2.75 | Model: 0.425 Completion: 3.235 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
Vercel AI Gateway¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 Instruct | moonshotai/kimi-k2 | 131.1K | 16.4K | Input: $1 Output: $3 | Model: 0.500 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| Qwen3 Next 80B A3B Instruct | alibaba/qwen3-next-80b-a3b-instruct | 131.1K | 32.8K | Input: $0.5 Output: $2 | Model: 0.250 Completion: 4.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-12 |
| Qwen3 VL Instruct | alibaba/qwen3-vl-instruct | 131.1K | 129K | Input: $0.7 Output: $2.8 | Model: 0.350 Completion: 4.000 | 📎 🔧 🌡️ | 2025-04 | In: text, image Out: text | Open Weights Released: 2025-09-24 |
| Qwen3 VL Thinking | alibaba/qwen3-vl-thinking | 131.1K | 129K | Input: $0.7 Output: $8.4 | Model: 0.350 Completion: 12.000 | 📎 🧠 🔧 🌡️ | 2025-09 | In: text, image Out: text | Open Weights Released: 2025-09-24 |
| Qwen3 Max | alibaba/qwen3-max | 262.1K | 32.8K | Input: $1.2 Output: $6 | Model: 0.600 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Released: 2025-09-23 |
| Qwen3 Coder Plus | alibaba/qwen3-coder-plus | 1M | 1M | Input: $1 Output: $5 | Model: 0.500 Completion: 5.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3 Next 80B A3B Thinking | alibaba/qwen3-next-80b-a3b-thinking | 131.1K | 32.8K | Input: $0.5 Output: $6 | Model: 0.250 Completion: 12.000 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-12 |
| Grok 3 Mini Fast | xai/grok-3-mini-fast | 131.1K | 8.2K | Input: $0.6 Output: $4 Cache Read: $0.15 Reasoning: $4 | Model: 0.300 Completion: 6.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 3 Mini | xai/grok-3-mini | 131.1K | 8.2K | Input: $0.3 Output: $0.5 Cache Read: $0.075 Reasoning: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4 Fast | xai/grok-4-fast | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
| Grok 3 | xai/grok-3 | 131.1K | 8.2K | Input: $3 Output: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 2 | xai/grok-2 | 131.1K | 8.2K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-20 |
| Grok Code Fast 1 | xai/grok-code-fast-1 | 256K | 10K | Input: $0.2 Output: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-08-28 |
| Grok 2 Vision | xai/grok-2-vision | 8.2K | 4.1K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 |
| Grok 4 | xai/grok-4 | 256K | 64K | Input: $3 Output: $15 Cache Read: $0.75 Reasoning: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
| Grok 3 Fast | xai/grok-3-fast | 131.1K | 8.2K | Input: $5 Output: $25 Cache Read: $1.25 | Model: 2.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4 Fast (Non-Reasoning) | xai/grok-4-fast-non-reasoning | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
| Codestral | mistral/codestral | 256K | 4.1K | Input: $0.3 Output: $0.9 | Model: 0.150 Completion: 3.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-05-29 Updated: 2025-01-04 |
| Magistral Medium | mistral/magistral-medium | 128K | 16.4K | Input: $2 Output: $5 | Model: 1.000 Completion: 2.500 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 Updated: 2025-03-20 |
| Mistral Large | mistral/mistral-large | 131.1K | 16.4K | Input: $2 Output: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-11 | In: text Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
| Pixtral Large | mistral/pixtral-large | 128K | 128K | Input: $2 Output: $6 | Model: 1.000 Completion: 3.000 | 📎 🔧 🌡️ | 2024-11 | In: text, image Out: text | Open Weights Released: 2024-11-01 Updated: 2024-11-04 |
| Ministral 8B | mistral/ministral-8b | 128K | 128K | Input: $0.1 Output: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
| Ministral 3B | mistral/ministral-3b | 128K | 128K | Input: $0.04 Output: $0.04 | Model: 0.020 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-10-01 Updated: 2024-10-04 |
| Magistral Small | mistral/magistral-small | 128K | 128K | Input: $0.5 Output: $1.5 | Model: 0.250 Completion: 3.000 | 🧠 🔧 🌡️ | 2025-06 | In: text Out: text | Open Weights Released: 2025-03-17 |
| Mistral Small | mistral/mistral-small | 128K | 16.4K | Input: $0.1 Output: $0.3 | Model: 0.050 Completion: 3.000 | 🔧 🌡️ | 2025-03 | In: text, image Out: text | Open Weights Released: 2024-09-01 Updated: 2024-09-04 |
| Pixtral 12B | mistral/pixtral-12b | 128K | 128K | Input: $0.15 Output: $0.15 | Model: 0.075 Completion: 1.000 | 📎 🔧 🌡️ | 2024-09 | In: text, image Out: text | Open Weights Released: 2024-09-01 |
| Mixtral 8x22B | mistral/mixtral-8x22b-instruct | 64K | 64K | Input: $2 Output: $6 | Model: 1.000 Completion: 3.000 | 🔧 🌡️ | 2024-04 | In: text Out: text | Open Weights Released: 2024-04-17 |
| v0-1.0-md | vercel/v0-1.0-md | 128K | 32K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-05-22 |
| v0-1.5-md | vercel/v0-1.5-md | 128K | 32K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 📎 🧠 🔧 🌡️ | - | In: text, image Out: text | Released: 2025-06-09 |
| DeepSeek V3.2 Exp Thinking | deepseek/deepseek-v3.2-exp-thinking | 163.8K | 8.2K | Input: $0.28 Output: $0.42 | Model: 0.140 Completion: 1.500 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Released: 2025-09-29 |
| DeepSeek V3.1 Terminus | deepseek/deepseek-v3.1-terminus | 128K | 8.2K | Input: $0.27 Output: $1 | Model: 0.135 Completion: 3.704 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-22 |
| DeepSeek V3.2 Exp | deepseek/deepseek-v3.2-exp | 163.8K | 8.2K | Input: $0.28 Output: $0.42 | Model: 0.140 Completion: 1.500 | 🔧 🌡️ | 2025-09 | In: text Out: text | Released: 2025-09-29 |
| DeepSeek R1 Distill Llama 70B | deepseek/deepseek-r1-distill-llama-70b | 131.1K | 8.2K | Input: $0.75 Output: $0.99 | Model: 0.375 Completion: 1.320 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Open Weights Released: 2025-01-20 |
| DeepSeek-R1 | deepseek/deepseek-r1 | 128K | 32.8K | Input: $1.35 Output: $5.4 | Model: 0.675 Completion: 4.000 | 🧠 🔧 🌡️ | 2024-07 | In: text Out: text | Released: 2025-01-20 Updated: 2025-05-29 |
| MiniMax M2 | minimax/minimax-m2 | 205K | 131.1K | Input: $0.3 Output: $1.2 Cache Read: $0.03 Cache Write: $0.38 | Model: 0.150 Completion: 4.000 Cache: 0.100 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-10-27 |
| Gemini 3 Pro Preview | google/gemini-3-pro-preview | 1M | 64K | Input: $2 Output: $12 Cache Read: $0.2 | Model: 1.000 Completion: 6.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, video, audio, pdf Out: text | Released: 2025-11-18 |
| Gemini 2.5 Flash Lite | google/gemini-2.5-flash-lite | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-06-17 |
| Gemini 2.5 Flash Preview 09-25 | google/gemini-2.5-flash-preview-09-2025 | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Cache Write: $0.383 | Model: 0.150 Completion: 8.333 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Flash Lite Preview 09-25 | google/gemini-2.5-flash-lite-preview-09-2025 | 1M | 65.5K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-09-25 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| Gemini 2.0 Flash | google/gemini-2.0-flash | 1M | 8.2K | Input: $0.1 Output: $0.4 Cache Read: $0.025 | Model: 0.050 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemini 2.0 Flash Lite | google/gemini-2.0-flash-lite | 1M | 8.2K | Input: $0.075 Output: $0.3 | Model: 0.037 Completion: 4.000 | 📎 🔧 🌡️ | 2024-06 | In: text, image, audio, video, pdf Out: text | Released: 2024-12-11 |
| Gemini 2.5 Flash | google/gemini-2.5-flash | 1M | 65.5K | Input: $0.3 Output: $2.5 Cache Read: $0.075 Input Audio: $1 | Model: 0.500 Completion: 2.500 Cache: 0.075 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| GPT OSS 20B | openai/gpt-oss-20b | 131.1K | 32.8K | Input: $0.07 Output: $0.3 | Model: 0.035 Completion: 4.286 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT OSS 120B | openai/gpt-oss-120b | 131.1K | 32.8K | Input: $0.1 Output: $0.5 | Model: 0.050 Completion: 5.000 | 🧠 🔧 🌡️ | - | In: text Out: text | Open Weights Released: 2025-08-05 |
| GPT-5 | openai/gpt-5 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.13 | Model: 0.625 Completion: 8.000 Cache: 0.104 | 📎 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-08-07 |
| GPT-4o mini | openai/gpt-4o-mini | 128K | 16.4K | Input: $0.15 Output: $0.6 Cache Read: $0.08 | Model: 0.075 Completion: 4.000 Cache: 0.533 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-07-18 |
| o3 | openai/o3 | 200K | 100K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| GPT-5 Mini | openai/gpt-5-mini | 400K | 128K | Input: $0.25 Output: $2 Cache Read: $0.03 | Model: 0.125 Completion: 8.000 Cache: 0.120 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| o1 | openai/o1 | 200K | 100K | Input: $15 Output: $60 Cache Read: $7.5 | Model: 7.500 Completion: 4.000 Cache: 0.500 | 📎 🧠 🔧 | 2023-09 | In: text, image Out: text | Released: 2024-12-05 |
| o4-mini | openai/o4-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.28 | Model: 0.550 Completion: 4.000 Cache: 0.255 | 📎 🧠 🔧 | 2024-05 | In: text, image Out: text | Released: 2025-04-16 |
| GPT-4.1 | openai/gpt-4.1 | 1M | 32.8K | Input: $2 Output: $8 Cache Read: $0.5 | Model: 1.000 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-4o | openai/gpt-4o | 128K | 16.4K | Input: $2.5 Output: $10 Cache Read: $1.25 | Model: 1.250 Completion: 4.000 Cache: 0.500 | 📎 🔧 🌡️ | 2023-09 | In: text, image Out: text | Released: 2024-05-13 Updated: 2024-08-06 |
| GPT-5-Codex | openai/gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 🧠 🔧 | 2024-09-30 | In: text, image Out: text | Released: 2025-09-15 |
| GPT-5 Nano | openai/gpt-5-nano | 400K | 128K | Input: $0.05 Output: $0.4 Cache Read: $0.01 | Model: 0.025 Completion: 8.000 Cache: 0.200 | 📎 🧠 🔧 | 2024-05-30 | In: text, image Out: text | Released: 2025-08-07 |
| o3-mini | openai/o3-mini | 200K | 100K | Input: $1.1 Output: $4.4 Cache Read: $0.55 | Model: 0.550 Completion: 4.000 Cache: 0.500 | 🧠 🔧 | 2024-05 | In: text Out: text | Released: 2024-12-20 Updated: 2025-01-29 |
| GPT-4 Turbo | openai/gpt-4-turbo | 128K | 4.1K | Input: $10 Output: $30 | Model: 5.000 Completion: 3.000 | 📎 🔧 🌡️ | 2023-12 | In: text, image Out: text | Released: 2023-11-06 Updated: 2024-04-09 |
| GPT-4.1 mini | openai/gpt-4.1-mini | 1M | 32.8K | Input: $0.4 Output: $1.6 Cache Read: $0.1 | Model: 0.200 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| GPT-4.1 nano | openai/gpt-4.1-nano | 1M | 32.8K | Input: $0.1 Output: $0.4 Cache Read: $0.03 | Model: 0.050 Completion: 4.000 Cache: 0.300 | 📎 🔧 🌡️ | 2024-04 | In: text, image Out: text | Released: 2025-04-14 |
| Sonar Reasoning | perplexity/sonar-reasoning | 127K | 8K | Input: $1 Output: $5 | Model: 0.500 Completion: 5.000 | 🧠 🌡️ | 2025-09 | In: text Out: text | Released: 2025-02-19 |
| Sonar | perplexity/sonar | 127K | 8K | Input: $1 Output: $1 | Model: 0.500 Completion: 1.000 | 🌡️ | 2025-02 | In: text, image Out: text | Released: 2025-02-19 |
| Sonar Pro | perplexity/sonar-pro | 200K | 8K | Input: $3 Output: $15 | Model: 1.500 Completion: 5.000 | 🌡️ | 2025-09 | In: text, image Out: text | Released: 2025-02-19 |
| Sonar Reasoning Pro | perplexity/sonar-reasoning-pro | 127K | 8K | Input: $2 Output: $8 | Model: 1.000 Completion: 4.000 | 🧠 🌡️ | 2025-09 | In: text Out: text | Released: 2025-02-19 |
| GLM 4.5 | zai/glm-4.5 | 128K | 96K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5 Air | zai/glm-4.5-air | 128K | 96K | Input: $0.2 Output: $1.1 | Model: 0.100 Completion: 5.500 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5V | zai/glm-4.5v | 66K | 16K | Input: $0.6 Output: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-08 | In: text, image Out: text | Open Weights Released: 2025-08-11 |
| GLM 4.6 | zai/glm-4.6 | 200K | 96K | Input: $0.6 Output: $2.2 | Model: 0.300 Completion: 3.667 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| Nova Micro | amazon/nova-micro | 128K | 8.2K | Input: $0.035 Output: $0.14 Cache Read: $0.00875 | Model: 0.018 Completion: 4.000 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2024-12-03 |
| Nova Pro | amazon/nova-pro | 300K | 8.2K | Input: $0.8 Output: $3.2 Cache Read: $0.2 | Model: 0.400 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
| Nova Lite | amazon/nova-lite | 300K | 8.2K | Input: $0.06 Output: $0.24 Cache Read: $0.015 | Model: 0.030 Completion: 4.000 Cache: 0.250 | 📎 🔧 🌡️ | 2024-10 | In: text, image, video Out: text | Released: 2024-12-03 |
| Morph v3 Fast | morph/morph-v3-fast | 16K | 16K | Input: $0.8 Output: $1.2 | Model: 0.400 Completion: 1.500 | - | - | In: text Out: text | Released: 2024-08-15 |
| Morph v3 Large | morph/morph-v3-large | 32K | 32K | Input: $0.9 Output: $1.9 | Model: 0.450 Completion: 2.111 | - | - | In: text Out: text | Released: 2024-08-15 |
| Llama-4-Scout-17B-16E-Instruct-FP8 | meta/llama-4-scout | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Llama-3.3-70B-Instruct | meta/llama-3.3-70b | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama-4-Maverick-17B-128E-Instruct-FP8 | meta/llama-4-maverick | 128K | 4.1K | Input: $0 Output: $0 | - | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Open Weights Released: 2025-04-05 |
| Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 200K | 64K | Input: $1 Output: $1.25 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 1.250 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Claude Haiku 3.5 | anthropic/claude-3.5-haiku | 200K | 8.2K | Input: $0.8 Output: $4 Cache Read: $0.08 Cache Write: $1 | Model: 0.400 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-07-31 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Sonnet 3.7 | anthropic/claude-3.7-sonnet | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-31 | In: text, image Out: text | Released: 2025-02-19 |
| Claude Sonnet 4.5 | anthropic/claude-4.5-sonnet | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
| Claude Sonnet 3.5 v2 | anthropic/claude-3.5-sonnet | 200K | 8.2K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2024-04-30 | In: text, image Out: text | Released: 2024-10-22 |
| Claude Opus 4 | anthropic/claude-4-1-opus | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Sonnet 4 | anthropic/claude-4-sonnet | 200K | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Claude Opus 3 | anthropic/claude-3-opus | 200K | 4.1K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-02-29 |
| Claude Haiku 3 | anthropic/claude-3-haiku | 200K | 4.1K | Input: $0.25 Output: $1.25 Cache Read: $0.03 Cache Write: $0.3 | Model: 0.125 Completion: 5.000 Cache: 0.120 | 📎 🔧 🌡️ | 2023-08-31 | In: text, image Out: text | Released: 2024-03-13 |
| Claude Opus 4 | anthropic/claude-4-opus | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-05-22 |
| Qwen 3 Coder 480B | cerebras/qwen3-coder | 131K | 32K | Input: $2 Output: $2 | Model: 1.000 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
Vultr¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| DeepSeek R1 Distill Qwen 32B | deepseek-r1-distill-qwen-32b | 121.8K | 8.2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-20 |
| Qwen2.5 Coder 32B Instruct | qwen2.5-coder-32b-instruct | 13K | 2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-11-06 |
| Kimi K2 Instruct | kimi-k2-instruct | 58.9K | 4.1K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2024-07-18 |
| DeepSeek R1 Distill Llama 70B | deepseek-r1-distill-llama-70b | 121.8K | 8.2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🧠 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-01-20 |
| GPT OSS 120B | gpt-oss-120b | 121.8K | 8.2K | Input: $0.2 Output: $0.2 | Model: 0.100 Completion: 1.000 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-06-23 |
Weights & Biases¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi-K2-Instruct | moonshotai/Kimi-K2-Instruct | 128K | 16.4K | Input: $1.35 Output: $4 | Model: 0.675 Completion: 2.963 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-07-14 |
| Phi-4-mini-instruct | microsoft/Phi-4-mini-instruct | 128K | 4.1K | Input: $0.08 Output: $0.35 | Model: 0.040 Completion: 4.375 | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Open Weights Released: 2024-12-11 |
| Meta-Llama-3.1-8B-Instruct | meta-llama/Llama-3.1-8B-Instruct | 128K | 32.8K | Input: $0.22 Output: $0.22 | Model: 0.110 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-07-23 |
| Llama-3.3-70B-Instruct | meta-llama/Llama-3.3-70B-Instruct | 128K | 32.8K | Input: $0.71 Output: $0.71 | Model: 0.355 Completion: 1.000 | 🧠 🔧 🌡️ | 2023-12 | In: text Out: text | Open Weights Released: 2024-12-06 |
| Llama 4 Scout 17B 16E Instruct | meta-llama/Llama-4-Scout-17B-16E-Instruct | 64K | 8.2K | Input: $0.17 Output: $0.66 | Model: 0.085 Completion: 3.882 | 🧠 🔧 🌡️ | 2024-12 | In: text, image Out: text | Open Weights Released: 2025-01-31 |
| Qwen3 235B A22B Instruct 2507 | Qwen/Qwen3-235B-A22B-Instruct-2507 | 262.1K | 131.1K | Input: $0.1 Output: $0.1 | Model: 0.050 Completion: 1.000 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-04-28 Updated: 2025-07-21 |
| Qwen3-Coder-480B-A35B-Instruct | Qwen/Qwen3-Coder-480B-A35B-Instruct | 262.1K | 66.5K | Input: $1 Output: $1.5 | Model: 0.500 Completion: 1.500 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| Qwen3-235B-A22B-Thinking-2507 | Qwen/Qwen3-235B-A22B-Thinking-2507 | 262.1K | 131.1K | Input: $0.1 Output: $0.1 | Model: 0.050 Completion: 1.000 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
| DeepSeek-R1-0528 | deepseek-ai/DeepSeek-R1-0528 | 161K | 163.8K | Input: $1.35 Output: $5.4 | Model: 0.675 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-05 | In: text Out: text | Open Weights Released: 2025-05-28 |
| DeepSeek-V3-0324 | deepseek-ai/DeepSeek-V3-0324 | 161K | 8.2K | Input: $1.14 Output: $2.75 | Model: 0.570 Completion: 2.412 | 🔧 🌡️ | 2024-10 | In: text Out: text | Open Weights Released: 2025-03-24 |
xAI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
| Grok 3 Fast | grok-3-fast | 131.1K | 8.2K | Input: $5 Output: $25 Cache Read: $1.25 | Model: 2.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4 | grok-4 | 256K | 64K | Input: $3 Output: $15 Cache Read: $0.75 Reasoning: $15 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Released: 2025-07-09 |
| Grok 2 Vision | grok-2-vision | 8.2K | 4.1K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 |
| Grok Code Fast 1 | grok-code-fast-1 | 256K | 10K | Input: $0.2 Output: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2023-10 | In: text Out: text | Released: 2025-08-28 |
| Grok 2 | grok-2 | 131.1K | 8.2K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-20 |
| Grok 3 Mini Fast Latest | grok-3-mini-fast-latest | 131.1K | 8.2K | Input: $0.6 Output: $4 Cache Read: $0.15 Reasoning: $4 | Model: 0.300 Completion: 6.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 2 Vision (1212) | grok-2-vision-1212 | 8.2K | 4.1K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 Updated: 2024-12-12 |
| Grok 3 | grok-3 | 131.1K | 8.2K | Input: $3 Output: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4 Fast | grok-4-fast | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-09-19 |
| Grok 2 Latest | grok-2-latest | 131.1K | 8.2K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-08-20 Updated: 2024-12-12 |
| Grok 4.1 Fast | grok-4-1-fast | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-11-19 |
| Grok 2 (1212) | grok-2-1212 | 131.1K | 8.2K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-12-12 |
| Grok 3 Fast Latest | grok-3-fast-latest | 131.1K | 8.2K | Input: $5 Output: $25 Cache Read: $1.25 | Model: 2.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 3 Latest | grok-3-latest | 131.1K | 8.2K | Input: $3 Output: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 2 Vision Latest | grok-2-vision-latest | 8.2K | 4.1K | Input: $2 Output: $10 Cache Read: $2 | Model: 1.000 Completion: 5.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-08-20 Updated: 2024-12-12 |
| Grok Vision Beta | grok-vision-beta | 8.2K | 4.1K | Input: $5 Output: $15 Cache Read: $5 | Model: 2.500 Completion: 3.000 Cache: 1.000 | 📎 🔧 🌡️ | 2024-08 | In: text, image Out: text | Released: 2024-11-01 |
| Grok 3 Mini | grok-3-mini | 131.1K | 8.2K | Input: $0.3 Output: $0.5 Cache Read: $0.075 Reasoning: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok Beta | grok-beta | 131.1K | 4.1K | Input: $5 Output: $15 Cache Read: $5 | Model: 2.500 Completion: 3.000 Cache: 1.000 | 🔧 🌡️ | 2024-08 | In: text Out: text | Released: 2024-11-01 |
| Grok 3 Mini Latest | grok-3-mini-latest | 131.1K | 8.2K | Input: $0.3 Output: $0.5 Cache Read: $0.075 Reasoning: $0.5 | Model: 0.150 Completion: 1.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
| Grok 4.1 Fast (Non-Reasoning) | grok-4-1-fast-non-reasoning | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 🌡️ | 2025-07 | In: text, image Out: text | Released: 2025-11-19 |
| Grok 3 Mini Fast | grok-3-mini-fast | 131.1K | 8.2K | Input: $0.6 Output: $4 Cache Read: $0.15 Reasoning: $4 | Model: 0.300 Completion: 6.667 Cache: 0.250 | 🧠 🔧 🌡️ | 2024-11 | In: text Out: text | Released: 2025-02-17 |
Z.AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5 | glm-4.5 | 131.1K | 98.3K | Input: $0.6 Output: $2.2 Cache Read: $0.11 Cache Write: $0 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | Input: $0.2 Output: $1.1 Cache Read: $0.03 Cache Write: $0 | Model: 0.100 Completion: 5.500 Cache: 0.150 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5V | glm-4.5v | 64K | 16.4K | Input: $0.6 Output: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
| GLM-4.6 | glm-4.6 | 204.8K | 131.1K | Input: $0.6 Output: $2.2 Cache Read: $0.11 Cache Write: $0 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
Z.AI Coding Plan¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5 | glm-4.5 | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM 4.5V | glm-4.5v | 64K | 16.4K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
| GLM-4.6 | glm-4.6 | 204.8K | 131.1K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
ZenMux¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| Kimi K2 Thinking Turbo | moonshotai/kimi-k2-thinking-turbo | 262.1K | 262.1K | Input: $1.15 Output: $8 | Model: 0.575 Completion: 6.957 | 🧠 🔧 🌡️ | 2025-11 | In: text Out: text | Released: 2025-11-06 |
| Kimi K2 0905 | moonshotai/kimi-k2-0905 | 262.1K | 16.4K | Input: $0.6 Output: $2.5 Cache Read: $0.15 | Model: 0.300 Completion: 4.167 Cache: 0.250 | 🔧 🌡️ | 2024-10 | In: text Out: text | Released: 2025-09-04 |
| Kimi K2 Thinking | moonshotai/kimi-k2-thinking | 262.1K | 262.1K | Input: $0.6 Output: $2.5 | Model: 0.300 Completion: 4.167 | 🧠 🔧 🌡️ | 2025-11 | In: text Out: text | Released: 2025-11-06 |
| Grok 4 Fast None Reasoning | x-ai/grok-4-fast-non-reasoning | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🔧 🌡️ | 2025-01-01 | In: text, image Out: text | Released: 2025-09-19 |
| Grok 4 | x-ai/grok-4 | 256K | 256K | Input: $3 Output: $15 Cache Read: $0.75 | Model: 1.500 Completion: 5.000 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01-01 | In: image, text Out: text | Released: 2025-07-09 |
| Grok Code Fast 1 | x-ai/grok-code-fast-1 | 256K | 10K | Input: $0.2 Output: $1.5 Cache Read: $0.02 | Model: 0.100 Completion: 7.500 Cache: 0.100 | 🧠 🔧 🌡️ | 2025-01-01 | In: text Out: text | Released: 2025-08-26 |
| Grok 4 Fast | x-ai/grok-4-fast | 2M | 30K | Input: $0.2 Output: $0.5 Cache Read: $0.05 | Model: 0.100 Completion: 2.500 Cache: 0.250 | 📎 🧠 🔧 🌡️ | 2025-01-01 | In: text, image Out: text | Released: 2025-09-19 |
| DeepSeek-V3.2-Exp (Non-thinking Mode) | deepseek/deepseek-chat | 128K | 8K | Input: $0.56 Output: $1.68 Cache Read: $0.07 | Model: 0.280 Completion: 3.000 Cache: 0.125 | 🧠 🔧 🌡️ | 2025-07 | In: text Out: text | Open Weights Released: 2025-09-29 |
| MiniMax M2 | minimax/minimax-m2 | 204.8K | 128K | Input: $0.3 Output: $1.2 | Model: 0.150 Completion: 4.000 | 🧠 🔧 🌡️ | 2025-10 | In: text Out: text | Released: 2025-10-27 |
| Gemini 2.5 Pro | google/gemini-2.5-pro | 1M | 65.5K | Input: $1.25 Output: $10 Cache Read: $0.31 Cache Write: $4.5 | Model: 0.625 Completion: 8.000 Cache: 0.248 | 📎 🧠 🔧 🌡️ | 2025-01 | In: text, image, audio, video, pdf Out: text | Released: 2025-03-20 Updated: 2025-06-05 |
| GPT-5 Codex | openai/gpt-5-codex | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-09-23 |
| GPT-5 | openai/gpt-5 | 400K | 128K | Input: $1.25 Output: $10 Cache Read: $0.125 | Model: 0.625 Completion: 8.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2024-10-01 | In: text, image Out: text | Released: 2025-08-07 |
| Ring-1T | inclusionai/ring-1t | 128K | 32K | Input: $0.56 Output: $2.24 Cache Read: $0.112 | Model: 0.280 Completion: 4.000 Cache: 0.200 | 🧠 🔧 🌡️ | 2025-10 | In: text Out: text | Open Weights Released: 2025-10-12 |
| Ling-1T | inclusionai/lint-1t | 128K | 32K | Input: $0.56 Output: $2.24 Cache Read: $0.112 | Model: 0.280 Completion: 4.000 Cache: 0.200 | 🧠 🔧 🌡️ | 2025-10 | In: text Out: text | Open Weights Released: 2025-10-09 |
| GLM 4.5 Air | z-ai/glm-4.5-air | 128K | 96K | Input: $0.11 Output: $0.56 Cache Read: $0.022 | Model: 0.055 Completion: 5.091 Cache: 0.200 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-25 |
| GLM 4.6 | z-ai/glm-4.6 | 200K | 128K | Input: $0.35 Output: $1.54 Cache Read: $0.07 | Model: 0.175 Completion: 4.400 Cache: 0.200 | 🧠 🔧 🌡️ | 2025-09 | In: text Out: text | Open Weights Released: 2025-09-30 |
| Qwen3 Coder Plus | qwen/qwen3-coder-plus | 1M | 66.5K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-23 |
| KAT-Coder-Pro-V1 | kuaishou/kat-coder-pro-v1 | 256K | 32K | Input: $0.6 Output: $2.4 Cache Read: $0.12 | Model: 0.300 Completion: 4.000 Cache: 0.200 | 🧠 🔧 🌡️ | 2025-01-01 | In: text Out: text | Released: 2025-10-23 |
| Claude Haiku 4.5 | anthropic/claude-haiku-4.5 | 200K | 64K | Input: $1 Output: $5 Cache Read: $0.1 Cache Write: $1.25 | Model: 0.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-02-31 | In: text, image Out: text | Released: 2025-10-15 |
| Claude Opus 4.1 | anthropic/claude-opus-4.1 | 200K | 32K | Input: $15 Output: $75 Cache Read: $1.5 Cache Write: $18.75 | Model: 7.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-03-31 | In: text, image Out: text | Released: 2025-08-05 |
| Claude Sonnet 4.5 | anthropic/claude-sonnet-4.5 | 1M | 64K | Input: $3 Output: $15 Cache Read: $0.3 Cache Write: $3.75 | Model: 1.500 Completion: 5.000 Cache: 0.100 | 📎 🧠 🔧 🌡️ | 2025-07-31 | In: text, image Out: text | Released: 2025-09-29 |
Zhipu AI¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GLM-4.6 | glm-4.6 | 204.8K | 131.1K | Input: $0.6 Output: $2.2 Cache Read: $0.11 Cache Write: $0 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| GLM 4.5V | glm-4.5v | 64K | 16.4K | Input: $0.6 Output: $1.8 | Model: 0.300 Completion: 3.000 | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
| GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | Input: $0.2 Output: $1.1 Cache Read: $0.03 Cache Write: $0 | Model: 0.100 Completion: 5.500 Cache: 0.150 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5 | glm-4.5 | 131.1K | 98.3K | Input: $0.6 Output: $2.2 Cache Read: $0.11 Cache Write: $0 | Model: 0.300 Completion: 3.667 Cache: 0.183 | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
Zhipu AI Coding Plan¶
| 模型 | 模型 ID | 上下文 | 输出 | 定价 (1M) | NewAPI 比率 | 能力 | 知识库 | 模态 | 详情 |
|---|---|---|---|---|---|---|---|---|---|
| GLM-4.6 | glm-4.6 | 204.8K | 131.1K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-09-30 |
| GLM 4.5V | glm-4.5v | 64K | 16.4K | Input: $0 Output: $0 | - | 📎 🧠 🔧 🌡️ | 2025-04 | In: text, image, video Out: text | Open Weights Released: 2025-08-11 |
| GLM-4.5-Air | glm-4.5-air | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5 | glm-4.5 | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |
| GLM-4.5-Flash | glm-4.5-flash | 131.1K | 98.3K | Input: $0 Output: $0 Cache Read: $0 Cache Write: $0 | - | 🧠 🔧 🌡️ | 2025-04 | In: text Out: text | Open Weights Released: 2025-07-28 |