Wafer logo
ModelsUsageBilling
Docs
Wafer logo
ModelsUsageBilling
Docs

Models

All models below are available serverless — pay per token with a Wafer API key.

/
Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "GLM-5.1",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "Kimi-K2.6",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "deepseek-v4-pro",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "deepseek-v4-flash",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "MiniMax-M3",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "Qwen3.5-397B-A17B",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "Qwen3.6-35B-A3B",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

Quick start
1curl -sS "https://pass.wafer.ai/v1/chat/completions" \
2 -H "Authorization: Bearer <YOUR_API_KEY>" \
3 -H "Content-Type: application/json" \
4 -d '{
5 "model": "qwen3.7-max",
6 "messages": [{"role": "user", "content": "Hello!"}],
7 "max_tokens": 64,
8 "thinking": {"type": "disabled"}
9 }'

Use your Wafer API key as <YOUR_API_KEY> or set WAFER_API_KEY for SDK snippets. Manage keys on your Wafer API key.

More coming soon

New models added regularly. Stay tuned.