Everything you need to integrate ApiFast into your application.
1. Getting Started
ApiFast provides access to AI models through an OpenAI-compatible API. No registration is required. Follow these steps:
1
Buy credit — Visit the buy page and deposit crypto (USDT or BTC). Your deposit is multiplied into API credit (current rate & first-deposit bonus shown on the buy page).
2
Receive your key — After payment confirmation, you receive an API key starting with sk_kf_.
3
Start making requests — Use the API exactly like OpenAI. Change the base URL and API key, everything else stays the same.
2. Authentication
Include your API key in the Authorization header as a Bearer token:
Authorization: Bearer sk_kf_your_api_key_here
Never share your API key publicly. If compromised, anyone can use your credit balance. Keys cannot be recovered once lost.
One key, every model — no model picker. You choose the model on each request by setting the model field to any Model ID from the table below. The same API key works with all of them, and you can switch models any time without a new key.
All prices are per 1 million tokens, updated daily from live market rates.
Model ID
Display Name
Intelligence
Cache / 1M
Input / 1M
Output / 1M
kimi-k2.7-code
Kimi K2.7 Code
#155.0*
$0.15
$0.74
$3.50
minimax-m3
MiniMax M3
#244.4
$0.06
$0.30
$1.20
deepseek-v4-pro
DeepSeek V4 Pro
#344.3
$0.00
$0.43
$0.87
kimi-k2.6
Kimi K2.6
#442.8
$0.34
$0.68
$3.41
mimo-v2.5-pro
MiMo-V2.5-Pro
#542.2
$0.00
$0.43
$0.87
deepseek-v4-flash
DeepSeek V4 Flash
#640.3
$0.02
$0.09
$0.18
glm-5.1
GLM 5.1
#740.2
$0.18
$0.98
$3.08
mimo-v2.5
MiMo-V2.5
#840.1
$0.00
$0.14
$0.28
glm-5
GLM 5
#939.5
$0.12
$0.60
$1.92
kimi-k2.5
Kimi K2.5
#1038.1
—
$0.38
$2.02
minimax-m2.7
MiniMax M2.7
#1138.1
$0.05
$0.25
$1.00
nemotron-3-ultra
Nemotron 3 Ultra
#1237.8
$0.10
$0.50
$2.20
glm-4.7
GLM 4.7
#1333.8
$0.08
$0.40
$1.75
qwen3.5:397b
Qwen3.5 397B A17B
#1533.7
—
$0.39
$2.45
minimax-m2.5
MiniMax M2.5
#1433.7
$0.05
$0.15
$0.90
minimax-m2.1
MiniMax M2.1
#1631.4
$0.03
$0.29
$0.95
gemma4:31b
Gemma 4 31B
#1729.4
$0.09
$0.12
$0.35
minimax-m2
MiniMax M2
#1828.3
$0.03
$0.26
$1.00
gemini-3-flash-preview
Gemini 3 Flash Preview
#1927.4
$0.05
$0.50
$3.00
nemotron-3-super
Nemotron 3 Super
#2025.4
—
$0.09
$0.45
deepseek-v3.2
DeepSeek V3.2
#2124.7
—
$0.23
$0.34
gpt-oss:120b
gpt-oss-120b
#2223.8
—
$0.04
$0.18
deepseek-v3.1:671b
DeepSeek V3.1 Terminus
#2321.4
$0.13
$0.27
$0.95
qwen3-coder-next
Qwen3 Coder Next
#2421.2
$0.07
$0.11
$0.80
rnj-1:8b
Rnj 1 Instruct
#2521.0*
—
$0.15
$0.15
qwen3-coder:480b
Qwen3 Coder 480B A35B
#2618.0
—
$0.22
$1.80
mistral-large-3:675b
Mistral Large
#2716.2
$0.20
$2.00
$6.00
devstral-2:123b
Devstral 2 2512
#2815.5
$0.04
$0.40
$2.00
gpt-oss:20b
gpt-oss-20b
#2914.9
—
$0.03
$0.14
devstral-small-2:24b
Devstral Small 2 24b
#3013.1
—
—
—
ministral-3:14b
Ministral 3 14B 2512
#3110.0
$0.02
$0.20
$0.20
ministral-3:8b
Ministral 3 8B 2512
#328.9
$0.01
$0.15
$0.15
nemotron-3-nano:30b
Nemotron 3 Nano 30B A3B
#337.4
—
$0.05
$0.20
ministral-3:3b
Ministral 3 3B 2512
#345.6
$0.01
$0.10
$0.10
gemma3:27b
Gemma 3 27B
#354.8
—
$0.08
$0.16
gemma3:12b
Gemma 3 12B
#363.4
—
$0.05
$0.15
gemma3:4b
Gemma 3 4B
#371.1
—
$0.05
$0.10
List models programmatically
Fetch the live catalog (ids + prices) anytime — handy for building your own model selector:
Beyond chat, the same API key works with image generation, text-to-speech and transcription — each on its own OpenAI-compatible endpoint. These are billed per unit (per image, per 1M characters, or per minute of audio), not per token.
Your throughput scales with your tier, which is set by your largest deposit (it never goes down). Limits apply per API key across all endpoints. Exceeding them returns a 429 — retry with exponential backoff.
Tier
Unlocked at
Requests / min
Tokens / min
Tier 1 · Starter
≥ $1
20
50,000
Tier 2 · Pro
≥ $20
60
200,000
Tier 3 · Scale
≥ $50
200
1,000,000
Concurrency — each key runs ONE request at a time (1 agent per key): concurrent calls on the same key queue and are served one-by-one, never in parallel. To run multiple agents at once, use a separate key per agent. A global per-IP ceiling of 240 requests/min also applies before authentication.