CompactifAI·Inference API

The fastest and most affordable way to access leading AI models

Original and Slim by CompactifAI

Why our API?

Lowest Cost for Open-Source Models

Strongest Throughput & TTFT-to-Price Performance Ratio

Plug & Play, No Infrastructure Needed

Scalable Enterprise Deployment & Billed per Usage

Private Endpoints Available on Private Offer

Model Catalog

CompactifAI Only
Market-Leading Price
TOP Speed-to-Price Ratio
Best Value
Multimodal
Hypernova 60B
Hypernova 60B
CompactifAI
Input Cost
$0.04/M
Output Cost
$0.14/M
Whisper Large V3 Turbo Slim
Whisper Large V3 Turbo Slim
CompactifAI
Transcription Cost
$0.000134/Min
Nemotron 3 Nano Omni
Nemotron 3 Nano Omni
Input Cost
$0.20/M
Output Cost
$0.80/M
OpenAI gpt-oss-20b
OpenAI gpt-oss-20b
Input Cost
$0.03/M
Output Cost
$0.10/M
OpenAI gpt-oss-120b
OpenAI gpt-oss-120b
Input Cost
$0.05/M
Output Cost
$0.23/M
Whisper Large V3
Whisper Large V3
Transcription Cost
$0.00034/Min (Audio)
GLM 5.1
GLM 5.1
Input Cost
$0.95/M
Output Cost
$3.15/M
Llama 3.3 70B Slim
Llama 3.3 70B Slim
CompactifAI
Input Cost
$0.11/M
Output Cost
$0.21/M
Mistral Small 3.1 Slim
Mistral Small 3.1 Slim
CompactifAI
Input Cost
$0.05/M
Output Cost
$0.08/M

Need a Private Endpoint or Have Questions?

Our team is ready to help you with custom deployments, private offers, and any technical questions you may have.