🏆HyperNova 60B, the world's most efficient model in its category according to Artificial Analysis.

CompactifAI·Inference API

The fastest and most affordable way to access leading AI models

Original and Slim by CompactifAI

View Models Read Documentation

Why our API?

Lowest Cost for Open-Source Models

Strongest Throughput & TTFT-to-Price Performance Ratio

Plug & Play, No Infrastructure Needed

Scalable Enterprise Deployment & Billed per Usage

Private Endpoints Available on Private Offer

Model Catalog

CompactifAI Only

Market-Leading Price

TOP Speed-to-Price Ratio

Best Value

Multimodal

Hypernova 60B

CompactifAI

Input Cost

$0.04/M

Output Cost

$0.14/M

Buy with

Whisper Large V3 Turbo Slim

CompactifAI

Transcription Cost

$0.000134/Min

Buy with

Nemotron 3 Nano Omni

Input Cost

$0.20/M

Output Cost

$0.80/M

Buy with

OpenAI gpt-oss-20b

Input Cost

$0.03/M

Output Cost

$0.10/M

Buy with

OpenAI gpt-oss-120b

Input Cost

$0.05/M

Output Cost

$0.23/M

Buy with

Whisper Large V3

Transcription Cost

$0.00034/Min (Audio)

Buy with

GLM 5.1

Input Cost

$0.95/M

Output Cost

$3.15/M

Buy with

Llama 3.3 70B Slim

CompactifAI

Input Cost

$0.11/M

Output Cost

$0.21/M

Benchmark

Buy with

Mistral Small 3.1 Slim

CompactifAI

Input Cost

$0.05/M

Output Cost

$0.08/M

Benchmark

Buy with

Hypernova 60B

CompactifAI

Input Cost

$0.04/M

Output Cost

$0.14/M

Whisper Large V3 Turbo Slim

CompactifAI

Transcription Cost

$0.000134/Min

Nemotron 3 Nano Omni

Input Cost

$0.20/M

Output Cost

$0.80/M

OpenAI gpt-oss-20b

Input Cost

$0.03/M

Output Cost

$0.10/M

OpenAI gpt-oss-120b

Input Cost

$0.05/M

Output Cost

$0.23/M

Whisper Large V3

Transcription Cost

$0.00034/Min (Audio)

GLM 5.1

Input Cost

$0.95/M

Output Cost

$3.15/M

Llama 3.3 70B Slim

CompactifAI

Input Cost

$0.11/M

Output Cost

$0.21/M

Mistral Small 3.1 Slim

CompactifAI

Input Cost

$0.05/M

Output Cost

$0.08/M

Need a Private Endpoint or Have Questions?

Our team is ready to help you with custom deployments, private offers, and any technical questions you may have.

About Private Deployment