Skip to content
Deepinfra logo

Deepinfra Pricing 2026

Plans, hidden costs, and cheaper alternatives compared

Is Deepinfra worth the price?

75/10

Deepinfra offers a highly granular and generally fair pricing model, especially for specific AI models.

The 'moonshotai/Kimi-K2.5' model, for instance, is quite expensive at $0.45/M in and $2.80/M out compared to other models like 'zai-org/GLM-4.7-Flash' at $0.06/M in and $0.40/M out. This platform is best for developers and businesses with specific AI model needs who can closely monitor their token usage.

Pricing Plans

moonshotai/Kimi-K2.5 (text-generation)

$0.45/M in • $2.80/M out

  • 256k context window
  • $0.09 cached / 1M tokens

zai-org/GLM-4.7-Flash (text-generation)

$0.06/M in • $0.40/M out

  • bfloat16
  • 198k context window
  • $0.01 cached / 1M tokens

nvidia/Nemotron-3-Nano-30B-A3B (text-generation)

$0.05/M in • $0.20/M out

  • fp4
  • 256k context window

NVIDIA gpu-rental On-Demand DGX B200 GPUs

$2.49/instance-hour

deepseek-ai/DeepSeek-V3.2 (text-generation)

$0.26/M in • $0.38/M out

  • fp4
  • 160k context window
  • $0.13 cached / 1M tokens

Bria/fibo_edit (text-to-image)

$0.00/image

  • Free for a limited time

Bria/video_eraser (text-to-video)

$0.14/second

Bria/video_foreground_mask (text-to-video)

$0.14/second

Bria/video_increase_resolution (text-to-video)

$0.14/second

Bria/video_mask_by_key_points (text-to-video)

$0.14/second

Bria/video_mask_by_prompt (text-to-video)

$0.14/second

Bria/video_remove_background (text-to-video)

$0.14/second

PrunaAI/p-image (text-to-image)

$0.005/image

PrunaAI/p-image-Edit (text-to-image)

$0.01/image

bosonai/HiggsAudioV2.5 (text-to-speech)

$20.00 per 1M characters

ResembleAI/chatterbox-turbo (text-to-speech)

$1.00 per 1M characters

Hidden Costs & Gotchas

High costs for certain premium models

Potential for rapid token consumption

GPU rental adds significant hourly cost

Which Plan Do You Need?

Developers needing specific AI models

Cost-conscious, usage-based projects

Teams with variable inference loads

How Deepinfra Compares to Competitors

Compared to OpenAI, Deepinfra's pricing for some models like 'zai-org/GLM-4.7-Flash' ($0.06/M in, $0.40/M out) can be more competitive than OpenAI's GPT-3.5 Turbo. However, for high-end models, OpenAI often offers more integrated features. Deepinfra's GPU rental at $2.49/instance-hour is comparable to other cloud providers but requires careful management.

Deepinfra Pricing FAQ

How much does Deepinfra cost?

Deepinfra starts at $2.49/month on the NVIDIA gpu-rental On-Demand DGX B200 GPUs plan. A free plan is also available with limited features.

Does Deepinfra have a free plan?

Yes. Deepinfra offers a free plan called "moonshotai/Kimi-K2.5 (text-generation)". It includes: 256k context window, $0.09 cached / 1M tokens.

Is there a cheaper alternative to Deepinfra?

Yes. Popular alternatives to Deepinfra include Hugging Face, Fireworks AI, OpenAI API, Anthropic Agent SDK. Free alternatives include Hugging Face, Anthropic Agent SDK. Compare them side-by-side on Toolradar.

Cheaper alternatives to Deepinfra

Direct competitors with similar features. Many offer free tiers or lower per-seat pricing.