Together AI

Run open-source LLMs with serverless inference and fine-tuning

Reviews on G2
5 reviews tracked · 2 press mentions

The Bottom Line

Entry price: Paid plans only
Biggest pro: Many open models
Biggest con: Smaller than big providers

TL;DR - Together AI

  • Together AI is an inference platform for open-source AI models
  • It provides fast, affordable access to leading open models
  • Pay-per-token pricing, with listed rates starting at $0.18 per 1M tokens (Llama 3.1 8B)
Pricing: Paid only
Best for: Enterprises & pros
4.8/5 across review platforms

What is Together AI?

Editorial review
Together AI is a platform for running open-source models. It offers serverless inference, fine-tuning, and a GPU cloud, with competitive pricing for models such as Llama and FLUX.

Available on: Web

Pros & Cons

Pros

  • Many open models
  • Competitive pricing
  • Fast inference
  • Good for startups
  • Fine-tuning available

Cons

  • Smaller than big providers
  • Model quality varies
  • Basic support
  • Documentation gaps
  • Newer platform

Ratings Across the Web

4.8 (5 reviews)

Ratings aggregated from independent review platforms.

Key Features

  • LLM inference
  • Image generation
  • Fine-tuning
  • GPU cloud
  • Batch processing
  • Code execution
  • Dedicated endpoints
  • Open-source models
  • Competitive pricing
  • Single-tenant options

Pricing Plans

Serverless Inference

  • Pay per 1M tokens
  • Llama 3.1 8B: $0.18/1M tokens
  • Llama 3.1 405B: $3.50/1M tokens
  • FLUX.1 dev: $0.025/megapixel
  • Batch API: 50% lower cost
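
To make the per-token arithmetic concrete, here is a minimal Python sketch that turns the rates listed above into a cost estimate. It is an illustration only: the dictionary keys and function names are hypothetical labels rather than official model IDs or API calls, and it assumes a single flat rate per token (the listing does not say whether input and output tokens are priced differently) and that the 50% batch discount applies to the chosen model.

```python
# Rates copied from the pricing list above (USD per 1M tokens).
# Keys are hypothetical labels, not official model identifiers.
RATES_PER_MILLION = {
    "llama-3.1-8b": 0.18,
    "llama-3.1-405b": 3.50,
}

# "Batch API: 50% lower cost" (assumed to apply to the model in question).
BATCH_DISCOUNT = 0.50

# FLUX.1 dev is priced per megapixel rather than per token.
FLUX_DEV_RATE_PER_MEGAPIXEL = 0.025


def estimate_token_cost(model: str, tokens: int, batch: bool = False) -> float:
    """Estimate the USD cost of processing `tokens` tokens on `model`."""
    cost = RATES_PER_MILLION[model] * tokens / 1_000_000
    if batch:
        cost *= 1 - BATCH_DISCOUNT
    return round(cost, 4)


def estimate_image_cost(megapixels: float) -> float:
    """Estimate the USD cost of generating `megapixels` of FLUX.1 dev output."""
    return round(megapixels * FLUX_DEV_RATE_PER_MEGAPIXEL, 4)
```

Under these assumptions, 2 million tokens on Llama 3.1 405B sent through the batch path would come to about $3.50, half of the $7.00 on-demand figure.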

Fine-Tuning

  • $0.48-2.90/1M tokens (by model size)
  • DeepSeek, GLM, Kimi support
  • Minimum charges for specialized models
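
Because the fine-tuning rate depends on model size, a budget is easier to express as a range. The sketch below is a rough illustration in Python: it assumes billed tokens equal dataset tokens multiplied by the number of epochs (the listing does not define how tokens are counted), and it ignores the minimum charges mentioned above. The function name is hypothetical.

```python
# Fine-tuning rate range from the list above (USD per 1M tokens, by model size).
FINE_TUNING_RATE_RANGE = (0.48, 2.90)


def fine_tuning_cost_range(dataset_tokens: int, epochs: int = 1) -> tuple[float, float]:
    """Return (low, high) USD estimates for a fine-tuning run.

    Assumption: billed tokens = dataset_tokens * epochs. Minimum charges
    for specialized models are not modeled.
    """
    billed_millions = dataset_tokens * epochs / 1_000_000
    low_rate, high_rate = FINE_TUNING_RATE_RANGE
    return (round(low_rate * billed_millions, 2), round(high_rate * billed_millions, 2))
```

For example, a 50M-token dataset trained for 3 epochs would fall between roughly $72 and $435 under these assumptions.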

GPU Cloud

  • Instant Clusters: $2.20-5.50/hr/GPU
  • Dedicated Endpoints: $2.10-4.99/hr
  • Single-tenant deployment
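
Hourly pricing can be compared against per-token pricing with a little arithmetic. The Python sketch below estimates a cost range for GPU time and the token volume at which an hourly endpoint would cost the same as serverless inference. The function names are hypothetical, and the break-even figure is only indicative: the listing does not say which hardware or model each hourly rate corresponds to.

```python
# Hourly rate ranges from the list above (USD).
INSTANT_CLUSTER_RATE = (2.20, 5.50)     # per GPU per hour
DEDICATED_ENDPOINT_RATE = (2.10, 4.99)  # per hour


def cost_range(rate_range: tuple[float, float], hours: float, units: int = 1) -> tuple[float, float]:
    """Return (low, high) USD cost for `units` GPUs or endpoints over `hours`."""
    low, high = rate_range
    return (round(low * hours * units, 2), round(high * hours * units, 2))


def break_even_tokens_per_hour(hourly_rate: float, serverless_rate_per_million: float) -> float:
    """Tokens per hour at which an hourly endpoint matches serverless cost."""
    return hourly_rate / serverless_rate_per_million * 1_000_000
```

As a usage example, `cost_range(INSTANT_CLUSTER_RATE, hours=24, units=8)` gives the daily range for an 8-GPU cluster, and `break_even_tokens_per_hour(2.10, 0.18)` suggests that a $2.10/hour endpoint would need to serve on the order of 11.7 million tokens per hour to match the $0.18 per 1M serverless rate.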

Reviews

4.8/5

Across 5 verified user reviews on G2

Together AI FAQ

How does Together AI pricing work?

Together AI uses pay-as-you-go pricing: inference and fine-tuning are billed per 1M tokens, and GPU cloud is billed per hour.

What models does Together AI support?

Together AI supports Llama 3.1, FLUX.1, DeepSeek, GLM, Kimi, and many other open-source models.

Is there a batch discount?

Yes, the Batch API costs 50% less for most models.

Can I get dedicated infrastructure?

Yes, Dedicated Endpoints provide single-tenant deployment at $2.10-4.99/hour.

Source: together.ai
