TL;DR - Together AI

  • Together AI is an inference platform for open-source AI models
  • It provides fast, affordable access to leading open models
  • Pay-per-token pricing, with listed serverless rates from $0.18 per 1M tokens
Pricing: Paid only
Best for: Enterprises & pros
4.8/5 across review platforms

Pros & Cons

Pros

  • Many open models
  • Competitive pricing
  • Fast inference
  • Good for startups
  • Fine-tuning available

Cons

  • Smaller than big providers
  • Model quality varies
  • Basic support
  • Documentation gaps
  • Newer platform

Ratings Across the Web

4.8 (5 reviews)

Ratings aggregated from independent review platforms.

Key Features

  • LLM inference
  • Image generation
  • Fine-tuning
  • GPU cloud
  • Batch processing
  • Code execution
  • Dedicated endpoints
  • Open-source models
  • Competitive pricing
  • Single-tenant options

Pricing Plans

Serverless Inference

  • Pay per 1M tokens
  • Llama 3.1 8B: $0.18/1M tokens
  • Llama 3.1 405B: $3.50/1M tokens
  • FLUX.1 dev: $0.025/megapixel
  • Batch API: 50% lower cost
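
The per-token rates above translate into a simple cost estimate. The following is a minimal sketch, not Together AI's billing logic: the function name `estimate_cost` is hypothetical, and it assumes a single flat rate per model for all tokens and that the Batch API discount applies to the chosen model.

```python
# Listed serverless rates in USD per 1M tokens, copied from the list above.
PRICE_PER_MILLION = {
    "Llama 3.1 8B": 0.18,
    "Llama 3.1 405B": 3.50,
}

# "Batch API: 50% lower cost" (assumed to apply to the model in question).
BATCH_DISCOUNT = 0.50


def estimate_cost(model: str, tokens: int, batch: bool = False) -> float:
    """Return an estimated USD cost for processing `tokens` tokens on `model`.

    Hypothetical helper: assumes one flat rate per model for all tokens.
    """
    cost = tokens / 1_000_000 * PRICE_PER_MILLION[model]
    if batch:
        cost *= 1 - BATCH_DISCOUNT
    return round(cost, 4)


if __name__ == "__main__":
    print(estimate_cost("Llama 3.1 8B", 10_000_000))
    print(estimate_cost("Llama 3.1 405B", 2_000_000, batch=True))
```

Under these assumptions, 10 million tokens on Llama 3.1 8B come to about $1.80, and 2 million tokens on Llama 3.1 405B through the Batch API come to about $3.50.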

Fine-Tuning

  • $0.48-2.90/1M tokens (by model size)
  • DeepSeek, GLM, Kimi support
  • Minimum charges for specialized models

GPU Cloud

  • Instant Clusters: $2.20-5.50/hr/GPU
  • Dedicated Endpoints: $2.10-4.99/hr
  • Single-tenant deployment

Together AI is a platform for running open-source LLMs. It offers serverless inference, fine-tuning, and GPU cloud, with competitive pricing for Llama, FLUX, and other models.

Together AI FAQ

How does Together AI pricing work?

Together AI uses pay-as-you-go pricing: per 1M tokens for inference and fine-tuning, and per hour for GPU cloud.

What models does Together AI support?

Together AI supports Llama 3.1, FLUX.1, DeepSeek, GLM, Kimi, and many other open-source models.

Is there a batch discount?

Yes, the Batch API offers 50% lower cost for most models.

Can I get dedicated infrastructure?

Yes, Dedicated Endpoints provide single-tenant deployment at $2.10-4.99/hour.
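
Because dedicated endpoints are billed by the hour, it can help to convert the listed range into a monthly figure. This is a rough sketch under stated assumptions: the helper `monthly_cost` is hypothetical, the endpoint is assumed to run continuously, and a month is taken as 30 days. Actual billing terms may differ.

```python
# Listed Dedicated Endpoints range in USD per hour, from the answer above.
HOURLY_LOW = 2.10
HOURLY_HIGH = 4.99

# Assumption: the endpoint runs 24 hours a day for a 30-day month.
HOURS_PER_MONTH = 24 * 30


def monthly_cost(hourly_rate: float, hours: int = HOURS_PER_MONTH) -> float:
    """Return the USD cost of running one endpoint for `hours` hours."""
    return round(hourly_rate * hours, 2)


if __name__ == "__main__":
    print(monthly_cost(HOURLY_LOW), monthly_cost(HOURLY_HIGH))
```

With these assumptions, an always-on endpoint would cost roughly $1,512 to $3,592.80 per month, depending on where it falls in the listed range.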

Source: together.ai