
Run open-source LLMs with serverless inference and fine-tuning
Visit WebsiteThe Bottom Line
Entry price
Paid plans only
Biggest pro
Many open models
Biggest con
Smaller than big providers
TL;DR - Together AI
- Together AI is an inference platform for open-source AI models
- It provides fast, affordable access to leading open models
- Pay-per-token pricing starting at $0.20/million tokens
What is Together AI?
Available on: Web
Pros & Cons
Pros
- Many open models
- Competitive pricing
- Fast inference
- Good for startups
- Fine-tuning available
Cons
- Smaller than big providers
- Model quality varies
- Support basic
- Documentation gaps
- Newer platform
Ratings Across the Web
Ratings aggregated from independent review platforms. Learn more
Key Features
Pricing Plans
Serverless Inference
null
- Pay per 1M tokens
- Llama 3.1 8B: $0.18/1M tokens
- Llama 3.1 405B: $3.50/1M tokens
- FLUX.1 dev: $0.025/megapixel
- Batch API: 50% lower cost
Fine-Tuning
null
- $0.48-2.90/1M tokens (by model size)
- DeepSeek, GLM, Kimi support
- Minimum charges for specialized models
GPU Cloud
null
- Instant Clusters: $2.20-5.50/hr/GPU
- Dedicated Endpoints: $2.10-4.99/hr
- Single-tenant deployment
Reviews
Across 5 verified user reviews on G2
Add your hands-on experience to help the next buyer.
Best Together AI Alternatives
Top alternatives based on features, pricing, and user needs.
High-performance AI infrastructure for developers to deploy, train, and scale ML workloads.
The end-to-end AI cloud that simplifies building and deploying models with GPU infrastructure.
Platform for scaling Ray and Python AI applications
Run, fine-tune, and deploy open-source ML models via API
Accelerate AI model deployment and optimize performance across diverse hardware.
Still deciding?
Most buyers shortlist 2 or 3 tools before committing. Pull a side-by-side comparison or browse the full alternatives shortlist below.
Explore More
Together AI FAQ
How does Together AI pricing work?
What models does Together AI support?
Is there a batch discount?
Can I get dedicated infrastructure?
Source: together.ai