Registry / Tools / Together AI

Together AI facts

tool_together_ai

Platform

Together AI

AI cloud platform for model APIs, inference, fine-tuning, evaluations, GPU clusters, and sandboxes.

last verified: 2026-05-19pricing checked: 2026-05-18

Official website Pricing page Documentation API docs

Fact summary

Name: Together AI
Company / maker: Together AI
Category: Model & Infrastructure
Entity type: Platform
Availability: Public
Pricing model: Usage Based
Pricing unit: month
Primary pricing route: Model Api
Delivery modes: APISDKWeb AppCLI
Supported platforms: Web
Data state: source backed

Official links

Formal source rows with URL, source type, and freshness.

Link	URL	source type	checked
Official website	www.together.ai	official-site	2026-05-10
Pricing page	www.together.ai/pricing	official-pricing-page	2026-05-18
Documentation	docs.together.ai	official-documentation	2026-05-10
API docs	docs.together.ai	official-documentation	2026-05-10

Access and delivery

Supported platforms and delivery modes are intentionally separate.

Supported platforms

Web

Delivery modes

APISDKWeb AppCLI

Capabilities

Capability tags

Model ApisModel HostingEvaluation MonitoringData Etl

Input types

TextImageAudioVideoStructured DataFunction CallSpreadsheetUrl

Output types

TextImageAudioVideoStructured JsonEmbeddingsDocument

Pricing summary

Derived from official pricing routes where available

pricingModel: Usage Based
pricingUnit: month
startingPrice: $0.03 / month
currency: USD
pricingLastChecked: 2026-05-18

Pricing routes

Each route is a first-class record with source URL and checked date.

GPU clusters

routeType: api-usage

Official pricing lists on-demand and reserved GPU cluster hourly pricing for H100, H200, and B200 hardware.

Dedicated inference

routeType: api-usage

Official pricing lists dedicated inference GPU instance prices such as H100, H200, and B200 hourly pricing.

Serverless inference API

routeType: model-api

primary route

Official pricing lists serverless inference prices per 1M tokens and model-specific prices across chat, vision, image, audio, video, embeddings, rerank, and moderation.

Pricing plans

plan	track	monthly	usage limit	features	checked
Code Interpreter	api	0.03	per 60-minute session	Execute LLM-generated code securely using the API	2026-05-18
Managed Storage	api	0.16	per GiB/month for Shared Filesystem	High-bandwidth, parallel filesystem colocated with compute	2026-05-18
Fine-Tuning Standard	api	-	Usage-based per 1M tokens processed; rates vary by model size and method	Supervised Fine-Tuning, Direct Preference Optimization, LoRA and full fine-tuning options	2026-05-18
Code Sandbox	api	-	Per vCPU $0.0446/hour; per GiB RAM $0.0149/hour	Customize a deployment of VM sandboxes for large development environments	2026-05-18
Dedicated Inference 1x B200 180GB	api	-	$9.95 per hour	Guaranteed performance with no sharing, Support for custom models, Autoscaling and traffic spike handling	2026-05-18
Dedicated Inference 1x H200 141GB	api	-	$5.49 per hour	Guaranteed performance with no sharing, Support for custom models, Autoscaling and traffic spike handling	2026-05-18
Dedicated Inference 1x H100 80GB	api	-	$3.99 per hour	Guaranteed performance with no sharing, Support for custom models, Autoscaling and traffic spike handling	2026-05-18
Serverless Inference	api	-	Usage-based model pricing per 1M tokens, per image, per audio minute, per video, or related unit depending on modality	Chat, Vision, Image, Audio, Video, Transcribe, Embeddings, Rerank, Moderation	2026-05-18

Evidence and sources

source URL	source type	used for	checked
www.together.ai/support	official-documentation	support	2026-05-10
docs.together.ai	official-documentation	documentation, api-access, capabilities	2026-05-10
www.together.ai/pricing	official-pricing-page	pricing, pricing-routes	2026-05-18
www.together.ai	official-site	identity, capabilities, access	2026-05-10

Freshness and recent changes

Facts verified

2026-05-19

Identity and core facts

Pricing checked

2026-05-18

Pricing routes and plans

Record updated

2026-05-10

Public record freshness

One-hop registry pages derived from this record's modeled facts. This section does not infer peer tools.

Together AI facts

Together AI

Fact summary

Official links

Access and delivery

Supported platforms

Delivery modes

Capabilities

Capability tags

Pricing summary

Pricing summary

Pricing routes

Pricing plans

Evidence and sources

Freshness and recent changes

Related pages

Together AI

Model & Infrastructure

Usage Based

Model Api

Related registry paths