Together AI

Together AI is a cloud platform that provides developers and AI researchers with access to high-performance GPU clusters and tools for building, training, fine-tuning, and running open-source generative AI models. It offers both serverless and dedicated endpoints for inference and training.

Use tool

Pricing

Llama 4 Maverick (Input)

$/mo

Llama 4 Maverick Input usage
Price per 1M tokens

Llama 4 Maverick (Output)

$/mo

Llama 4 Maverick Output usage
Price per 1M tokens

Llama 4 Scout (Input)

$/mo

Llama 4 Scout Input usage
Price per 1M tokens

Llama 4 Scout (Output)

$/mo

Llama 4 Scout Output usage
Price per 1M tokens

Llama 3.3 70B Instruct-Turbo (Input)

$/mo

Llama 3.3 70B Instruct-Turbo Input usage
Price per 1M tokens

Llama 3.3 70B Instruct-Turbo (Output)

$/mo

Llama 3.3 70B Instruct-Turbo Output usage
Price per 1M tokens

Llama 3.2 3B Instruct Turbo (Input)

$/mo

Llama 3.2 3B Instruct Turbo Input usage
Price per 1M tokens

Llama 3.2 3B Instruct Turbo (Output)

$/mo

Llama 3.2 3B Instruct Turbo Output usage
Price per 1M tokens

Llama 3.1 405B Instruct Turbo (Input)

$/mo

Llama 3.1 405B Instruct Turbo Input usage
Price per 1M tokens

Llama 3.1 405B Instruct Turbo (Output)

$/mo

Llama 3.1 405B Instruct Turbo Output usage
Price per 1M tokens

Llama 3.1 70B Instruct Turbo (Input)

$/mo

Llama 3.1 70B Instruct Turbo Input usage
Price per 1M tokens

Llama 3.1 70B Instruct Turbo (Output)

$/mo

Llama 3.1 70B Instruct Turbo Output usage
Price per 1M tokens

Llama 3.1 8B Instruct Turbo (Input)

$/mo

Llama 3.1 8B Instruct Turbo Input usage
Price per 1M tokens

Llama 3.1 8B Instruct Turbo (Output)

$/mo

Llama 3.1 8B Instruct Turbo Output usage
Price per 1M tokens

Llama 3 8B Instruct Lite (Input)

$/mo

Llama 3 8B Instruct Lite Input usage
Price per 1M tokens

Llama 3 8B Instruct Lite (Output)

$/mo

Llama 3 8B Instruct Lite Output usage
Price per 1M tokens

Llama 3 70B Instruct Reference (Input)

$/mo

Llama 3 70B Instruct Reference Input usage
Price per 1M tokens

Llama 3 70B Instruct Reference (Output)

$/mo

Llama 3 70B Instruct Reference Output usage
Price per 1M tokens

Llama 3 70B Instruct Turbo (Input)

$/mo

Llama 3 70B Instruct Turbo Input usage
Price per 1M tokens

Llama 3 70B Instruct Turbo (Output)

$/mo

Llama 3 70B Instruct Turbo Output usage
Price per 1M tokens

LLaMA-2 (Input)

$/mo

LLaMA-2 Input usage
Price per 1M tokens

LLaMA-2 (Output)

$/mo

LLaMA-2 Output usage
Price per 1M tokens

DeepSeek-R1 (Input)

$/mo

DeepSeek-R1 Input usage
Price per 1M tokens

DeepSeek-R1 (Output)

$/mo

DeepSeek-R1 Output usage
Price per 1M tokens

DeepSeek R1 Distilled Qwen 14B (Input)

$/mo

DeepSeek R1 Distilled Qwen 14B Input usage
Price per 1M tokens

DeepSeek R1 Distilled Qwen 14B (Output)

$/mo

DeepSeek R1 Distilled Qwen 14B Output usage
Price per 1M tokens

DeepSeek R1 Distilled Llama 70B (Input)

$/mo

DeepSeek R1 Distilled Llama 70B Input usage
Price per 1M tokens

DeepSeek R1 Distilled Llama 70B (Output)

$/mo

DeepSeek R1 Distilled Llama 70B Output usage
Price per 1M tokens

DeepSeek R1-0528-tput (Input)

$/mo

DeepSeek R1-0528-tput Input usage
Price per 1M tokens

DeepSeek R1-0528-tput (Output)

$/mo

DeepSeek R1-0528-tput Output usage
Price per 1M tokens

DeepSeek-V3-1 (Input)

$/mo

DeepSeek-V3-1 Input usage
Price per 1M tokens

DeepSeek-V3-1 (Output)

$/mo

DeepSeek-V3-1 Output usage
Price per 1M tokens

DeepSeek-V3 (Input)

$/mo

DeepSeek-V3 Input usage
Price per 1M tokens

DeepSeek-V3 (Output)

$/mo

DeepSeek-V3 Output usage
Price per 1M tokens

gpt-oss-120B (Input)

$/mo

gpt-oss-120B Input usage
Price per 1M tokens

gpt-oss-120B (Output)

$/mo

gpt-oss-120B Output usage
Price per 1M tokens

gpt-oss-20B (Input)

$/mo

gpt-oss-20B Input usage
Price per 1M tokens

gpt-oss-20B (Output)

$/mo

gpt-oss-20B Output usage
Price per 1M tokens

Qwen3-Coder 480B A35B Instruct (Input)

$/mo

Qwen3-Coder 480B A35B Instruct Input usage
Price per 1M tokens

Qwen3-Coder 480B A35B Instruct (Output)

$/mo

Qwen3-Coder 480B A35B Instruct Output usage
Price per 1M tokens

Qwen3 235B A22B Instruct 2507 FP8 (Input)

$/mo

Qwen3 235B A22B Instruct 2507 FP8 Input usage
Price per 1M tokens

Qwen3 235B A22B Instruct 2507 FP8 (Output)

$/mo

Qwen3 235B A22B Instruct 2507 FP8 Output usage
Price per 1M tokens

Qwen3 235B A22B Thinking 2507 FP8 (Input)

$/mo

Qwen3 235B A22B Thinking 2507 FP8 Input usage
Price per 1M tokens

Qwen3 235B A22B Thinking 2507 FP8 (Output)

$/mo

Qwen3 235B A22B Thinking 2507 FP8 Output usage
Price per 1M tokens

Qwen3 235B A22B FP8 Throughput (Input)

$/mo

Qwen3 235B A22B FP8 Throughput Input usage
Price per 1M tokens

Qwen3 235B A22B FP8 Throughput (Output)

$/mo

Qwen3 235B A22B FP8 Throughput Output usage
Price per 1M tokens

Qwen 2.5 72B (Input)

$/mo

Qwen 2.5 72B Input usage
Price per 1M tokens

Qwen 2.5 72B (Output)

$/mo

Qwen 2.5 72B Output usage
Price per 1M tokens

Qwen2.5-VL 72B Instruct (Input)

$/mo

Qwen2.5-VL 72B Instruct Input usage
Price per 1M tokens

Qwen2.5-VL 72B Instruct (Output)

$/mo

Qwen2.5-VL 72B Instruct Output usage
Price per 1M tokens

Qwen2.5 Coder 32B Instruct (Input)

$/mo

Qwen2.5 Coder 32B Instruct Input usage
Price per 1M tokens

Qwen2.5 Coder 32B Instruct (Output)

$/mo

Qwen2.5 Coder 32B Instruct Output usage
Price per 1M tokens

Qwen2.5 7B Instruct Turbo (Input)

$/mo

Qwen2.5 7B Instruct Turbo Input usage
Price per 1M tokens

Qwen2.5 7B Instruct Turbo (Output)

$/mo

Qwen2.5 7B Instruct Turbo Output usage
Price per 1M tokens

Qwen QwQ-32B (Input)

$/mo

Qwen QwQ-32B Input usage
Price per 1M tokens

Qwen QwQ-32B (Output)

$/mo

Qwen QwQ-32B Output usage
Price per 1M tokens

GLM-4.5-Air (Input)

$/mo

GLM-4.5-Air Input usage
Price per 1M tokens

GLM-4.5-Air (Output)

$/mo

GLM-4.5-Air Output usage
Price per 1M tokens

Kimi K2 Instruct (Input)

$/mo

Kimi K2 Instruct Input usage
Price per 1M tokens

Kimi K2 Instruct (Output)

$/mo

Kimi K2 Instruct Output usage
Price per 1M tokens

Kimi K2 Thinking (Input)

$/mo

Kimi K2 Thinking Input usage
Price per 1M tokens

Kimi K2 Thinking (Output)

$/mo

Kimi K2 Thinking Output usage
Price per 1M tokens

Kimi K2 0905 (Input)

$/mo

Kimi K2 0905 Input usage
Price per 1M tokens

Kimi K2 0905 (Output)

$/mo

Kimi K2 0905 Output usage
Price per 1M tokens

Mistral (7B) Instruct v0.2 (Input)

$/mo

Mistral (7B) Instruct v0.2 Input usage
Price per 1M tokens

Mistral (7B) Instruct v0.2 (Output)

$/mo

Mistral (7B) Instruct v0.2 Output usage
Price per 1M tokens

Mistral Instruct (Input)

$/mo

Mistral Instruct Input usage
Price per 1M tokens

Mistral Instruct (Output)

$/mo

Mistral Instruct Output usage
Price per 1M tokens

Mistral Small 3 (Input)

$/mo

Mistral Small 3 Input usage
Price per 1M tokens

Mistral Small 3 (Output)

$/mo

Mistral Small 3 Output usage
Price per 1M tokens

Mixtral 8x7B Instruct v0.1 (Input)

$/mo

Mixtral 8x7B Instruct v0.1 Input usage
Price per 1M tokens

Mixtral 8x7B Instruct v0.1 (Output)

$/mo

Mixtral 8x7B Instruct v0.1 Output usage
Price per 1M tokens

Marin 8B Instruct (Input)

$/mo

Marin 8B Instruct Input usage
Price per 1M tokens

Marin 8B Instruct (Output)

$/mo

Marin 8B Instruct Output usage
Price per 1M tokens

Arcee AI AFM-4.5B (Input)

$/mo

Arcee AI AFM-4.5B Input usage
Price per 1M tokens

Arcee AI AFM-4.5B (Output)

$/mo

Arcee AI AFM-4.5B Output usage
Price per 1M tokens

Arcee AI Coder-Large (Input)

$/mo

Arcee AI Coder-Large Input usage
Price per 1M tokens

Arcee AI Coder-Large (Output)

$/mo

Arcee AI Coder-Large Output usage
Price per 1M tokens

Arcee AI Maestro (Input)

$/mo

Arcee AI Maestro Input usage
Price per 1M tokens

Arcee AI Maestro (Output)

$/mo

Arcee AI Maestro Output usage
Price per 1M tokens

Arcee AI Virtuoso-Large (Input)

$/mo

Arcee AI Virtuoso-Large Input usage
Price per 1M tokens

Arcee AI Virtuoso-Large (Output)

$/mo

Arcee AI Virtuoso-Large Output usage
Price per 1M tokens

Cogito v2 preview - 109B MoE (Input)

$/mo

Cogito v2 preview - 109B MoE Input usage
Price per 1M tokens

Cogito v2 preview - 109B MoE (Output)

$/mo

Cogito v2 preview - 109B MoE Output usage
Price per 1M tokens

Cogito v2 preview - 405B (Input)

$/mo

Cogito v2 preview - 405B Input usage
Price per 1M tokens

Cogito v2 preview - 405B (Output)

$/mo

Cogito v2 preview - 405B Output usage
Price per 1M tokens

Cogito v2 preview - 671B MoE (Input)

$/mo

Cogito v2 preview - 671B MoE Input usage
Price per 1M tokens

Cogito v2 preview - 671B MoE (Output)

$/mo

Cogito v2 preview - 671B MoE Output usage
Price per 1M tokens

Cogito v2 preview - 70B (Input)

$/mo

Cogito v2 preview - 70B Input usage
Price per 1M tokens

Cogito v2 preview - 70B (Output)

$/mo

Cogito v2 preview - 70B Output usage
Price per 1M tokens

Refuel LLM-2 (Input)

$/mo

Refuel LLM-2 Input usage
Price per 1M tokens

Refuel LLM-2 (Output)

$/mo

Refuel LLM-2 Output usage
Price per 1M tokens

Refuel LLM-2 Small (Input)

$/mo

Refuel LLM-2 Small Input usage
Price per 1M tokens

Refuel LLM-2 Small (Output)

$/mo

Refuel LLM-2 Small Output usage
Price per 1M tokens

Typhoon 2 70B Instruct (Input)

$/mo

Typhoon 2 70B Instruct Input usage
Price per 1M tokens

Typhoon 2 70B Instruct (Output)

$/mo

Typhoon 2 70B Instruct Output usage
Price per 1M tokens

gemma-3n-E4B-it (Input)

$/mo

gemma-3n-E4B-it Input usage
Price per 1M tokens

gemma-3n-E4B-it (Output)

$/mo

gemma-3n-E4B-it Output usage
Price per 1M tokens

FLUX.1 Krea [dev]

$/mo

FLUX.1 Krea [dev] image generation
Price per MP
Default steps: 28

FLUX.1 Kontext [dev]

$/mo

FLUX.1 Kontext [dev] image generation
Price per MP
Default steps: 28

FLUX.1 Kontext [pro]

$/mo

FLUX.1 Kontext [pro] image generation
Price per MP
Default steps: 28

FLUX.1 Kontext [max]

$/mo

FLUX.1 Kontext [max] image generation
Price per MP
Default steps: 28

FLUX1.1 [pro]

$/mo

FLUX1.1 [pro] image generation
Price per MP

FLUX.1 [dev]

$/mo

FLUX.1 [dev] image generation
Price per MP
Default steps: 28

FLUX.1 [pro]

$/mo

FLUX.1 [pro] image generation
Price per MP
Default steps: 28

FLUX.1 [schnell]

$/mo

FLUX.1 [schnell] image generation
Price per MP
Default steps: 4

FLUX.1 Canny [pro]

$/mo

FLUX.1 Canny [pro] image generation
Price per MP

Google Imagen 4.0 Preview

$/mo

Google Imagen 4.0 Preview image generation
Price per MP

Google Imagen 4.0 Fast

$/mo

Google Imagen 4.0 Fast image generation
Price per MP

Google Imagen 4.0 Ultra

$/mo

Google Imagen 4.0 Ultra image generation
Price per MP

Gemini Flash Image 2.5 (Nano Banana)

$/mo

Gemini Flash Image 2.5 (Nano Banana) image generation
Price per MP

ByteDance Seedream 3.0

$/mo

ByteDance Seedream 3.0 image generation
Price per MP

ByteDance Seedream 4.0

$/mo

ByteDance Seedream 4.0 image generation
Price per MP

ByteDance SeedEdit

$/mo

ByteDance SeedEdit image generation
Price per MP

Qwen Image Edit

$/mo

Qwen Image Edit image generation
Price per MP

Qwen Image

$/mo

Qwen Image image generation
Price per MP

Juggernaut Pro Flux by RunDiffusion

$/mo

Juggernaut Pro Flux by RunDiffusion image generation
Price per MP

Juggernaut Lightning Flux by RunDiffusion

$/mo

Juggernaut Lightning Flux by RunDiffusion image generation
Price per MP

HiDream-I1-Full

$/mo

HiDream-I1-Full image generation
Price per MP

HiDream-I1-Dev

$/mo

HiDream-I1-Dev image generation
Price per MP

HiDream-I1-Fast

$/mo

HiDream-I1-Fast image generation
Price per MP

Ideogram 3.0

$/mo

Ideogram 3.0 image generation
Price per MP

Dreamshaper

$/mo

Dreamshaper image generation
Price per MP

SD XL

$/mo

SD XL image generation
Price per MP

Stable Diffusion 3

$/mo

Stable Diffusion 3 image generation
Price per MP

Cartesia Sonic-2

$/mo

Cartesia Sonic-2 speech synthesis/processing
Price per 1M characters

MiniMax 01 Director (720p/5s)

$/mo

MiniMax 01 Director video generation
Price per video (720p/5s)

MiniMax Hailuo 02 (768p/10s)

$/mo

MiniMax Hailuo 02 video generation
Price per video (768p/10s)

MiniMax Hailuo 02 (1080p/6s)

$/mo

MiniMax Hailuo 02 video generation
Price per video (1080p/6s)

Google Veo 2.0 (720p/5s)

$/mo

Google Veo 2.0 video generation
Price per video (720p/5s)

Google Veo 3.0 (720p/8s)

$/mo

Google Veo 3.0 video generation
Price per video (720p/8s)

Google Veo 3.0 + Audio (720p/8s with audio)

$/mo

Google Veo 3.0 + Audio video generation
Price per video (720p/8s with audio)

Google Veo 3.0 Fast (1080p/8s)

$/mo

Google Veo 3.0 Fast video generation
Price per video (1080p/8s)

Google Veo 3.0 Fast + Audio (1080p/8s with audio)

$/mo

Google Veo 3.0 Fast + Audio video generation
Price per video (1080p/8s with audio)

ByteDance Seedance 1.0 Lite (720p/5s)

$/mo

ByteDance Seedance 1.0 Lite video generation
Price per video (720p/5s)

ByteDance Seedance 1.0 Pro (1080p/5s)

$/mo

ByteDance Seedance 1.0 Pro video generation
Price per video (1080p/5s)

PixVerse v5 (1080p/5s)

$/mo

PixVerse v5 video generation
Price per video (1080p/5s)

Kling 2.1 Master (1080p/5s)

$/mo

Kling 2.1 Master video generation
Price per video (1080p/5s)

Kling 2.1 Standard (720p/5s)

$/mo

Kling 2.1 Standard video generation
Price per video (720p/5s)

Kling 2.1 Pro (1080p/5s)

$/mo

Kling 2.1 Pro video generation
Price per video (1080p/5s)

Kling 2.0 Master (1080p/5s)

$/mo

Kling 2.0 Master video generation
Price per video (1080p/5s)

Kling 1.6 Standard (720p/5s)

$/mo

Kling 1.6 Standard video generation
Price per video (720p/5s)

Kling 1.6 Pro (1080p/5s)

$/mo

Kling 1.6 Pro video generation
Price per video (1080p/5s)

Wan 2.2 I2V (720p/5s)

$/mo

Wan 2.2 I2V video generation
Price per video (720p/5s)

Wan 2.2 T2V (720p/8s)

$/mo

Wan 2.2 T2V video generation
Price per video (720p/8s)

Vidu 2.0 (720p/8s)

$/mo

Vidu 2.0 video generation
Price per video (720p/8s)

Vidu Q1 (1080p/5s)

$/mo

Vidu Q1 video generation
Price per video (1080p/5s)

Sora 2 (720p/8s)

$/mo

Sora 2 video generation
Price per video (720p/8s)

Sora 2 Pro (720p/8s)

$/mo

Sora 2 Pro video generation
Price per video (720p/8s)

Sora 2 Pro (1080p/8s)

$/mo

Sora 2 Pro video generation
Price per video (1080p/8s)

Whisper Large v3

$/mo

Whisper Large v3 automatic speech recognition
Price per audio minute

BGE-Base-EN v1.5

$/mo

BGE-Base-EN v1.5 vector embeddings
Price per 1M tokens

BGE-Large-EN v1.5

$/mo

BGE-Large-EN v1.5 vector embeddings
Price per 1M tokens

GTE ModernBERT base

$/mo

GTE ModernBERT base vector embeddings
Price per 1M tokens

Multilingual e5 large instruct

$/mo

Multilingual e5 large instruct vector embeddings
Price per 1M tokens

M2-BERT 80M 32K Retrieval

$/mo

M2-BERT 80M 32K Retrieval vector embeddings
Price per 1M tokens

Mxbai Rerank Large V2

$/mo

Mxbai Rerank Large V2 search relevance reranking
Price per 1M tokens

Salesforce Llama Rank V1 (8B)

$/mo

Salesforce Llama Rank V1 (8B) search relevance reranking
Price per 1M tokens

VirtueGuard Text Lite

$/mo

VirtueGuard Text Lite content filtering and classification
Price per 1M tokens

Llama Guard 4 12B

$/mo

Llama Guard 4 12B content filtering and classification
Price per 1M tokens

Llama Guard 3 11B Vision Turbo

$/mo

Llama Guard 3 11B Vision Turbo content filtering and classification
Price per 1M tokens

Llama Guard 3 8B

$/mo

Llama Guard 3 8B content filtering and classification
Price per 1M tokens

Llama Guard 2 8B

$/mo

Llama Guard 2 8B content filtering and classification
Price per 1M tokens

Dedicated Endpoint - 1x H200 141GB

$/mo

Guaranteed performance
Support for custom models
Autoscaling & traffic spike handling
Hardware: 1x H200 141GB
Price per hour

Dedicated Endpoint - 1x H100 80GB

$/mo

Guaranteed performance
Support for custom models
Autoscaling & traffic spike handling
Hardware: 1x H100 80GB
Price per hour

Dedicated Endpoint - 1x A100 SXM 80GB

$/mo

Guaranteed performance
Support for custom models
Autoscaling & traffic spike handling
Hardware: 1x A100 SXM 80GB
Price per hour

Dedicated Endpoint - 1x A100 SXM 40GB

$/mo

Guaranteed performance
Support for custom models
Autoscaling & traffic spike handling
Hardware: 1x A100 SXM 40GB
Price per hour

Dedicated Endpoint - 1x A100 PCIe 80GB

$/mo

Guaranteed performance
Support for custom models
Autoscaling & traffic spike handling
Hardware: 1x A100 PCIe 80GB
Price per hour

Dedicated Endpoint - 1x L40S 48GB

$/mo

Guaranteed performance
Support for custom models
Autoscaling & traffic spike handling
Hardware: 1x L40S 48GB
Price per hour

Fine-tuning Up to 16B - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA (Up to 16B model size)
Price per token processed

Fine-tuning Up to 16B - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA (Up to 16B model size)
Price per token processed

Fine-tuning Up to 16B - Supervised Full Fine-Tuning

$/mo

Supervised Full Fine-Tuning (Up to 16B model size)
Price per token processed

Fine-tuning Up to 16B - Direct Preference Optimization Full Fine-Tuning

$/mo

Direct Preference Optimization Full Fine-Tuning (Up to 16B model size)
Price per token processed

Fine-tuning 17B-69B - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA (17B-69B model size)
Price per token processed

Fine-tuning 17B-69B - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA (17B-69B model size)
Price per token processed

Fine-tuning 17B-69B - Supervised Full Fine-Tuning

$/mo

Supervised Full Fine-Tuning (17B-69B model size)
Price per token processed

Fine-tuning 17B-69B - Direct Preference Optimization Full Fine-Tuning

$/mo

Direct Preference Optimization Full Fine-Tuning (17B-69B model size)
Price per token processed

Fine-tuning 70-100B - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA (70-100B model size)
Price per token processed

Fine-tuning 70-100B - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA (70-100B model size)
Price per token processed

Fine-tuning 70-100B - Supervised Full Fine-Tuning

$/mo

Supervised Full Fine-Tuning (70-100B model size)
Price per token processed

Fine-tuning 70-100B - Direct Preference Optimization Full Fine-Tuning

$/mo

Direct Preference Optimization Full Fine-Tuning (70-100B model size)
Price per token processed

Specialized Fine-tuning gpt-oss-120B - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA for gpt-oss-120B model
Limited to LoRA fine-tuning
Minimum charge: $6.00
Price per token processed

Specialized Fine-tuning gpt-oss-120B - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA for gpt-oss-120B model
Limited to LoRA fine-tuning
Minimum charge: $6.00
Price per token processed

Specialized Fine-tuning Llama 4 Scout Instruct - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA for Llama 4 Scout Instruct model
Limited to LoRA fine-tuning
Minimum charge: $6.00
Price per token processed

Specialized Fine-tuning Llama 4 Scout Instruct - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA for Llama 4 Scout Instruct model
Limited to LoRA fine-tuning
Minimum charge: $6.00
Price per token processed

Specialized Fine-tuning Llama 4 Maverick Instruct - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA for Llama 4 Maverick Instruct model
Limited to LoRA fine-tuning
Minimum charge: $16.00
Price per token processed

Specialized Fine-tuning Llama 4 Maverick Instruct - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA for Llama 4 Maverick Instruct model
Limited to LoRA fine-tuning
Minimum charge: $16.00
Price per token processed

Specialized Fine-tuning DeepSeek Models - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA for DeepSeek-R1, DeepSeek-R1-0528, DeepSeek-V3, DeepSeek-V3-0324, DeepSeek-V3.1, DeepSeek-V3.1-Base models
Limited to LoRA fine-tuning
Minimum charge: $20.00
Price per token processed

Specialized Fine-tuning DeepSeek Models - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA for DeepSeek-R1, DeepSeek-R1-0528, DeepSeek-V3, DeepSeek-V3-0324, DeepSeek-V3.1, DeepSeek-V3.1-Base models
Limited to LoRA fine-tuning
Minimum charge: $20.00
Price per token processed

Specialized Fine-tuning Qwen3-Coder-480B-A35B-Instruct - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA for Qwen3-Coder-480B-A35B-Instruct model
Limited to LoRA fine-tuning
Minimum charge: $18.00
Price per token processed

Specialized Fine-tuning Qwen3-Coder-480B-A35B-Instruct - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA for Qwen3-Coder-480B-A35B-Instruct model
Limited to LoRA fine-tuning
Minimum charge: $18.00
Price per token processed

Specialized Fine-tuning Qwen3-235B-A22B Models - Supervised Fine-Tuning LoRA

$/mo

Supervised Fine-Tuning LoRA for Qwen3-235B-A22B, Qwen3-235B-A22B-Instruct-2507 models
Limited to LoRA fine-tuning
No minimum charge
Price per token processed

Specialized Fine-tuning Qwen3-235B-A22B Models - Direct Preference Optimization LoRA

$/mo

Direct Preference Optimization LoRA for Qwen3-235B-A22B, Qwen3-235B-A22B-Instruct-2507 models
Limited to LoRA fine-tuning
No minimum charge
Price per token processed

Code Sandbox (vCPU)

$/mo

Code Sandbox vCPU usage
Price per vCPU per hour
Customize VM sandboxes

Code Sandbox (GiB RAM)

$/mo

Code Sandbox GiB RAM usage
Price per GiB RAM per hour
Customize VM sandboxes

Code Interpreter Session

$/mo

Code Interpreter session (60 minutes)
Price per session

Instant Cluster - NVIDIA HGX H100 SXM

$/mo

Ready to use, self-service GPUs
NVIDIA HGX H100 SXM
Price per hour per GPU

Instant Cluster - NVIDIA HGX H200

$/mo

Ready to use, self-service GPUs
NVIDIA HGX H200
Price per hour per GPU

Instant Cluster - NVIDIA HGX B200

$/mo

Ready to use, self-service GPUs
NVIDIA HGX B200
Price per hour per GPU

Reserved Cluster - NVIDIA GB200 NVL72 384GB HBM3e

$/mo

Dedicated capacity
Expert support
Hardware: NVIDIA GB200 NVL72
GPU Memory: 384GB HBM3e
Price per hour

Reserved Cluster - NVIDIA B200 192GB HBM3e

$/mo

Dedicated capacity
Expert support
Hardware: NVIDIA B200
GPU Memory: 192GB HBM3e
Price per hour

Reserved Cluster - NVIDIA H200 141GB HBM3e

$/mo

Dedicated capacity
Expert support
Hardware: NVIDIA H200
GPU Memory: 141GB HBM3e
Starting at price per hour

Reserved Cluster - NVIDIA H100 80GB HBM2e

$/mo

Dedicated capacity
Expert support
Hardware: NVIDIA H100
GPU Memory: 80GB HBM2e
Starting at price per hour

Reserved Cluster - NVIDIA A100 80GB HBM2e

$/mo

Dedicated capacity
Expert support
Hardware: NVIDIA A100
GPU Memory: 80GB HBM2e
Starting at price per hour

Frontier AI Factory

$/mo

Large-scale, custom-built private GPU clusters
NVIDIA Blackwell GPUs at scale
Custom quote required

Shared Filesystem Storage

$/mo

High-bandwidth, parallel filesystem
Colocated with compute
Price per GiB per month

Details

Pricing Tier

Freemium

Sponsor

Ad space