
Together AI
Together AI is a cloud platform that provides developers and AI researchers with access to high-performance GPU clusters and tools for building, training, fine-tuning, and running open-source generative AI models. It offers both serverless and dedicated endpoints for inference and training.
Pricing
Llama 4 Maverick (Input)
$/mo
- Llama 4 Maverick Input usage
- Price per 1M tokens
Llama 4 Maverick (Output)
$/mo
- Llama 4 Maverick Output usage
- Price per 1M tokens
Llama 4 Scout (Input)
$/mo
- Llama 4 Scout Input usage
- Price per 1M tokens
Llama 4 Scout (Output)
$/mo
- Llama 4 Scout Output usage
- Price per 1M tokens
Llama 3.3 70B Instruct-Turbo (Input)
$/mo
- Llama 3.3 70B Instruct-Turbo Input usage
- Price per 1M tokens
Llama 3.3 70B Instruct-Turbo (Output)
$/mo
- Llama 3.3 70B Instruct-Turbo Output usage
- Price per 1M tokens
Llama 3.2 3B Instruct Turbo (Input)
$/mo
- Llama 3.2 3B Instruct Turbo Input usage
- Price per 1M tokens
Llama 3.2 3B Instruct Turbo (Output)
$/mo
- Llama 3.2 3B Instruct Turbo Output usage
- Price per 1M tokens
Llama 3.1 405B Instruct Turbo (Input)
$/mo
- Llama 3.1 405B Instruct Turbo Input usage
- Price per 1M tokens
Llama 3.1 405B Instruct Turbo (Output)
$/mo
- Llama 3.1 405B Instruct Turbo Output usage
- Price per 1M tokens
Llama 3.1 70B Instruct Turbo (Input)
$/mo
- Llama 3.1 70B Instruct Turbo Input usage
- Price per 1M tokens
Llama 3.1 70B Instruct Turbo (Output)
$/mo
- Llama 3.1 70B Instruct Turbo Output usage
- Price per 1M tokens
Llama 3.1 8B Instruct Turbo (Input)
$/mo
- Llama 3.1 8B Instruct Turbo Input usage
- Price per 1M tokens
Llama 3.1 8B Instruct Turbo (Output)
$/mo
- Llama 3.1 8B Instruct Turbo Output usage
- Price per 1M tokens
Llama 3 8B Instruct Lite (Input)
$/mo
- Llama 3 8B Instruct Lite Input usage
- Price per 1M tokens
Llama 3 8B Instruct Lite (Output)
$/mo
- Llama 3 8B Instruct Lite Output usage
- Price per 1M tokens
Llama 3 70B Instruct Reference (Input)
$/mo
- Llama 3 70B Instruct Reference Input usage
- Price per 1M tokens
Llama 3 70B Instruct Reference (Output)
$/mo
- Llama 3 70B Instruct Reference Output usage
- Price per 1M tokens
Llama 3 70B Instruct Turbo (Input)
$/mo
- Llama 3 70B Instruct Turbo Input usage
- Price per 1M tokens
Llama 3 70B Instruct Turbo (Output)
$/mo
- Llama 3 70B Instruct Turbo Output usage
- Price per 1M tokens
LLaMA-2 (Input)
$/mo
- LLaMA-2 Input usage
- Price per 1M tokens
LLaMA-2 (Output)
$/mo
- LLaMA-2 Output usage
- Price per 1M tokens
DeepSeek-R1 (Input)
$/mo
- DeepSeek-R1 Input usage
- Price per 1M tokens
DeepSeek-R1 (Output)
$/mo
- DeepSeek-R1 Output usage
- Price per 1M tokens
DeepSeek R1 Distilled Qwen 14B (Input)
$/mo
- DeepSeek R1 Distilled Qwen 14B Input usage
- Price per 1M tokens
DeepSeek R1 Distilled Qwen 14B (Output)
$/mo
- DeepSeek R1 Distilled Qwen 14B Output usage
- Price per 1M tokens
DeepSeek R1 Distilled Llama 70B (Input)
$/mo
- DeepSeek R1 Distilled Llama 70B Input usage
- Price per 1M tokens
DeepSeek R1 Distilled Llama 70B (Output)
$/mo
- DeepSeek R1 Distilled Llama 70B Output usage
- Price per 1M tokens
DeepSeek R1-0528-tput (Input)
$/mo
- DeepSeek R1-0528-tput Input usage
- Price per 1M tokens
DeepSeek R1-0528-tput (Output)
$/mo
- DeepSeek R1-0528-tput Output usage
- Price per 1M tokens
DeepSeek-V3-1 (Input)
$/mo
- DeepSeek-V3-1 Input usage
- Price per 1M tokens
DeepSeek-V3-1 (Output)
$/mo
- DeepSeek-V3-1 Output usage
- Price per 1M tokens
DeepSeek-V3 (Input)
$/mo
- DeepSeek-V3 Input usage
- Price per 1M tokens
DeepSeek-V3 (Output)
$/mo
- DeepSeek-V3 Output usage
- Price per 1M tokens
gpt-oss-120B (Input)
$/mo
- gpt-oss-120B Input usage
- Price per 1M tokens
gpt-oss-120B (Output)
$/mo
- gpt-oss-120B Output usage
- Price per 1M tokens
gpt-oss-20B (Input)
$/mo
- gpt-oss-20B Input usage
- Price per 1M tokens
gpt-oss-20B (Output)
$/mo
- gpt-oss-20B Output usage
- Price per 1M tokens
Qwen3-Coder 480B A35B Instruct (Input)
$/mo
- Qwen3-Coder 480B A35B Instruct Input usage
- Price per 1M tokens
Qwen3-Coder 480B A35B Instruct (Output)
$/mo
- Qwen3-Coder 480B A35B Instruct Output usage
- Price per 1M tokens
Qwen3 235B A22B Instruct 2507 FP8 (Input)
$/mo
- Qwen3 235B A22B Instruct 2507 FP8 Input usage
- Price per 1M tokens
Qwen3 235B A22B Instruct 2507 FP8 (Output)
$/mo
- Qwen3 235B A22B Instruct 2507 FP8 Output usage
- Price per 1M tokens
Qwen3 235B A22B Thinking 2507 FP8 (Input)
$/mo
- Qwen3 235B A22B Thinking 2507 FP8 Input usage
- Price per 1M tokens
Qwen3 235B A22B Thinking 2507 FP8 (Output)
$/mo
- Qwen3 235B A22B Thinking 2507 FP8 Output usage
- Price per 1M tokens
Qwen3 235B A22B FP8 Throughput (Input)
$/mo
- Qwen3 235B A22B FP8 Throughput Input usage
- Price per 1M tokens
Qwen3 235B A22B FP8 Throughput (Output)
$/mo
- Qwen3 235B A22B FP8 Throughput Output usage
- Price per 1M tokens
Qwen 2.5 72B (Input)
$/mo
- Qwen 2.5 72B Input usage
- Price per 1M tokens
Qwen 2.5 72B (Output)
$/mo
- Qwen 2.5 72B Output usage
- Price per 1M tokens
Qwen2.5-VL 72B Instruct (Input)
$/mo
- Qwen2.5-VL 72B Instruct Input usage
- Price per 1M tokens
Qwen2.5-VL 72B Instruct (Output)
$/mo
- Qwen2.5-VL 72B Instruct Output usage
- Price per 1M tokens
Qwen2.5 Coder 32B Instruct (Input)
$/mo
- Qwen2.5 Coder 32B Instruct Input usage
- Price per 1M tokens
Qwen2.5 Coder 32B Instruct (Output)
$/mo
- Qwen2.5 Coder 32B Instruct Output usage
- Price per 1M tokens
Qwen2.5 7B Instruct Turbo (Input)
$/mo
- Qwen2.5 7B Instruct Turbo Input usage
- Price per 1M tokens
Qwen2.5 7B Instruct Turbo (Output)
$/mo
- Qwen2.5 7B Instruct Turbo Output usage
- Price per 1M tokens
Qwen QwQ-32B (Input)
$/mo
- Qwen QwQ-32B Input usage
- Price per 1M tokens
Qwen QwQ-32B (Output)
$/mo
- Qwen QwQ-32B Output usage
- Price per 1M tokens
GLM-4.5-Air (Input)
$/mo
- GLM-4.5-Air Input usage
- Price per 1M tokens
GLM-4.5-Air (Output)
$/mo
- GLM-4.5-Air Output usage
- Price per 1M tokens
Kimi K2 Instruct (Input)
$/mo
- Kimi K2 Instruct Input usage
- Price per 1M tokens
Kimi K2 Instruct (Output)
$/mo
- Kimi K2 Instruct Output usage
- Price per 1M tokens
Kimi K2 Thinking (Input)
$/mo
- Kimi K2 Thinking Input usage
- Price per 1M tokens
Kimi K2 Thinking (Output)
$/mo
- Kimi K2 Thinking Output usage
- Price per 1M tokens
Kimi K2 0905 (Input)
$/mo
- Kimi K2 0905 Input usage
- Price per 1M tokens
Kimi K2 0905 (Output)
$/mo
- Kimi K2 0905 Output usage
- Price per 1M tokens
Mistral (7B) Instruct v0.2 (Input)
$/mo
- Mistral (7B) Instruct v0.2 Input usage
- Price per 1M tokens
Mistral (7B) Instruct v0.2 (Output)
$/mo
- Mistral (7B) Instruct v0.2 Output usage
- Price per 1M tokens
Mistral Instruct (Input)
$/mo
- Mistral Instruct Input usage
- Price per 1M tokens
Mistral Instruct (Output)
$/mo
- Mistral Instruct Output usage
- Price per 1M tokens
Mistral Small 3 (Input)
$/mo
- Mistral Small 3 Input usage
- Price per 1M tokens
Mistral Small 3 (Output)
$/mo
- Mistral Small 3 Output usage
- Price per 1M tokens
Mixtral 8x7B Instruct v0.1 (Input)
$/mo
- Mixtral 8x7B Instruct v0.1 Input usage
- Price per 1M tokens
Mixtral 8x7B Instruct v0.1 (Output)
$/mo
- Mixtral 8x7B Instruct v0.1 Output usage
- Price per 1M tokens
Marin 8B Instruct (Input)
$/mo
- Marin 8B Instruct Input usage
- Price per 1M tokens
Marin 8B Instruct (Output)
$/mo
- Marin 8B Instruct Output usage
- Price per 1M tokens
Arcee AI AFM-4.5B (Input)
$/mo
- Arcee AI AFM-4.5B Input usage
- Price per 1M tokens
Arcee AI AFM-4.5B (Output)
$/mo
- Arcee AI AFM-4.5B Output usage
- Price per 1M tokens
Arcee AI Coder-Large (Input)
$/mo
- Arcee AI Coder-Large Input usage
- Price per 1M tokens
Arcee AI Coder-Large (Output)
$/mo
- Arcee AI Coder-Large Output usage
- Price per 1M tokens
Arcee AI Maestro (Input)
$/mo
- Arcee AI Maestro Input usage
- Price per 1M tokens
Arcee AI Maestro (Output)
$/mo
- Arcee AI Maestro Output usage
- Price per 1M tokens
Arcee AI Virtuoso-Large (Input)
$/mo
- Arcee AI Virtuoso-Large Input usage
- Price per 1M tokens
Arcee AI Virtuoso-Large (Output)
$/mo
- Arcee AI Virtuoso-Large Output usage
- Price per 1M tokens
Cogito v2 preview - 109B MoE (Input)
$/mo
- Cogito v2 preview - 109B MoE Input usage
- Price per 1M tokens
Cogito v2 preview - 109B MoE (Output)
$/mo
- Cogito v2 preview - 109B MoE Output usage
- Price per 1M tokens
Cogito v2 preview - 405B (Input)
$/mo
- Cogito v2 preview - 405B Input usage
- Price per 1M tokens
Cogito v2 preview - 405B (Output)
$/mo
- Cogito v2 preview - 405B Output usage
- Price per 1M tokens
Cogito v2 preview - 671B MoE (Input)
$/mo
- Cogito v2 preview - 671B MoE Input usage
- Price per 1M tokens
Cogito v2 preview - 671B MoE (Output)
$/mo
- Cogito v2 preview - 671B MoE Output usage
- Price per 1M tokens
Cogito v2 preview - 70B (Input)
$/mo
- Cogito v2 preview - 70B Input usage
- Price per 1M tokens
Cogito v2 preview - 70B (Output)
$/mo
- Cogito v2 preview - 70B Output usage
- Price per 1M tokens
Refuel LLM-2 (Input)
$/mo
- Refuel LLM-2 Input usage
- Price per 1M tokens
Refuel LLM-2 (Output)
$/mo
- Refuel LLM-2 Output usage
- Price per 1M tokens
Refuel LLM-2 Small (Input)
$/mo
- Refuel LLM-2 Small Input usage
- Price per 1M tokens
Refuel LLM-2 Small (Output)
$/mo
- Refuel LLM-2 Small Output usage
- Price per 1M tokens
Typhoon 2 70B Instruct (Input)
$/mo
- Typhoon 2 70B Instruct Input usage
- Price per 1M tokens
Typhoon 2 70B Instruct (Output)
$/mo
- Typhoon 2 70B Instruct Output usage
- Price per 1M tokens
gemma-3n-E4B-it (Input)
$/mo
- gemma-3n-E4B-it Input usage
- Price per 1M tokens
gemma-3n-E4B-it (Output)
$/mo
- gemma-3n-E4B-it Output usage
- Price per 1M tokens
FLUX.1 Krea [dev]
$/mo
- FLUX.1 Krea [dev] image generation
- Price per MP
- Default steps: 28
FLUX.1 Kontext [dev]
$/mo
- FLUX.1 Kontext [dev] image generation
- Price per MP
- Default steps: 28
FLUX.1 Kontext [pro]
$/mo
- FLUX.1 Kontext [pro] image generation
- Price per MP
- Default steps: 28
FLUX.1 Kontext [max]
$/mo
- FLUX.1 Kontext [max] image generation
- Price per MP
- Default steps: 28
FLUX1.1 [pro]
$/mo
- FLUX1.1 [pro] image generation
- Price per MP
FLUX.1 [dev]
$/mo
- FLUX.1 [dev] image generation
- Price per MP
- Default steps: 28
FLUX.1 [pro]
$/mo
- FLUX.1 [pro] image generation
- Price per MP
- Default steps: 28
FLUX.1 [schnell]
$/mo
- FLUX.1 [schnell] image generation
- Price per MP
- Default steps: 4
FLUX.1 Canny [pro]
$/mo
- FLUX.1 Canny [pro] image generation
- Price per MP
Google Imagen 4.0 Preview
$/mo
- Google Imagen 4.0 Preview image generation
- Price per MP
Google Imagen 4.0 Fast
$/mo
- Google Imagen 4.0 Fast image generation
- Price per MP
Google Imagen 4.0 Ultra
$/mo
- Google Imagen 4.0 Ultra image generation
- Price per MP
Gemini Flash Image 2.5 (Nano Banana)
$/mo
- Gemini Flash Image 2.5 (Nano Banana) image generation
- Price per MP
ByteDance Seedream 3.0
$/mo
- ByteDance Seedream 3.0 image generation
- Price per MP
ByteDance Seedream 4.0
$/mo
- ByteDance Seedream 4.0 image generation
- Price per MP
ByteDance SeedEdit
$/mo
- ByteDance SeedEdit image generation
- Price per MP
Qwen Image Edit
$/mo
- Qwen Image Edit image generation
- Price per MP
Qwen Image
$/mo
- Qwen Image image generation
- Price per MP
Juggernaut Pro Flux by RunDiffusion
$/mo
- Juggernaut Pro Flux by RunDiffusion image generation
- Price per MP
Juggernaut Lightning Flux by RunDiffusion
$/mo
- Juggernaut Lightning Flux by RunDiffusion image generation
- Price per MP
HiDream-I1-Full
$/mo
- HiDream-I1-Full image generation
- Price per MP
HiDream-I1-Dev
$/mo
- HiDream-I1-Dev image generation
- Price per MP
HiDream-I1-Fast
$/mo
- HiDream-I1-Fast image generation
- Price per MP
Ideogram 3.0
$/mo
- Ideogram 3.0 image generation
- Price per MP
Dreamshaper
$/mo
- Dreamshaper image generation
- Price per MP
SD XL
$/mo
- SD XL image generation
- Price per MP
Stable Diffusion 3
$/mo
- Stable Diffusion 3 image generation
- Price per MP
Cartesia Sonic-2
$/mo
- Cartesia Sonic-2 speech synthesis/processing
- Price per 1M characters
MiniMax 01 Director (720p/5s)
$/mo
- MiniMax 01 Director video generation
- Price per video (720p/5s)
MiniMax Hailuo 02 (768p/10s)
$/mo
- MiniMax Hailuo 02 video generation
- Price per video (768p/10s)
MiniMax Hailuo 02 (1080p/6s)
$/mo
- MiniMax Hailuo 02 video generation
- Price per video (1080p/6s)
Google Veo 2.0 (720p/5s)
$/mo
- Google Veo 2.0 video generation
- Price per video (720p/5s)
Google Veo 3.0 (720p/8s)
$/mo
- Google Veo 3.0 video generation
- Price per video (720p/8s)
Google Veo 3.0 + Audio (720p/8s with audio)
$/mo
- Google Veo 3.0 + Audio video generation
- Price per video (720p/8s with audio)
Google Veo 3.0 Fast (1080p/8s)
$/mo
- Google Veo 3.0 Fast video generation
- Price per video (1080p/8s)
Google Veo 3.0 Fast + Audio (1080p/8s with audio)
$/mo
- Google Veo 3.0 Fast + Audio video generation
- Price per video (1080p/8s with audio)
ByteDance Seedance 1.0 Lite (720p/5s)
$/mo
- ByteDance Seedance 1.0 Lite video generation
- Price per video (720p/5s)
ByteDance Seedance 1.0 Pro (1080p/5s)
$/mo
- ByteDance Seedance 1.0 Pro video generation
- Price per video (1080p/5s)
PixVerse v5 (1080p/5s)
$/mo
- PixVerse v5 video generation
- Price per video (1080p/5s)
Kling 2.1 Master (1080p/5s)
$/mo
- Kling 2.1 Master video generation
- Price per video (1080p/5s)
Kling 2.1 Standard (720p/5s)
$/mo
- Kling 2.1 Standard video generation
- Price per video (720p/5s)
Kling 2.1 Pro (1080p/5s)
$/mo
- Kling 2.1 Pro video generation
- Price per video (1080p/5s)
Kling 2.0 Master (1080p/5s)
$/mo
- Kling 2.0 Master video generation
- Price per video (1080p/5s)
Kling 1.6 Standard (720p/5s)
$/mo
- Kling 1.6 Standard video generation
- Price per video (720p/5s)
Kling 1.6 Pro (1080p/5s)
$/mo
- Kling 1.6 Pro video generation
- Price per video (1080p/5s)
Wan 2.2 I2V (720p/5s)
$/mo
- Wan 2.2 I2V video generation
- Price per video (720p/5s)
Wan 2.2 T2V (720p/8s)
$/mo
- Wan 2.2 T2V video generation
- Price per video (720p/8s)
Vidu 2.0 (720p/8s)
$/mo
- Vidu 2.0 video generation
- Price per video (720p/8s)
Vidu Q1 (1080p/5s)
$/mo
- Vidu Q1 video generation
- Price per video (1080p/5s)
Sora 2 (720p/8s)
$/mo
- Sora 2 video generation
- Price per video (720p/8s)
Sora 2 Pro (720p/8s)
$/mo
- Sora 2 Pro video generation
- Price per video (720p/8s)
Sora 2 Pro (1080p/8s)
$/mo
- Sora 2 Pro video generation
- Price per video (1080p/8s)
Whisper Large v3
$/mo
- Whisper Large v3 automatic speech recognition
- Price per audio minute
BGE-Base-EN v1.5
$/mo
- BGE-Base-EN v1.5 vector embeddings
- Price per 1M tokens
BGE-Large-EN v1.5
$/mo
- BGE-Large-EN v1.5 vector embeddings
- Price per 1M tokens
GTE ModernBERT base
$/mo
- GTE ModernBERT base vector embeddings
- Price per 1M tokens
Multilingual e5 large instruct
$/mo
- Multilingual e5 large instruct vector embeddings
- Price per 1M tokens
M2-BERT 80M 32K Retrieval
$/mo
- M2-BERT 80M 32K Retrieval vector embeddings
- Price per 1M tokens
Mxbai Rerank Large V2
$/mo
- Mxbai Rerank Large V2 search relevance reranking
- Price per 1M tokens
Salesforce Llama Rank V1 (8B)
$/mo
- Salesforce Llama Rank V1 (8B) search relevance reranking
- Price per 1M tokens
VirtueGuard Text Lite
$/mo
- VirtueGuard Text Lite content filtering and classification
- Price per 1M tokens
Llama Guard 4 12B
$/mo
- Llama Guard 4 12B content filtering and classification
- Price per 1M tokens
Llama Guard 3 11B Vision Turbo
$/mo
- Llama Guard 3 11B Vision Turbo content filtering and classification
- Price per 1M tokens
Llama Guard 3 8B
$/mo
- Llama Guard 3 8B content filtering and classification
- Price per 1M tokens
Llama Guard 2 8B
$/mo
- Llama Guard 2 8B content filtering and classification
- Price per 1M tokens
Dedicated Endpoint - 1x H200 141GB
$/mo
- Guaranteed performance
- Support for custom models
- Autoscaling & traffic spike handling
- Hardware: 1x H200 141GB
- Price per hour
Dedicated Endpoint - 1x H100 80GB
$/mo
- Guaranteed performance
- Support for custom models
- Autoscaling & traffic spike handling
- Hardware: 1x H100 80GB
- Price per hour
Dedicated Endpoint - 1x A100 SXM 80GB
$/mo
- Guaranteed performance
- Support for custom models
- Autoscaling & traffic spike handling
- Hardware: 1x A100 SXM 80GB
- Price per hour
Dedicated Endpoint - 1x A100 SXM 40GB
$/mo
- Guaranteed performance
- Support for custom models
- Autoscaling & traffic spike handling
- Hardware: 1x A100 SXM 40GB
- Price per hour
Dedicated Endpoint - 1x A100 PCIe 80GB
$/mo
- Guaranteed performance
- Support for custom models
- Autoscaling & traffic spike handling
- Hardware: 1x A100 PCIe 80GB
- Price per hour
Dedicated Endpoint - 1x L40S 48GB
$/mo
- Guaranteed performance
- Support for custom models
- Autoscaling & traffic spike handling
- Hardware: 1x L40S 48GB
- Price per hour
Fine-tuning Up to 16B - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA (Up to 16B model size)
- Price per token processed
Fine-tuning Up to 16B - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA (Up to 16B model size)
- Price per token processed
Fine-tuning Up to 16B - Supervised Full Fine-Tuning
$/mo
- Supervised Full Fine-Tuning (Up to 16B model size)
- Price per token processed
Fine-tuning Up to 16B - Direct Preference Optimization Full Fine-Tuning
$/mo
- Direct Preference Optimization Full Fine-Tuning (Up to 16B model size)
- Price per token processed
Fine-tuning 17B-69B - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA (17B-69B model size)
- Price per token processed
Fine-tuning 17B-69B - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA (17B-69B model size)
- Price per token processed
Fine-tuning 17B-69B - Supervised Full Fine-Tuning
$/mo
- Supervised Full Fine-Tuning (17B-69B model size)
- Price per token processed
Fine-tuning 17B-69B - Direct Preference Optimization Full Fine-Tuning
$/mo
- Direct Preference Optimization Full Fine-Tuning (17B-69B model size)
- Price per token processed
Fine-tuning 70-100B - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA (70-100B model size)
- Price per token processed
Fine-tuning 70-100B - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA (70-100B model size)
- Price per token processed
Fine-tuning 70-100B - Supervised Full Fine-Tuning
$/mo
- Supervised Full Fine-Tuning (70-100B model size)
- Price per token processed
Fine-tuning 70-100B - Direct Preference Optimization Full Fine-Tuning
$/mo
- Direct Preference Optimization Full Fine-Tuning (70-100B model size)
- Price per token processed
Specialized Fine-tuning gpt-oss-120B - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA for gpt-oss-120B model
- Limited to LoRA fine-tuning
- Minimum charge: $6.00
- Price per token processed
Specialized Fine-tuning gpt-oss-120B - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA for gpt-oss-120B model
- Limited to LoRA fine-tuning
- Minimum charge: $6.00
- Price per token processed
Specialized Fine-tuning Llama 4 Scout Instruct - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA for Llama 4 Scout Instruct model
- Limited to LoRA fine-tuning
- Minimum charge: $6.00
- Price per token processed
Specialized Fine-tuning Llama 4 Scout Instruct - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA for Llama 4 Scout Instruct model
- Limited to LoRA fine-tuning
- Minimum charge: $6.00
- Price per token processed
Specialized Fine-tuning Llama 4 Maverick Instruct - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA for Llama 4 Maverick Instruct model
- Limited to LoRA fine-tuning
- Minimum charge: $16.00
- Price per token processed
Specialized Fine-tuning Llama 4 Maverick Instruct - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA for Llama 4 Maverick Instruct model
- Limited to LoRA fine-tuning
- Minimum charge: $16.00
- Price per token processed
Specialized Fine-tuning DeepSeek Models - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA for DeepSeek-R1, DeepSeek-R1-0528, DeepSeek-V3, DeepSeek-V3-0324, DeepSeek-V3.1, DeepSeek-V3.1-Base models
- Limited to LoRA fine-tuning
- Minimum charge: $20.00
- Price per token processed
Specialized Fine-tuning DeepSeek Models - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA for DeepSeek-R1, DeepSeek-R1-0528, DeepSeek-V3, DeepSeek-V3-0324, DeepSeek-V3.1, DeepSeek-V3.1-Base models
- Limited to LoRA fine-tuning
- Minimum charge: $20.00
- Price per token processed
Specialized Fine-tuning Qwen3-Coder-480B-A35B-Instruct - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA for Qwen3-Coder-480B-A35B-Instruct model
- Limited to LoRA fine-tuning
- Minimum charge: $18.00
- Price per token processed
Specialized Fine-tuning Qwen3-Coder-480B-A35B-Instruct - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA for Qwen3-Coder-480B-A35B-Instruct model
- Limited to LoRA fine-tuning
- Minimum charge: $18.00
- Price per token processed
Specialized Fine-tuning Qwen3-235B-A22B Models - Supervised Fine-Tuning LoRA
$/mo
- Supervised Fine-Tuning LoRA for Qwen3-235B-A22B, Qwen3-235B-A22B-Instruct-2507 models
- Limited to LoRA fine-tuning
- No minimum charge
- Price per token processed
Specialized Fine-tuning Qwen3-235B-A22B Models - Direct Preference Optimization LoRA
$/mo
- Direct Preference Optimization LoRA for Qwen3-235B-A22B, Qwen3-235B-A22B-Instruct-2507 models
- Limited to LoRA fine-tuning
- No minimum charge
- Price per token processed
Code Sandbox (vCPU)
$/mo
- Code Sandbox vCPU usage
- Price per vCPU per hour
- Customize VM sandboxes
Code Sandbox (GiB RAM)
$/mo
- Code Sandbox GiB RAM usage
- Price per GiB RAM per hour
- Customize VM sandboxes
Code Interpreter Session
$/mo
- Code Interpreter session (60 minutes)
- Price per session
Instant Cluster - NVIDIA HGX H100 SXM
$/mo
- Ready to use, self-service GPUs
- NVIDIA HGX H100 SXM
- Price per hour per GPU
Instant Cluster - NVIDIA HGX H200
$/mo
- Ready to use, self-service GPUs
- NVIDIA HGX H200
- Price per hour per GPU
Instant Cluster - NVIDIA HGX B200
$/mo
- Ready to use, self-service GPUs
- NVIDIA HGX B200
- Price per hour per GPU
Reserved Cluster - NVIDIA GB200 NVL72 384GB HBM3e
$/mo
- Dedicated capacity
- Expert support
- Hardware: NVIDIA GB200 NVL72
- GPU Memory: 384GB HBM3e
- Price per hour
Reserved Cluster - NVIDIA B200 192GB HBM3e
$/mo
- Dedicated capacity
- Expert support
- Hardware: NVIDIA B200
- GPU Memory: 192GB HBM3e
- Price per hour
Reserved Cluster - NVIDIA H200 141GB HBM3e
$/mo
- Dedicated capacity
- Expert support
- Hardware: NVIDIA H200
- GPU Memory: 141GB HBM3e
- Starting price per hour
Reserved Cluster - NVIDIA H100 80GB HBM2e
$/mo
- Dedicated capacity
- Expert support
- Hardware: NVIDIA H100
- GPU Memory: 80GB HBM2e
- Starting price per hour
Reserved Cluster - NVIDIA A100 80GB HBM2e
$/mo
- Dedicated capacity
- Expert support
- Hardware: NVIDIA A100
- GPU Memory: 80GB HBM2e
- Starting price per hour
Frontier AI Factory
$/mo
- Large-scale, custom-built private GPU clusters
- NVIDIA Blackwell GPUs at scale
- Custom quote required
Shared Filesystem Storage
$/mo
- High-bandwidth, parallel filesystem
- Colocated with compute
- Price per GiB per month
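The billing units listed above (per 1M tokens, per token processed with an optional minimum charge, per hour per GPU) can be turned into a rough cost estimate. The sketch below is illustrative only: the function names are hypothetical, every price in the usage lines is a made-up placeholder because this listing does not show actual rates, and treating the minimum charge as a floor on the job total is an assumption about how it is applied.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def token_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate inference cost when input and output are priced per 1M tokens."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_m)
    return input_cost + output_cost


def fine_tuning_cost(tokens_processed, price_per_token, minimum_charge="0"):
    """Estimate a fine-tuning job billed per token processed.

    Assumption: the minimum charge acts as a floor on the job total.
    """
    usage = Decimal(tokens_processed) * Decimal(price_per_token)
    return max(usage, Decimal(minimum_charge))


def hourly_cost(hours, price_per_hour, gpus=1):
    """Estimate cost for hardware billed per hour (optionally per GPU)."""
    return Decimal(hours) * Decimal(price_per_hour) * Decimal(gpus)


if __name__ == "__main__":
    # Placeholder prices, NOT actual Together AI rates.
    print(token_cost(2_000_000, 500_000, "0.20", "0.60"))
    print(fine_tuning_cost(1_000_000, "0.000003", minimum_charge="6.00"))
    print(hourly_cost(3, "2.50", gpus=8))
```

Prices are passed as strings and handled with `Decimal` so that small per-token rates do not pick up binary floating-point rounding error. Replace the placeholders with the rates published by the provider before relying on any result.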
Details
Pricing Tier
Freemium
Categories
Developer Tools, Data Analysis
Target Audience
General Public, Startups
Integrations
Apriel, Kimi K2, and 4 more