
AssemblyAI
AssemblyAI uses a pay-as-you-go model. Pricing for pre-recorded audio is based on the duration of the submitted audio file and the speech model selected. A free tier with $50 in credits is available for getting started.
Pricing
Free Tier
- Access to industry-leading Speech-to-Text and Audio Intelligence models
- Up to 185 hours of pre-recorded audio transcription for free
- Up to 333 hours of streaming audio transcription for free
- Up to 5 new streams per minute
- Developer docs, community support, and resources
- $50 in credits to use towards Speech-to-Text APIs
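The Free Tier figures above can be related to one another with a little arithmetic. The sketch below assumes, and the listing does not confirm, that the hour limits are simply the $50 credit expressed as audio time, so dividing the credit by each limit gives an implied hourly rate. The function name `implied_hourly_rate` is illustrative only.

```python
# Figures copied from the Free Tier list above.
FREE_CREDITS_USD = 50.0
FREE_PRERECORDED_HOURS = 185
FREE_STREAMING_HOURS = 333


def implied_hourly_rate(credits_usd: float, hours: float) -> float:
    """Per-hour rate implied by spending all credits on `hours` of audio.

    Assumption: the hour limit and the credit amount describe the same
    allowance. The listing does not state this explicitly.
    """
    if hours <= 0:
        raise ValueError("hours must be positive")
    return credits_usd / hours


prerecorded_rate = implied_hourly_rate(FREE_CREDITS_USD, FREE_PRERECORDED_HOURS)
streaming_rate = implied_hourly_rate(FREE_CREDITS_USD, FREE_STREAMING_HOURS)

print(f"pre-recorded: ~${prerecorded_rate:.2f} per hour")
print(f"streaming:    ~${streaming_rate:.2f} per hour")
```

Under that assumption the implied rates come to roughly $0.27 per hour for pre-recorded audio and $0.15 per hour for streaming. Treat these as estimates, not published prices.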
Universal Pre-recorded Speech-to-Text (Pay-as-you-go)
- Fast, accurate transcription across 99 languages
- Exceptional accuracy out of the box
- Built-in diarization, language detection, formatting, filler words, keyterms prompting, custom spelling
Universal Pre-recorded Speech-to-Text (Custom)
- Custom rate limits
- Enhanced concurrency
- Enterprise-grade flexibility tailored to AI workloads
- Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
- Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
- Dedicated technical support and customized SLAs and SLOs
- BAA for HIPAA and compliance with EU Data Residency standards
- Self-hosted deployments (On-prem, EU, VPC)
Slam-1 Beta Pre-recorded Speech-to-Text (Pay-as-you-go)
- Highest accuracy transcription powered by LLM intelligence
- Understands context, not just words
- Only available in English
Universal-Streaming Speech-to-Text (Pay-as-you-go)
- Ultra-fast, ultra-accurate real-time transcription
- Built-in turn detection
- Unlimited concurrency
- Auto punctuation and casing
- Next-gen end-of-turn detection
- ITN/formatting
Universal-Streaming Multilingual Speech-to-Text (Pay-as-you-go)
- Ultra-fast, ultra-accurate real-time transcription in six languages
- Built-in turn detection
- Unlimited concurrency
Keyterms Prompting Add-on (Pay-as-you-go)
- Improve recognition accuracy for specific words and phrases
Universal-Streaming Speech-to-Text (Custom)
- Custom rate limits
- Enhanced concurrency
- Enterprise-grade flexibility tailored to AI workloads
- Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
- Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
- Dedicated technical support and customized SLAs and SLOs
- BAA for HIPAA and compliance with EU Data Residency standards
- Self-hosted deployments (On-prem, EU, VPC)
Speech Understanding (Custom)
- Custom rate limits
- Enhanced concurrency
- Enterprise-grade flexibility tailored to AI workloads
- Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
- Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
- Dedicated technical support and customized SLAs and SLOs
- BAA for HIPAA and compliance with EU Data Residency standards
- Self-hosted deployments (On-prem, EU, VPC)
Guardrails (Custom)
- Custom rate limits
- Enhanced concurrency
- Enterprise-grade flexibility tailored to AI workloads
- Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
- Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
- Dedicated technical support and customized SLAs and SLOs
- BAA for HIPAA and compliance with EU Data Residency standards
- Self-hosted deployments (On-prem, EU, VPC)
GPT-5 LLM
- Input: $1.25 / 1M tokens
- Output: $10.00 / 1M tokens
GPT-5-Mini LLM
- Input: $0.25 / 1M tokens
- Output: $2.00 / 1M tokens
GPT-5 Nano LLM
- Input: $0.05 / 1M tokens
- Output: $0.40 / 1M tokens
GPT 4.1 LLM
- Input: $2.00 / 1M tokens
- Output: $8.00 / 1M tokens
gpt-oss-20b LLM
- Input: $0.07 / 1M tokens
- Output: $0.30 / 1M tokens
gpt-oss-120b LLM
- Input: $0.15 / 1M tokens
- Output: $0.60 / 1M tokens
ChatGPT-4o LLM
- Input: $5.00 / 1M tokens
- Output: $15.00 / 1M tokens
Gemini 2.5 Flash Lite LLM
- Input: $0.10 / 1M tokens
- Output: $0.40 / 1M tokens
Gemini 2.5 Flash LLM
- Input: $0.30 / 1M tokens
- Output: $2.50 / 1M tokens
Gemini 2.5 Pro LLM
- Input: $1.25 / 1M tokens
- Output: $10.00 / 1M tokens
Claude 4.5 Sonnet LLM
- Input: $3.00 / 1M tokens
- Output: $15.00 / 1M tokens
Claude 4.5 Haiku LLM
- Input: $1.00 / 1M tokens
- Output: $5.00 / 1M tokens
Claude 4 Sonnet LLM
- Input: $3.00 / 1M tokens
- Output: $15.00 / 1M tokens
Claude 4 Opus LLM
- Input: $15.00 / 1M tokens
- Output: $75.00 / 1M tokens
Claude 3.5 Haiku LLM
- Input: $0.80 / 1M tokens
- Output: $4.00 / 1M tokens
Claude 3 Haiku LLM
- Input: $0.25 / 1M tokens
- Output: $1.25 / 1M tokens
LLM Models (Custom)
- Custom rate limits
- Enhanced concurrency
- Enterprise-grade flexibility tailored to AI workloads
- Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
- Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
- Dedicated technical support and customized SLAs and SLOs
- BAA for HIPAA and compliance with EU Data Residency standards
- Self-hosted deployments (On-prem, EU, VPC)
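Because the LLM entries above are priced separately for input and output tokens, the cost of a request is the sum of the two token counts, each multiplied by its per-million rate. The following is a minimal sketch of that arithmetic using a handful of the listed prices. The table name `PRICES_PER_MILLION` and the function `estimate_cost` are hypothetical helpers, not part of any AssemblyAI SDK.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the listing above.
# Only a few models are included; extend the table as needed.
PRICES_PER_MILLION = {
    "GPT-5": (Decimal("1.25"), Decimal("10.00")),
    "Gemini 2.5 Flash": (Decimal("0.30"), Decimal("2.50")),
    "Claude 4.5 Sonnet": (Decimal("3.00"), Decimal("15.00")),
    "Claude 3 Haiku": (Decimal("0.25"), Decimal("1.25")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_price, output_price = PRICES_PER_MILLION[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


# Example: 200,000 input tokens and 50,000 output tokens on GPT-5.
cost = estimate_cost("GPT-5", 200_000, 50_000)
print(f"estimated cost: ${cost:.4f}")
```

For the example request, the input side contributes $0.25 and the output side $0.50, for an estimated $0.75. `Decimal` is used instead of floats so that small per-token amounts do not accumulate rounding error across many requests.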
Details
Pricing Tier
Freemium
Categories
Developer Tools, Data Analysis
Target Audience
General Public, Startups