AssemblyAI

AssemblyAI operates on a Pay-As-You-Go model. Pricing for pre-recorded audio is based on the duration of the audio file submitted and the speech model selected. A generous free plan is available for getting started.

Use tool

Pricing

Free Tier

$/mo

Access to industry-leading Speech-to-Text and Audio Intelligence models
Up to 185 hours of pre-recorded audio transcription for free
Up to 333 hours of streaming audio transcription for free
Up to 5 new streams per minute
Developer docs, community support, and resources
$50 in credits to use towards Speech-to-Text APIs

Universal Pre-recorded Speech-to-Text (Pay-as-you-go)

$/mo

Fast, accurate transcription across 99 languages
Exceptional accuracy out of the box
Built-in diarization, language detection, formatting, filler words, keyterms prompting, custom spelling

Universal Pre-recorded Speech-to-Text (Custom)

$/mo

Custom rate limits
Enhanced concurrency
Enterprise-grade flexibility tailored to AI workloads
Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
Dedicated technical support and customized SLAs and SLOs
BAA for HIPAA and compliance with EU Data Residency standards
Self-hosted deployments (On-prem, EU, VPC)

Slam-1 Beta Pre-recorded Speech-to-Text (Pay-as-you-go)

$/mo

Highest accuracy transcription powered by LLM intelligence
Understands context, not just words
Only available in English

Universal-Streaming Speech-to-Text (Pay-as-you-go)

$/mo

Ultra-fast, ultra-accurate real-time transcription
Built-in turn detection
Unlimited concurrency
Auto punctuation and casing
Next-gen end-of-turn detection
ITM/formatting

Universal-Streaming Multilingual Speech-to-Text (Pay-as-you-go)

$/mo

Ultra-fast, ultra-accurate real-time transcription in six languages
Built-in turn detection
Unlimited concurrency

Keyterms Prompting Add-on (Pay-as-you-go)

$/mo

Improve recognition accuracy for specific words and phrases

Universal-Streaming Speech-to-Text (Custom)

$/mo

Custom rate limits
Enhanced concurrency
Enterprise-grade flexibility tailored to AI workloads
Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
Dedicated technical support and customized SLAs and SLOs
BAA for HIPAA and compliance with EU Data Residency standards
Self-hosted deployments (On-prem, EU, VPC)

Speech Understanding (Custom)

$/mo

Custom rate limits
Enhanced concurrency
Enterprise-grade flexibility tailored to AI workloads
Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
Dedicated technical support and customized SLAs and SLOs
BAA for HIPAA and compliance with EU Data Residency standards
Self-hosted deployments (On-prem, EU, VPC)

Guardrails (Custom)

$/mo

Custom rate limits
Enhanced concurrency
Enterprise-grade flexibility tailored to AI workloads
Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
Dedicated technical support and customized SLAs and SLOs
BAA for HIPAA and compliance with EU Data Residency standards
Self-hosted deployments (On-prem, EU, VPC)

GPT-5 LLM

$/mo

Input: $1.25 / 1m tokens
Output: $10.00 / 1m tokens

GPT-5-Mini LLM

$/mo

Input: $0.25 / 1m tokens
Output: $2.00 / 1m tokens

GPT-5 Nano LLM

$/mo

Input: $0.05 / 1m tokens
Output: $0.40 / 1m tokens

GPT 4.1 LLM

$/mo

Input: $2.00 / 1m tokens
Output: $8.00 / 1m tokens

gpt-oss-20b LLM

$/mo

Input: $0.07 / 1m tokens
Output: $0.30 / 1m tokens

gpt-oss-120b LLM

$/mo

Input: $0.15 / 1m tokens
Output: $0.60 / 1m tokens

ChatGPT-4o LLM

$/mo

Input: $5.00 / 1m tokens
Output: $15.00 / 1m tokens

Gemini 2.5 Flash Lite LLM

$/mo

Input: $0.10 / 1m tokens
Output: $0.40 / 1m tokens

Gemini 2.5 Flash LLM

$/mo

Input: $0.30 / 1m tokens
Output: $2.50 / 1m tokens

Gemini 2.5 Pro LLM

$/mo

Input: $1.25 / 1m tokens
Output: $10.00 / 1m tokens

Claude 4.5 Sonnet LLM

$/mo

Input: $3.00 / 1m tokens
Output: $15.00 / 1m tokens

Claude 4.5 Haiku LLM

$/mo

Input: $1.00 / 1m tokens
Output: $5.00 / 1m tokens

Claude 4 Sonnet LLM

$/mo

Input: $3.00 / 1m tokens
Output: $15.00 / 1m tokens

Claude 4 Opus LLM

$/mo

Input: $15.00 / 1m tokens
Output: $75.00 / 1m tokens

Claude 3.5 Haiku LLM

$/mo

Input: $0.80 / 1m tokens
Output: $4.00 / 1m tokens

Claude 3 Haiku LLM

$/mo

Input: $0.25 / 1m tokens
Output: $1.25 / 1m tokens

LLM Models (Custom)

$/mo

Custom rate limits
Enhanced concurrency
Enterprise-grade flexibility tailored to AI workloads
Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
Dedicated technical support and customized SLAs and SLOs
BAA for HIPAA and compliance with EU Data Residency standards
Self-hosted deployments (On-prem, EU, VPC)

Details

Pricing Tier

Freemium

Sponsor

Ad space