AssemblyAI

AssemblyAI

AssemblyAI operates on a Pay-As-You-Go model. Pricing for pre-recorded audio is based on the duration of the audio file submitted and the speech model selected. A generous free plan is available for getting started.

Pricing

Free Tier
$/mo
  • Access to industry-leading Speech-to-Text and Audio Intelligence models
  • Up to 185 hours of pre-recorded audio transcription for free
  • Up to 333 hours of streaming audio transcription for free
  • Up to 5 new streams per minute
  • Developer docs, community support, and resources
  • $50 in credits to use towards Speech-to-Text APIs
Universal Pre-recorded Speech-to-Text (Pay-as-you-go)
$/mo
  • Fast, accurate transcription across 99 languages
  • Exceptional accuracy out of the box
  • Built-in diarization, language detection, formatting, filler words, keyterms prompting, custom spelling
Universal Pre-recorded Speech-to-Text (Custom)
$/mo
  • Custom rate limits
  • Enhanced concurrency
  • Enterprise-grade flexibility tailored to AI workloads
  • Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
  • Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
  • Dedicated technical support and customized SLAs and SLOs
  • BAA for HIPAA and compliance with EU Data Residency standards
  • Self-hosted deployments (On-prem, EU, VPC)
Slam-1 Beta Pre-recorded Speech-to-Text (Pay-as-you-go)
$/mo
  • Highest accuracy transcription powered by LLM intelligence
  • Understands context, not just words
  • Only available in English
Universal-Streaming Speech-to-Text (Pay-as-you-go)
$/mo
  • Ultra-fast, ultra-accurate real-time transcription
  • Built-in turn detection
  • Unlimited concurrency
  • Auto punctuation and casing
  • Next-gen end-of-turn detection
  • ITM/formatting
Universal-Streaming Multilingual Speech-to-Text (Pay-as-you-go)
$/mo
  • Ultra-fast, ultra-accurate real-time transcription in six languages
  • Built-in turn detection
  • Unlimited concurrency
Keyterms Prompting Add-on (Pay-as-you-go)
$/mo
  • Improve recognition accuracy for specific words and phrases
Universal-Streaming Speech-to-Text (Custom)
$/mo
  • Custom rate limits
  • Enhanced concurrency
  • Enterprise-grade flexibility tailored to AI workloads
  • Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
  • Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
  • Dedicated technical support and customized SLAs and SLOs
  • BAA for HIPAA and compliance with EU Data Residency standards
  • Self-hosted deployments (On-prem, EU, VPC)
Speech Understanding (Custom)
$/mo
  • Custom rate limits
  • Enhanced concurrency
  • Enterprise-grade flexibility tailored to AI workloads
  • Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
  • Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
  • Dedicated technical support and customized SLAs and SLOs
  • BAA for HIPAA and compliance with EU Data Residency standards
  • Self-hosted deployments (On-prem, EU, VPC)
Guardrails (Custom)
$/mo
  • Custom rate limits
  • Enhanced concurrency
  • Enterprise-grade flexibility tailored to AI workloads
  • Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
  • Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
  • Dedicated technical support and customized SLAs and SLOs
  • BAA for HIPAA and compliance with EU Data Residency standards
  • Self-hosted deployments (On-prem, EU, VPC)
GPT-5 LLM
$/mo
  • Input: $1.25 / 1m tokens
  • Output: $10.00 / 1m tokens
GPT-5-Mini LLM
$/mo
  • Input: $0.25 / 1m tokens
  • Output: $2.00 / 1m tokens
GPT-5 Nano LLM
$/mo
  • Input: $0.05 / 1m tokens
  • Output: $0.40 / 1m tokens
GPT 4.1 LLM
$/mo
  • Input: $2.00 / 1m tokens
  • Output: $8.00 / 1m tokens
gpt-oss-20b LLM
$/mo
  • Input: $0.07 / 1m tokens
  • Output: $0.30 / 1m tokens
gpt-oss-120b LLM
$/mo
  • Input: $0.15 / 1m tokens
  • Output: $0.60 / 1m tokens
ChatGPT-4o LLM
$/mo
  • Input: $5.00 / 1m tokens
  • Output: $15.00 / 1m tokens
Gemini 2.5 Flash Lite LLM
$/mo
  • Input: $0.10 / 1m tokens
  • Output: $0.40 / 1m tokens
Gemini 2.5 Flash LLM
$/mo
  • Input: $0.30 / 1m tokens
  • Output: $2.50 / 1m tokens
Gemini 2.5 Pro LLM
$/mo
  • Input: $1.25 / 1m tokens
  • Output: $10.00 / 1m tokens
Claude 4.5 Sonnet LLM
$/mo
  • Input: $3.00 / 1m tokens
  • Output: $15.00 / 1m tokens
Claude 4.5 Haiku LLM
$/mo
  • Input: $1.00 / 1m tokens
  • Output: $5.00 / 1m tokens
Claude 4 Sonnet LLM
$/mo
  • Input: $3.00 / 1m tokens
  • Output: $15.00 / 1m tokens
Claude 4 Opus LLM
$/mo
  • Input: $15.00 / 1m tokens
  • Output: $75.00 / 1m tokens
Claude 3.5 Haiku LLM
$/mo
  • Input: $0.80 / 1m tokens
  • Output: $4.00 / 1m tokens
Claude 3 Haiku LLM
$/mo
  • Input: $0.25 / 1m tokens
  • Output: $1.25 / 1m tokens
LLM Models (Custom)
$/mo
  • Custom rate limits
  • Enhanced concurrency
  • Enterprise-grade flexibility tailored to AI workloads
  • Unlimited access to Speech-to-Text, Audio Intelligence, and LeMUR
  • Unlimited concurrent streams and pre-recorded concurrency starting at 200 files
  • Dedicated technical support and customized SLAs and SLOs
  • BAA for HIPAA and compliance with EU Data Residency standards
  • Self-hosted deployments (On-prem, EU, VPC)

Details

Pricing Tier

Freemium

Categories

Developer ToolsData Analysis

Target Audience

General PublicStartups

Sponsor

Ad space