
Speechmatics
Speechmatics is a speech intelligence company that provides advanced Automatic Speech Recognition (ASR), Speech-to-Text (STT), Text-to-Speech (TTS), and voice AI infrastructure. Their technology enables organizations to transcribe, translate, summarize, and analyze voice data with high accuracy across multiple languages and accents.
Pricing
Free
$/mo
- 480 minutes free Speech-to-Text per month
- 1 million characters (~20hrs) free Text-to-Speech per month
- Speech-to-Text: 55+ languages
- Speech-to-Text: Standard and Enhanced accuracy
- Speech-to-Text: Industry-leading accent coverage
- Speech-to-Text: Real-time latency <1s
- Speech-to-Text: Language identification
- Speech-to-Text: Speaker diarization
- Speech-to-Text: Custom dictionary
- Speech-to-Text: Precise timestamps
- Speech-to-Text: Advanced punctuation and casing
- Speech-to-Text: Numeral formatting
- Speech-to-Text: Profanity and disfluency detection
- Speech-to-Text: Multi-channel support
- Speech-to-Text: Subtitle formatting options
- Speech-to-Text: Audio events
- Text-to-Speech: Low-latency (ideal for voice agents)
- Text-to-Speech: English (more languages coming soon)
- SaaS deployment
- Real-time session concurrency: 2 sessions
- Batch job creation: 1 job per second
- Voice Agent conversation concurrency: 3 conversations
Pro
$/mo
- 480 minutes free Speech-to-Text per month
- 1 million characters (~20hrs) free Text-to-Speech per month
- Speech-to-Text: 55+ languages
- Speech-to-Text: Standard and Enhanced accuracy
- Speech-to-Text: Industry-leading accent coverage
- Speech-to-Text: Real-time latency <1s
- Speech-to-Text: Language identification
- Speech-to-Text: Speaker diarization
- Speech-to-Text: Custom dictionary
- Speech-to-Text: Precise timestamps
- Speech-to-Text: Advanced punctuation and casing
- Speech-to-Text: Numeral formatting
- Speech-to-Text: Profanity and disfluency detection
- Speech-to-Text: Multi-channel support
- Speech-to-Text: Subtitle formatting options
- Speech-to-Text: Audio events
- Text-to-Speech: Low-latency (ideal for voice agents)
- Text-to-Speech: English (more languages coming soon)
- SaaS deployment
- Real-time session concurrency: 50 sessions
- Batch job creation: 10 jobs per second
- Voice Agent conversation concurrency: 6 conversations
- Online email support
- 20% discount over 500 hr/month
Enterprise
$/mo
- Includes all features from other plans, including audio alignment
- No rate limits
- Privacy-first deployment options
- Multi-region cloud options
- Custom models
- SaaS or On-premises deployment
- Lowest-latency, highest privacy with STT & TTS in your environment
- Highest concurrency
- Custom voice development
- Custom language development
- Volume discounts available
- Prioritized service and support
- Early access to new features
- Speech-to-Text bolt-ons: Translation, Summaries, Chapters, Sentiment, Topics, Early access to new capabilities
- Speech-to-Text deployment options: Private Cloud, Container, Virtual Appliance, On-Device, GPU & CPU based models, Multi-region cloud US, EU or Australia
- Text-to-Speech deployment options: Private Cloud, Container, Virtual Appliance, On-Device, GPU & CPU based models, Multi-region cloud US, EU or Australia
- Customer community
Details
Pricing Tier
FreemiumCategories
Developer ToolsData Analysis
Target Audience
General PublicStartups
Integrations
Cookiebot+4 more
Sponsor
Ad space