Speechmatics

Speechmatics is a speech intelligence company that provides Automatic Speech Recognition (ASR), Speech-to-Text (STT), Text-to-Speech (TTS), and voice AI infrastructure. Its technology lets organizations transcribe, translate, summarize, and analyze voice data with high accuracy across multiple languages and accents.

Pricing

Free
$/mo
  • 480 minutes free Speech-to-Text per month
  • 1 million characters (~20hrs) free Text-to-Speech per month
  • Speech-to-Text: 55+ languages
  • Speech-to-Text: Standard and Enhanced accuracy
  • Speech-to-Text: Industry-leading accent coverage
  • Speech-to-Text: Real-time latency <1s
  • Speech-to-Text: Language identification
  • Speech-to-Text: Speaker diarization
  • Speech-to-Text: Custom dictionary
  • Speech-to-Text: Precise timestamps
  • Speech-to-Text: Advanced punctuation and casing
  • Speech-to-Text: Numeral formatting
  • Speech-to-Text: Profanity and disfluency detection
  • Speech-to-Text: Multi-channel support
  • Speech-to-Text: Subtitle formatting options
  • Speech-to-Text: Audio events
  • Text-to-Speech: Low-latency (ideal for voice agents)
  • Text-to-Speech: English (more languages coming soon)
  • SaaS deployment
  • Real-time session concurrency: 2 sessions
  • Batch job creation: 1 job per second
  • Voice Agent conversation concurrency: 3 conversations
Pro
$/mo
  • 480 minutes free Speech-to-Text per month
  • 1 million characters (~20hrs) free Text-to-Speech per month
  • Speech-to-Text: 55+ languages
  • Speech-to-Text: Standard and Enhanced accuracy
  • Speech-to-Text: Industry-leading accent coverage
  • Speech-to-Text: Real-time latency <1s
  • Speech-to-Text: Language identification
  • Speech-to-Text: Speaker diarization
  • Speech-to-Text: Custom dictionary
  • Speech-to-Text: Precise timestamps
  • Speech-to-Text: Advanced punctuation and casing
  • Speech-to-Text: Numeral formatting
  • Speech-to-Text: Profanity and disfluency detection
  • Speech-to-Text: Multi-channel support
  • Speech-to-Text: Subtitle formatting options
  • Speech-to-Text: Audio events
  • Text-to-Speech: Low-latency (ideal for voice agents)
  • Text-to-Speech: English (more languages coming soon)
  • SaaS deployment
  • Real-time session concurrency: 50 sessions
  • Batch job creation: 10 jobs per second
  • Voice Agent conversation concurrency: 6 conversations
  • Online email support
  • 20% discount over 500 hr/month
Enterprise
$/mo
  • Includes all features from other plans, including audio alignment
  • No rate limits
  • Privacy-first deployment options
  • Multi-region cloud options
  • Custom models
  • SaaS or On-premises deployment
  • Lowest-latency, highest privacy with STT & TTS in your environment
  • Highest concurrency
  • Custom voice development
  • Custom language development
  • Volume discounts available
  • Prioritized service and support
  • Early access to new features
  • Speech-to-Text bolt-ons: Translation, Summaries, Chapters, Sentiment, Topics, Early access to new capabilities
  • Speech-to-Text deployment options: Private Cloud, Container, Virtual Appliance, On-Device, GPU & CPU based models, Multi-region cloud US, EU or Australia
  • Text-to-Speech deployment options: Private Cloud, Container, Virtual Appliance, On-Device, GPU & CPU based models, Multi-region cloud US, EU or Australia
  • Customer community
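
The Free and Pro rate limits listed above (real-time session concurrency, batch job creation rate, and Voice Agent conversation concurrency) can be respected on the client side with simple pacing. The following is a minimal sketch, not part of any Speechmatics SDK: `PlanLimits`, `PLAN_LIMITS`, `min_submit_interval`, and `submit_paced` are hypothetical names, and `submit` is whatever function the caller uses to create a batch job.

```python
import time
from dataclasses import dataclass


@dataclass(frozen=True)
class PlanLimits:
    realtime_sessions: int      # concurrent real-time sessions
    batch_jobs_per_second: int  # batch job creation rate
    agent_conversations: int    # concurrent Voice Agent conversations


# Values copied from the Free and Pro plan lists in this listing.
PLAN_LIMITS = {
    "free": PlanLimits(2, 1, 3),
    "pro": PlanLimits(50, 10, 6),
}


def min_submit_interval(plan: str) -> float:
    """Smallest gap, in seconds, between batch job submissions for a plan."""
    return 1.0 / PLAN_LIMITS[plan].batch_jobs_per_second


def submit_paced(jobs, submit, plan="free", sleep=time.sleep, clock=time.monotonic):
    """Call submit(job) for each job, never faster than the plan's creation rate.

    `submit` is a caller-supplied callable (an assumption of this sketch).
    """
    interval = min_submit_interval(plan)
    next_allowed = clock()
    results = []
    for job in jobs:
        now = clock()
        if now < next_allowed:
            # Too early for this plan's rate: wait until the next slot opens.
            sleep(next_allowed - now)
        results.append(submit(job))
        next_allowed = max(now, next_allowed) + interval
    return results
```

On the Free plan this spaces submissions one second apart; on Pro the gap drops to a tenth of a second. Enterprise is omitted because the listing states it has no rate limits.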

Details

Pricing Tier

Freemium

Categories

Developer Tools, Data Analysis

Target Audience

General Public, Startups

Integrations

Cookiebot (+4 more)