Platforms
AI Voice Agent Platforms
Every voice-agent platform we have profiled, in one place. It spans turn-key agents that run the whole call, the voice engines and speech-to-text that power them, the developer frameworks you build one from, and the enterprise suites. Each carries sourced, dated pricing.
Some links here are affiliate links, we may earn a commission. How this works.
Full platforms 9
Turn-key agents that run the whole call: speech in, a reply, voice out, over a phone line.
Run big outbound calling campaigns on one flat per-minute rate, best if you have engineers to set it up.
Cloud call-centre software with deep calling and CRM features, now adding AI voice agents as a per-minute add-on.
An AI 'employee' builder that also answers the phone, easy for small teams, though voice is one job among many.
A done-for-you AI receptionist for small trades and clinics, with texting and booking built in, priced by minute.
Get a working phone agent live in days, not months, with the call handling built in and the AI brain your choice.
Build a phone receptionist by dragging boxes together, no code needed, though the bill grows with your call volume.
Get the voice agent and the phone network it runs on from one provider, useful when you want billing in one place.
Build a phone agent exactly the way you want it, choosing each part yourself, if you have a developer to hand.
The best canvas for designing chat and voice agents, but a builder you wire up, not a turnkey phone product.
Voice engines 5
Text-to-speech: the voice itself, for narration or as the voice inside an agent.
The speed specialist whose fast, natural speech is what keeps a live phone agent feeling real rather than laggy.
The most natural-sounding AI voice we have heard, whether you are voicing a video or putting it on a live phone line.
A voice that picks up how the caller is feeling and answers in kind, for warmer and more human conversations.
Studio-quality voiceover for your video, course or advert, no microphone needed. Built for narration, not live calls.
Enterprise text-to-speech built for high-stakes phone calls, where a mispronounced name loses the customer.
Speech-to-text 3
Transcription engines, the ears of an agent: accurate, fast and cheap.
Accurate streaming speech-to-text with built-in audio intelligence, for teams who want the listening half done well.
Fast, accurate speech-to-text to power high-volume voice apps, for teams happy to build on a developer API.
Enterprise speech-to-text with very broad language coverage and real on-prem options, for teams who self-host.
Developer frameworks 3
The wiring you assemble in code, powerful and cheap if you have engineers.
The open-source real-time stack that carries voice-agent audio, plus a framework to wire your own STT, LLM and voice.
OpenAI's speech-to-speech model and API for building your own voice agent, billed by audio tokens, not by the minute.
Open-source Python framework where you pick every voice-agent part, free to self-host, with Daily's cloud for scaling.
Enterprise platforms 3
Custom-priced, sales-led suites built for large contact centres.
Enterprise omnichannel AI for voice and chat, used by airlines and carmakers, now owned by NICE. Quote-only, sales-led.
Enterprise voice assistants for big contact centres, with brand-name customers and quote-only pricing.
Enterprise AI agents for customer support, voice and chat, billed per resolved conversation rather than per minute.
Common questions
What is an AI voice agent platform?
How many AI voice agent platforms are there?
How do I choose between them?
New to the categories? What each kind means. Or jump to the rankings, the price index or the cost calculator.