Introduction to AI Voice Agent
Edesy AI Voice Agent is a real-time conversational AI platform for building, deploying, and scaling voice agents for customer service, sales automation, appointment booking, and similar phone-based use cases.
Why Edesy Voice Agent?
| Feature | Edesy | Competitors |
|---|---|---|
| End-to-end Latency | ~500ms (~300ms with native audio) | 700-1000ms |
| Indian Telephony | Native Exotel and Alohaa support | Limited |
| Indian Languages | Hindi, Tamil, Telugu, Assamese | Limited |
| Self-hosted Option | Yes | No |
| Pricing | Flexible | Per-minute |
Key Features
Multi-Provider Architecture
Switch between providers without code changes:
- Telephony: Twilio, Exotel, Plivo, Telnyx, Alohaa
- STT: Deepgram, Google Chirp, Azure, ElevenLabs, AssemblyAI
- TTS: Cartesia, ElevenLabs, Google, Azure, OpenAI
- LLM: OpenAI GPT-4o, Gemini 2.0/2.5, Claude, Azure OpenAI
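One way to read "without code changes" is that the provider choice lives in configuration and not in the call-handling code. The Python sketch below illustrates that idea only; the `AgentConfig` class, its field names, the lowercase provider identifiers, and the `SUPPORTED` table are hypothetical and are not Edesy's actual configuration schema. Only the provider names themselves come from the list above.

```python
from dataclasses import dataclass, replace

# Provider names copied from the list above; the identifiers and the
# structure of this table are hypothetical.
SUPPORTED = {
    "telephony": {"twilio", "exotel", "plivo", "telnyx", "alohaa"},
    "stt": {"deepgram", "google-chirp", "azure", "elevenlabs", "assemblyai"},
    "tts": {"cartesia", "elevenlabs", "google", "azure", "openai"},
    "llm": {"openai", "gemini", "claude", "azure-openai"},
}


@dataclass(frozen=True)
class AgentConfig:
    """Hypothetical per-agent provider selection (not Edesy's real schema)."""

    telephony: str
    stt: str
    tts: str
    llm: str

    def __post_init__(self):
        # Reject unknown providers early, before any call is placed.
        for category, allowed in SUPPORTED.items():
            value = getattr(self, category)
            if value not in allowed:
                raise ValueError(f"unsupported {category} provider: {value!r}")


# Switching providers is a configuration change; pipeline code is untouched.
base = AgentConfig(telephony="twilio", stt="deepgram", tts="cartesia", llm="openai")
india = replace(base, telephony="exotel")
print(india)
```

Validating the configuration at construction time means a misspelled provider name fails when the agent is defined, not in the middle of a live call.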
Low Latency Pipeline
Our frame-based pipeline architecture keeps end-to-end response time at around 500ms. Approximate latency per stage:
User Speech → VAD Detection → STT (streaming) → LLM → TTS (streaming) → Audio Output
     ↓              ↓                ↓           ↓           ↓               ↓
   ~50ms         ~100ms           ~150ms      ~200ms      ~100ms           ~50ms
Total: ~500ms
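The per-stage figures above can be read as a latency budget. The short Python sketch below adds them up; the dictionary keys are only labels taken from the diagram. Note that the plain sum comes to 650ms, which is higher than the quoted total of ~500ms. One plausible reading, which is an assumption here and not something the figures confirm, is that the streaming stages overlap, so later stages start before earlier ones have finished.

```python
# Approximate per-stage latencies in milliseconds, copied from the diagram.
STAGE_MS = {
    "user_speech": 50,
    "vad_detection": 100,
    "stt_streaming": 150,
    "llm": 200,
    "tts_streaming": 100,
    "audio_output": 50,
}

QUOTED_TOTAL_MS = 500  # "Total: ~500ms" from the text

# Latency if every stage ran strictly one after another.
sequential_ms = sum(STAGE_MS.values())

# Overlap that would be needed for the quoted total to hold (assumption).
implied_overlap_ms = sequential_ms - QUOTED_TOTAL_MS

# The stage that dominates the budget is the first place to optimize.
slowest = max(STAGE_MS, key=STAGE_MS.get)

print(f"sequential sum: {sequential_ms}ms")
print(f"quoted total:   {QUOTED_TOTAL_MS}ms")
print(f"overlap needed: {implied_overlap_ms}ms")
print(f"largest stage:  {slowest} ({STAGE_MS[slowest]}ms)")
```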
Native Audio Support
With Gemini Live 2.0/2.5, bypass STT and TTS entirely for even lower latency:
User Speech → Gemini Live (Audio-to-Audio) → Audio Output
Total: ~300ms
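The choice between the two paths can be sketched as a simple mode switch. In the Python example below, the `PipelineMode` enum, the `"gemini-live"` identifier, and both helper functions are hypothetical names used for illustration; the stage lists and the approximate totals follow the two diagrams above.

```python
from enum import Enum


class PipelineMode(Enum):
    CASCADED = "stt -> llm -> tts"
    NATIVE_AUDIO = "audio-to-audio"


# Approximate totals quoted in the text, in milliseconds.
QUOTED_LATENCY_MS = {
    PipelineMode.CASCADED: 500,
    PipelineMode.NATIVE_AUDIO: 300,
}


def choose_mode(llm: str) -> PipelineMode:
    """Use the native path only for a model that takes and emits audio.

    The "gemini-live" identifier is a placeholder, not a real setting name.
    """
    if llm == "gemini-live":
        return PipelineMode.NATIVE_AUDIO
    return PipelineMode.CASCADED


def stages(mode: PipelineMode) -> list:
    """List the processing stages between user speech and audio output."""
    if mode is PipelineMode.NATIVE_AUDIO:
        return ["gemini_live"]  # STT and TTS are bypassed entirely
    return ["vad", "stt", "llm", "tts"]


for model in ("openai", "gemini-live"):
    mode = choose_mode(model)
    print(model, stages(mode), f"~{QUOTED_LATENCY_MS[mode]}ms")
```

Based on the quoted totals, the native audio path saves roughly 200ms per turn compared with the cascaded pipeline.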
Use Cases
- Customer Support: 24/7 automated support with human-like conversations
- Sales Outreach: Automated outbound calling campaigns
- Appointment Booking: Voice-enabled scheduling and reminders
- Order Status: Real-time order tracking via voice
- Lead Qualification: Automated lead screening and qualification
Getting Started
- Quick Start Guide: Deploy your first voice agent in 10 minutes
- Architecture Overview: Understand how the system works
- Telephony Setup: Configure your phone provider
Platform Components
┌─────────────────────────────────────────────────────────┐
│                     Edesy Platform                      │
├─────────────────────────────────────────────────────────┤
│  Dashboard (Next.js)                                    │
│  ├── Agent Management                                   │
│  ├── Call Analytics                                     │
│  └── Provider Configuration                             │
├─────────────────────────────────────────────────────────┤
│  Voice Engine (Go)                                      │
│  ├── WebSocket Handler                                  │
│  ├── Frame Pipeline (STT → LLM → TTS)                   │
│  ├── VAD (Silero)                                       │
│  └── Interruption Handler                               │
├─────────────────────────────────────────────────────────┤
│  Integrations                                           │
│  ├── Telephony (Twilio, Exotel, Plivo)                  │
│  ├── STT (Deepgram, Google, Azure)                      │
│  ├── TTS (Cartesia, ElevenLabs, Google)                 │
│  └── LLM (OpenAI, Gemini, Claude)                       │
└─────────────────────────────────────────────────────────┘
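To make the Frame Pipeline and Interruption Handler entries in the diagram above more concrete, here is a minimal sketch of how frames might flow through chained stages and how an interruption (barge-in) could discard audio that has not been played yet. It is written in Python for brevity even though the Voice Engine itself is written in Go, and everything in it is hypothetical: the `Frame` type, the frame kinds, and the stage functions are illustrative stand-ins, not Edesy's internal API. Real stages would call the configured providers and run concurrently.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass
class Frame:
    kind: str      # hypothetical kinds: user_audio, transcript, reply, agent_audio, interrupt
    payload: str


Stage = Callable[[Iterator[Frame]], Iterator[Frame]]


def make_stage(src_kind: str, dst_kind: str, label: str) -> Stage:
    """Build a stage that converts one frame kind and forwards the rest."""
    def stage(frames: Iterator[Frame]) -> Iterator[Frame]:
        for frame in frames:
            if frame.kind == src_kind:
                yield Frame(dst_kind, f"{label}[{frame.payload}]")
            else:
                yield frame  # control frames such as "interrupt" pass through
    return stage


stt = make_stage("user_audio", "transcript", "text")
llm = make_stage("transcript", "reply", "reply")
tts = make_stage("reply", "agent_audio", "audio")


def run_pipeline(source: Iterable[Frame], stages: List[Stage]) -> Iterator[Frame]:
    """Chain the stages lazily so each frame streams through one at a time."""
    stream = iter(source)
    for stage in stages:
        stream = stage(stream)
    return stream


def playback_queue(frames: Iterator[Frame]) -> List[str]:
    """Return the agent audio still queued once the stream ends."""
    queue: List[str] = []
    for frame in frames:
        if frame.kind == "interrupt":
            queue.clear()  # barge-in: drop audio that has not been played
        elif frame.kind == "agent_audio":
            queue.append(frame.payload)
    return queue


# The caller speaks, interrupts the pending answer, then speaks again.
call = [Frame("user_audio", "hello"), Frame("interrupt", ""), Frame("user_audio", "wait")]
remaining = playback_queue(run_pipeline(call, [stt, llm, tts]))
print(remaining)
```

Because the interrupt frame travels through the same chain as the audio, every stage sees it in order, and only the reply to the caller's latest utterance remains queued.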
Next Steps
Ready to build your first voice agent? Start with our Quick Start Guide.