Why pay ₹40,000 when Vapi/Retell have built-in RAG?

Platform RAG features are usually 'upload docs, get answers' with no tuning. That works for demos. In production, retrieval accuracy drops dramatically and the agent starts giving wrong answers. Our setup includes the tuning work — chunk size, embedding choice, retrieval strategy, re-ranking — that platform docs don't tell you to do.

How is Advanced RAG different from Quick?

Quick: single embedding model, vector search only, single language. Good for small corpora (≤100 docs) and English-only. Advanced: hybrid retrieval (vector + keyword + reranker), multilingual embeddings, structured data integration, quality monitoring dashboard. Required for 1000+ docs or non-English.

What document formats can you ingest?

PDF (including scanned), HTML, Markdown, DOCX, Notion exports, Confluence pages, Google Docs, plain text, CSV (for structured data). We can also ingest from databases via SQL queries.

Can it answer in Hindi if my docs are in English?

Yes — multilingual embeddings handle cross-language retrieval. Customer asks in Hindi → relevant English document chunks retrieved → voice agent translates and answers in Hindi. Standard in Advanced package; add-on for Quick.

How do we update the knowledge base?

We set up a document update workflow: you push new/changed docs to a designated folder/endpoint, the system re-ingests and re-embeds within hours. CRON-based for predictable schedules; webhook-based for real-time updates.

What's the accuracy improvement vs platform-native RAG?

Typical improvement: retrieval precision @ top-3 from 50–60% (platform default) to 80–90% (our tuning). The exact lift depends on corpus complexity. We benchmark before/after during the build.

Does this work with Vapi / Retell / Bolna / Bland?

Yes — and Edesy, Synthflow, Voiceflow. The RAG endpoint can be hosted on your infra (so the voice agent platform is interchangeable) or on the platform's native store (locked to that platform but simpler ops).

How big can the knowledge base be?

Quick: up to 100 documents or 500 pages. Advanced: up to 1000 documents or 5000 pages. Beyond that, it's a Custom Project — usually for enterprise BFSI/legal/healthcare with massive doc corpora.

How do you handle confidential/PII documents?

PII can be redacted during ingestion (Aadhaar, PAN, account numbers, names — patterns you specify). Documents stay in your account/infra. For BFSI/healthcare, we sign DPA before access.

What about citation accuracy for compliance?

Every retrieved chunk is tagged with source document + page/section. Voice agent can be configured to say 'According to your refund policy, section 3.2...' — full traceability for compliance audit.

Can we share the maintenance retainer with other services?

Yes. If you're already on a maintenance retainer for CRM Integration or Voice Agent Build, RAG maintenance can be bundled into one combined retainer at a small discount. Quote during discovery.

Voice agent configuration service

Production-Grade RAG Setup for FAQ-Style Voice Agents

Most platforms have basic RAG built in. Tuning it for accurate retrieval is the hard part. We do document ingestion, embedding selection, retrieval tuning, citation tracking, and multilingual support — so your voice agent answers questions reliably.

Get a Quote from ₹40,000

Book a 30-min scoping call

From ₹40,000

Flat one-time price

1–2 weeks

Standard delivery

1000+ docs

Advanced tier capacity

Multilingual

Hindi + regional

Knowledge base setup pricing

Two packages depending on document volume and retrieval complexity. 40% deposit, 60% on delivery.

Quick RAG Setup

₹40,000one-time+ ₹2,500/month maintenance

Standard knowledge base for FAQ-style voice agents

Up to 100 documents or 500 pages ingested
Standard chunk size + embedding (OpenAI ada-002 or Cohere multilingual)
Vector store setup (Pinecone, Weaviate, Qdrant, or platform-native)
Retrieval tuning for top-3/top-5 relevance
Citation tracking (agent quotes source document)
Source document update workflow (push new docs, re-index)
Benchmark queries: 25 test questions verified
1 round of revisions
14 days bug-fix warranty
Delivery: 1 week

What we set up

Production RAG isn't just upload-and-go. Every piece needs tuning for your specific corpus.

Document ingestion pipeline

PDFs, HTML pages, Notion exports, Confluence pages, custom databases. We handle parsing, OCR for scanned PDFs, table extraction, metadata preservation.

Chunk strategy + size tuning

Default chunk size rarely works. We test 256/512/1024 token chunks against your real queries to find what gives best retrieval recall + precision.

Embedding model selection

OpenAI ada-002, Cohere multilingual v3, Voyage AI, BGE — different embeddings work better for different corpora. We benchmark + choose.

Vector store deployment

Pinecone, Weaviate, Qdrant, pgvector, or your voice agent platform's native store. We deploy + configure for your scale.

Hybrid retrieval (Advanced)

Vector search alone misses keyword-exact queries. We combine vector + BM25 + metadata filtering for higher recall on production queries.

Re-ranking with cross-encoder (Advanced)

Top-10 vector results re-ranked by Cohere Rerank or Voyage Rerank. Pushes top-3 precision from 60% to 85%+ typically.

Citation tracking

Voice agent says 'According to your refund policy...' and the source document is logged for compliance + transparency.

Multilingual support (Advanced)

Documents in English, knowledge accessed in Hindi/regional via multilingual embeddings. Critical for Indian customer bases.

Quality monitoring (Advanced)

Retrieval hit rate, hallucination flags, low-confidence queries logged for review. Dashboard for ongoing tuning.

Common RAG problems we fix

Most teams build basic RAG in a week, then discover problems in production:

Voice agent gives confidently wrong answers (hallucinations from irrelevant chunks)
Customer asks in Hindi, knowledge is in English — generic embeddings fail
Tables and structured data lost during ingestion (PDF tables → meaningless text)
Top-3 retrieval returns the same document section 3 times (diversity problem)
Knowledge base updated, but agent still cites old answers (stale embeddings)
Citation says 'document.pdf page 42' but customer can't access the source
Long-form documents chunked mid-sentence, breaking context

Use cases that benefit most from RAG

Customer support FAQ bots (refund policies, shipping, warranty)
Healthcare patient education (procedure prep, medication info)
EdTech course inquiries (curriculum, fees, scholarships)
BFSI policy info (loan eligibility, KYC requirements, terms)
Real estate project details (amenities, pricing, RERA info)
E-commerce product info (specs, compatibility, returns)
Internal IT helpdesk (password resets, software access, policies)

How the RAG setup works

Predictable 5-step process from kickoff to deployment

Discovery call

30-min call: document sources, languages, use case, expected query volume. We get sample documents + query examples.

Fixed-price quote

Scope document with exact deliverables, document count, retrieval architecture, and price. Two days to deliver.

40% deposit, build begins

Within 2 business days. We get access to source documents and your voice agent platform.

Ingest + tune + benchmark

Documents chunked + embedded, vector store deployed, retrieval tuned against benchmark queries, voice agent wired to RAG endpoint.

Deploy + handoff

Production deployed, monitoring set up, final demo + documentation, 60% balance, maintenance retainer starts.

Knowledge Base / RAG FAQ

Related services

Voice Agent Build

Need the voice agent itself? We build production agents on every major platform.

Voice Agent Audit

Already have an agent? Start with an audit before adding RAG.

CRM Integration

Sync voice agent + knowledge base events to your CRM.

Campaign Management

We run RAG-powered agents end-to-end as a managed service.

WhatsApp + Voice

Send sourced answers via WhatsApp after the call.

Edesy Voice Agent

Our own platform — native knowledge base support included.

Ready to set up your knowledge base?

Book a 30-min scoping call. We'll send a fixed-price quote within one business day.

Get a Quote Talk to Sales