AI Voice Agents for Business Automation: Building Conversational Phone Assistants with n8n, Twilio & ElevenLabs

A comprehensive guide to building AI-powered voice agents for business automation. Learn how to create conversational phone assistants using n8n, Twilio, and ElevenLabs that handle inbound calls, schedule appointments, and integrate with your existing business systems.

The way businesses handle phone communication is undergoing a revolutionary transformation. In 2026, AI voice agents have moved from experimental technology to essential business infrastructure. Forward-thinking companies are deploying conversational AI assistants that answer calls 24/7, schedule appointments, qualify leads, and integrate seamlessly with existing CRM and booking systems—all without human intervention until truly needed.

This comprehensive guide will walk you through building sophisticated AI voice agents using n8n as the automation orchestrator, Twilio for telephony infrastructure, and ElevenLabs for natural-sounding text-to-speech. Whether you're looking to automate customer support, handle appointment scheduling, or create a virtual receptionist, you'll learn how to build production-ready voice AI workflows that deliver measurable business value.

Why AI Voice Agents Are Essential in 2026

The Business Case for Voice Automation

Market Growth and Adoption:

The conversational AI market is experiencing explosive growth with a projected 23.7% CAGR, with North America holding over 26% market share. Research from RingCentral's Agentic AI study reveals that business leaders expect their preference for interacting with AI agents via voice to grow from 14% today to 23% within two years—a significant shift away from text-based chatbots.

Quantifiable Benefits:

Companies implementing AI voice agents are reporting transformative results:

Metric | Improvement | Source
------ | ----------- | ------
Cost Reduction | Up to 68% | Post-implementation studies
Ticket Capacity | 3× increase | Support team scaling
Response Time | 24/7 availability | Zero wait times
Lead Qualification | 40% faster | Sales pipeline efficiency
Customer Satisfaction | 85%+ CSAT scores | Consistent service quality

The Human Element Challenge:

Traditional automated phone systems (IVR menus) frustrate customers with rigid options and endless button-pressing. Modern AI voice agents powered by large language models (LLMs) offer:

  • Natural conversation: Understanding context, not just keywords
  • Flexible responses: Adapting to caller needs dynamically
  • Emotional intelligence: Recognizing sentiment and urgency
  • Seamless handoffs: Transferring to humans with full context

Use Cases Across Industries

Healthcare:

  • Appointment scheduling and reminders
  • Prescription refill requests
  • Initial symptom triage
  • Insurance verification

Real Estate:

  • Property inquiry handling
  • Viewing scheduling
  • Lead qualification
  • Follow-up calls

Professional Services:

  • Client intake and onboarding
  • Meeting scheduling
  • Document collection
  • Payment processing

E-commerce:

  • Order status inquiries
  • Return and exchange handling
  • Upselling and cross-selling
  • Abandoned cart recovery

Hospitality:

  • Reservation management
  • Room service requests
  • Local recommendations
  • Concierge services

Understanding the Technology Stack

n8n: The Automation Orchestrator

n8n serves as the brain of your voice agent, handling:

Workflow Logic:

  • Call flow management and routing
  • Data processing and transformation
  • Integration with external systems
  • Conditional logic and decision trees

AI Agent Capabilities (n8n 2.0+):

  • Native LangChain integration
  • LLM-powered decision making
  • Persistent agent memory
  • Sandboxed code execution
  • Data sovereignty controls

Key Advantages:

  • Self-hosted option: Complete data control for sensitive industries
  • 400+ integrations: Native connectors for CRMs, calendars, databases
  • Visual workflow builder: No-code creation of complex logic
  • Fair-code license: No usage limits or per-transaction costs

Twilio: The Telephony Infrastructure

Twilio provides the telephony layer that bridges traditional phone systems with modern AI:

Programmable Voice:

  • Phone number provisioning in 100+ countries
  • Inbound call handling and routing
  • Outbound call initiation
  • Call recording and transcription
  • Conference calling capabilities

TwiML (Twilio Markup Language): Twilio uses XML-based instructions to control call behavior:

<!-- Example TwiML response -->
<Response>
  <Say>Hello, thank you for calling. Connecting you to our AI assistant.</Say>
  <Connect>
    <Stream url="wss://your-server.com/media-stream" />
  </Connect>
</Response>

Real-Time Media Streams: WebSocket connections enable streaming audio between callers and your AI in real-time with sub-second latency.
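Over that WebSocket, Twilio sends JSON messages whose shapes are defined in the Media Streams documentation: a `connected` handshake, a `start` message with stream metadata, a steady flow of `media` frames carrying base64-encoded 8 kHz µ-law audio, and a final `stop`. A minimal classifier sketch:

```javascript
// Sketch: classifying Twilio Media Stream messages.
// Audio arrives as base64-encoded 8 kHz mu-law in msg.media.payload.
function handleStreamMessage(raw) {
  const msg = JSON.parse(raw);
  switch (msg.event) {
    case 'connected': // first message after the WebSocket opens
      return { type: 'connected' };
    case 'start':     // stream metadata: streamSid, callSid, mediaFormat
      return { type: 'start', streamSid: msg.start.streamSid };
    case 'media':     // one short audio frame from the caller
      return {
        type: 'audio',
        bytes: Buffer.from(msg.media.payload, 'base64').length
      };
    case 'stop':      // caller hung up or the <Stream> was ended
      return { type: 'stop' };
    default:
      return { type: 'unknown' };
  }
}

// Example frame shaped like the ones Twilio sends:
const frame = JSON.stringify({
  event: 'media',
  streamSid: 'MZxxxx',
  media: { payload: Buffer.from([0xff, 0xff, 0xff, 0xff]).toString('base64') }
});
console.log(handleStreamMessage(frame)); // { type: 'audio', bytes: 4 }
```

The full AI workflow later in this guide builds on exactly this event loop.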

ElevenLabs: Natural Voice Synthesis

ElevenLabs has emerged as the leading platform for AI voice generation:

Voice Capabilities:

  • 5,000+ voices across 70+ languages
  • Voice cloning: Create custom voices from samples
  • Emotional expression: Adjust tone, pace, and emphasis
  • Low latency: Streaming synthesis for real-time conversation

Conversational AI Features:

  • Interruption handling: Respond to barge-ins naturally
  • Turn-taking: Properly manage conversation flow
  • Context awareness: Maintain consistent voice characteristics

Business Applications:

  • Brand voice consistency: Custom-trained voices matching your brand
  • Digital twins: Cloning founder or team member voices
  • Multilingual support: Natural accent handling across languages

Architecture: How It All Works Together

High-Level System Design

┌─────────────────────────────────────────────────────────────────┐
│                         Caller                                  │
└─────────────────────┬───────────────────────────────────────────┘
                      │ Phone Call
                      ▼
┌─────────────────────────────────────────────────────────────────┐
│                        Twilio                                   │
│  ┌─────────────────────────────────────────────────────────┐   │
│  │  Phone Number → TwiML Webhook → Media Stream (WebSocket) │   │
│  └─────────────────────────────────────────────────────────┘   │
└─────────────────────┬───────────────────────────────────────────┘
                      │ WebSocket
                      ▼
┌─────────────────────────────────────────────────────────────────┐
│                     n8n Workflow                                │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────────┐  │
│  │  WebSocket   │───▶│  AI Agent    │───▶│  ElevenLabs TTS  │  │
│  │   Server     │◄───│  Processing  │◄───│   (Voice Out)    │  │
│  └──────────────┘    └──────────────┘    └──────────────────┘  │
│         │                   │                       │          │
│         ▼                   ▼                       ▼          │
│  ┌─────────────────────────────────────────────────────────┐   │
│  │              Business System Integrations                │   │
│  │  CRM | Calendar | Database | Payment | Notifications    │   │
│  └─────────────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────────────┘

Call Flow Sequence

1. Caller dials phone number
         ↓
2. Twilio receives call, triggers webhook to n8n
         ↓
3. n8n workflow initiates AI agent session
         ↓
4. ElevenLabs generates greeting ("Hello, how can I help?")
         ↓
5. Twilio streams greeting to caller
         ↓
6. Caller speaks → Audio streamed via WebSocket to n8n
         ↓
7. n8n transcribes speech (Deepgram/Whisper)
         ↓
8. Transcript sent to LLM (OpenAI/Claude/Ollama)
         ↓
9. LLM generates response
         ↓
10. ElevenLabs synthesizes response audio
         ↓
11. Audio streamed back to caller
         ↓
12. Repeat 6-11 until call ends or transfer requested
         ↓
13. n8n logs call, updates CRM, sends notifications

Alternative: ElevenLabs Conversational AI

ElevenLabs now offers a native Conversational AI platform that can be integrated with n8n:

┌─────────────────────────────────────────────────────────────────┐
│              ElevenLabs Conversational AI                       │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐      │
│  │   Speech     │───▶│   LLM with   │───▶│    Voice     │      │
│  │  Recognition │    │  Tool Calls  │    │  Synthesis   │      │
│  └──────────────┘    └──────┬───────┘    └──────────────┘      │
│                             │                                  │
│                             ▼                                  │
│                    ┌────────────────┐                          │
│                    │  n8n Webhook   │◄── Tool execution        │
│                    │  (Tool Calls)  │                          │
│                    └───────┬────────┘                          │
│                            │                                   │
└────────────────────────────┼───────────────────────────────────┘
                             │ API/WebSocket
                             ▼
                    ┌────────────────┐
                    │ Business Logic │
                    │  CRM/Calendar  │
                    └────────────────┘

This architecture simplifies implementation by handling speech recognition, LLM processing, and voice synthesis within ElevenLabs, using n8n only for tool execution and business logic.

Implementation Guide

Phase 1: Infrastructure Setup

Step 1: Twilio Account Configuration

Sign Up and Setup:

  1. Create a Twilio account at https://www.twilio.com
  2. Verify your phone number
  3. Complete identity verification (required for production numbers)

Purchase a Phone Number:

# Using Twilio CLI
twilio phone-numbers:buy:local --country-code US --area-code 555

# Or via Console: Phone Numbers → Manage → Buy a number

Configure Webhook URL:

  1. Navigate to Phone Numbers → Manage → Active Numbers
  2. Click your number
  3. Configure "A Call Comes In" webhook:
    • Method: HTTP POST
    • URL: https://your-n8n-instance.com/webhook/twilio-voice

Get API Credentials:

  • Account SID: Found in Console Dashboard
  • Auth Token: Found in Console Dashboard (keep secure!)

Step 2: ElevenLabs Account Setup

Create Account:

  1. Sign up at https://elevenlabs.io
  2. Choose a plan (free tier available for testing)

Create a Voice Assistant:

  1. Navigate to "Conversational AI" → "Agents"
  2. Click "Create Agent"
  3. Configure:
    • Name: Your assistant name (e.g., "Acme Receptionist")
    • System prompt: Define personality and behavior
    • Voice: Select or clone a voice

System Prompt Example:

You are Sarah, the friendly receptionist at Acme Dental Practice. 
You handle appointment scheduling, answer questions about services, 
and provide general information. Be warm, professional, and efficient.

Key information:
- Office hours: Monday-Friday 9 AM - 6 PM
- Services: General dentistry, orthodontics, teeth whitening
- New patient consultations are free
- Emergency line available 24/7

Always confirm appointment details before ending calls.

Get API Key:

  1. Go to Profile → API Keys
  2. Create new API key with appropriate permissions
  3. Save the key securely (starts with sk_)

Step 3: n8n Installation

Docker Deployment (Recommended):

# docker-compose.yml
services:
  n8n:
    image: n8nio/n8n:latest
    restart: always
    ports:
      - "5678:5678"
    environment:
      - N8N_BASIC_AUTH_ACTIVE=true
      - N8N_BASIC_AUTH_USER=admin
      - N8N_BASIC_AUTH_PASSWORD=your_secure_password
      - GENERIC_TIMEZONE=UTC
      - WEBHOOK_URL=https://your-domain.com/
    volumes:
      - ./n8n-data:/home/node/.n8n
    networks:
      - automation-network

  # Optional: Add webhook tunnel for local development
  ngrok:
    image: ngrok/ngrok:latest
    command: http n8n:5678 --authtoken=${NGROK_AUTH_TOKEN}
    networks:
      - automation-network

networks:
  automation-network:
    driver: bridge

Deploy:

# Create directories
mkdir -p n8n-data

# Start services
docker-compose up -d

# Check logs
docker-compose logs -f n8n

Verify Installation:

  • Access n8n at http://localhost:5678
  • Complete setup wizard
  • Create your first workflow

Phase 2: Basic Voice Agent Workflow

Workflow 1: Simple Incoming Call Handler

This basic workflow answers calls and plays a pre-recorded message:

Step 1: Create Webhook Trigger

  1. In n8n, create a new workflow
  2. Add "Webhook" node:
    • Method: POST
    • Path: twilio-voice
    • Response Mode: Using "Respond to Webhook" node

Step 2: Generate TwiML Response

Add a "Code" node to build the TwiML:

// Node: Generate TwiML
const twiml = `<?xml version="1.0" encoding="UTF-8"?>
<Response>
  <Say voice="Polly.Joanna">
    Hello! Thank you for calling Tropical Media. 
    Our AI assistant is currently in development. 
    Please leave a message after the tone.
  </Say>
  <Record maxLength="120" />
</Response>`;

return [{
  json: {
    headers: {
      'Content-Type': 'text/xml'
    },
    statusCode: 200,
    body: twiml
  }
}];

Step 3: Configure Response

Add a "Respond to Webhook" node:

  • Status Code: 200
  • Content-Type: text/xml
  • Body: {{ $json.body }}

Test the Workflow:

  1. Save and activate the workflow
  2. Call your Twilio number
  3. You should hear the message

Workflow 2: AI-Powered Conversation Handler

This advanced workflow handles real-time AI conversations:

Architecture Overview:

[Twilio Call] → [Webhook: Initial Setup] → [WebSocket Server]
                                              ↓
[ElevenLabs] ← [n8n AI Processing] ← [Speech-to-Text]
     ↓              ↓
[Voice Output] → [Caller]

Step 1: Initial Call Webhook

Create a workflow triggered by Twilio incoming calls:

// Node: Parse Twilio Payload
const twilioData = $input.first().json.body;

return [{
  json: {
    callSid: twilioData.CallSid,
    from: twilioData.From,
    to: twilioData.To,
    callStatus: twilioData.CallStatus,
    direction: twilioData.Direction
  }
}];

Step 2: Generate Initial TwiML with Stream

// Node: Generate Stream TwiML
const callSid = $input.first().json.callSid;

const twiml = `<?xml version="1.0" encoding="UTF-8"?>
<Response>
  <Connect>
    <Stream url="wss://your-n8n-instance.com/ws/voice/${callSid}" />
  </Connect>
</Response>`;

return [{
  json: {
    headers: { 'Content-Type': 'text/xml' },
    statusCode: 200,
    body: twiml
  }
}];

Step 3: WebSocket Server for Real-Time Processing

This requires the n8n WebSocket node or custom code. Here's the conceptual flow:

// Node: WebSocket Handler (Execute Once for all items)
const WebSocket = require('ws');
const axios = require('axios');

// Configuration
const ELEVENLABS_API_KEY = $env.ELEVENLABS_API_KEY;
const ELEVENLABS_VOICE_ID = $env.ELEVENLABS_VOICE_ID;
const DEEPGRAM_API_KEY = $env.DEEPGRAM_API_KEY;
const OPENAI_API_KEY = $env.OPENAI_API_KEY;

// WebSocket server setup
const wss = new WebSocket.Server({ port: 8080 });

wss.on('connection', (ws, req) => {
  const callSid = req.url.split('/').pop();
  console.log(`Call connected: ${callSid}`);
  
  // Initialize conversation context
  let conversationHistory = [];
  let isProcessing = false;
  
  // Handle audio from Twilio
  ws.on('message', async (data) => {
    const msg = JSON.parse(data);
    
    switch(msg.event) {
      case 'connected':
        // Send greeting via ElevenLabs
        const greeting = await generateSpeech(
          "Hello! I'm your AI assistant. How can I help you today?"
        );
        ws.send(JSON.stringify({
          event: 'media',
          streamSid: msg.streamSid,
          media: { payload: greeting }
        }));
        break;
        
      case 'media':
        // Received audio from caller
        if (isProcessing) return;
        isProcessing = true;
        
        // Decode the base64 payload (Twilio streams 8 kHz mu-law audio)
        const audioBuffer = Buffer.from(msg.media.payload, 'base64');
        
        // Transcribe with Deepgram
        const transcript = await transcribeAudio(audioBuffer);
        
        if (transcript) {
          // Add to history
          conversationHistory.push({
            role: 'user',
            content: transcript
          });
          
          // Get AI response
          const aiResponse = await getAIResponse(conversationHistory);
          
          // Generate speech
          const audioResponse = await generateSpeech(aiResponse);
          
          // Send back to caller
          ws.send(JSON.stringify({
            event: 'media',
            streamSid: msg.streamSid,
            media: { payload: audioResponse }
          }));
          
          // Update history
          conversationHistory.push({
            role: 'assistant',
            content: aiResponse
          });
        }
        
        isProcessing = false;
        break;
        
      case 'stop':
        console.log(`Call ended: ${callSid}`);
        // Log conversation to CRM
        await logCall(callSid, conversationHistory);
        ws.close();
        break;
    }
  });
});

// Helper: Transcribe audio using Deepgram
async function transcribeAudio(audioBuffer) {
  const response = await axios.post(
    'https://api.deepgram.com/v1/listen?model=nova-2&smart_format=true&encoding=mulaw&sample_rate=8000',
    audioBuffer,
    {
      headers: {
        'Authorization': `Token ${DEEPGRAM_API_KEY}`,
        'Content-Type': 'audio/mulaw'
      }
    }
  );
  
  return response.data.results?.channels[0]?.alternatives[0]?.transcript;
}

// Helper: Get AI response
async function getAIResponse(history) {
  const response = await axios.post(
    'https://api.openai.com/v1/chat/completions',
    {
      model: 'gpt-4o',
      messages: [
        {
          role: 'system',
          content: 'You are a helpful assistant for a dental practice...'
        },
        ...history
      ],
      max_tokens: 150
    },
    {
      headers: {
        'Authorization': `Bearer ${OPENAI_API_KEY}`,
        'Content-Type': 'application/json'
      }
    }
  );
  
  return response.data.choices[0].message.content;
}

// Helper: Generate speech with ElevenLabs
async function generateSpeech(text) {
  const response = await axios.post(
    `https://api.elevenlabs.io/v1/text-to-speech/${ELEVENLABS_VOICE_ID}/stream?output_format=ulaw_8000`,
    {
      text: text,
      model_id: 'eleven_turbo_v2_5',
      voice_settings: {
        stability: 0.5,
        similarity_boost: 0.5
      }
    },
    {
      headers: {
        'xi-api-key': ELEVENLABS_API_KEY,
        'Content-Type': 'application/json'
      },
      responseType: 'arraybuffer'
    }
  );
  
  // With output_format=ulaw_8000 the response is already 8 kHz mu-law,
  // so it only needs base64 encoding for Twilio
  return Buffer.from(response.data).toString('base64');
}

// Helper: Log call to CRM
async function logCall(callSid, history) {
  await axios.post('https://your-crm.com/api/calls', {
    callSid,
    transcript: history,
    timestamp: new Date().toISOString()
  });
}
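A detail worth knowing about the audio path: Twilio's media frames are G.711 µ-law, one byte per 8 kHz sample. If you swap in an STT engine that expects linear PCM (Whisper, for example), you have to decode µ-law yourself. The standard G.711 decode, as a self-contained sketch:

```javascript
// G.711 mu-law -> 16-bit linear PCM decode (standard algorithm).
// Twilio media frames are 8 kHz mu-law, one byte per sample.
const MULAW_BIAS = 0x84; // 132, the bias added during mu-law encoding

function mulawByteToPcm(muByte) {
  const u = ~muByte & 0xff;            // mu-law bytes are stored complemented
  const sign = u & 0x80;
  const exponent = (u >> 4) & 0x07;
  const mantissa = u & 0x0f;
  const sample = (((mantissa << 3) + MULAW_BIAS) << exponent) - MULAW_BIAS;
  return sign ? -sample : sample;
}

function mulawBufferToPcm16(buf) {
  const pcm = new Int16Array(buf.length);
  for (let i = 0; i < buf.length; i++) pcm[i] = mulawByteToPcm(buf[i]);
  return pcm;
}

// 0xFF encodes silence; 0x80 is the largest positive sample
console.log(mulawByteToPcm(0xff)); // 0
console.log(mulawByteToPcm(0x80)); // 32124
```

Going the other direction (PCM from your TTS back to µ-law for Twilio) is the mirror of this; many APIs, including Deepgram and ElevenLabs, can also consume or emit µ-law directly so you can skip the conversion entirely.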

Note: For production use, consider using ElevenLabs' native Conversational AI with n8n tool integration instead of building the full WebSocket pipeline yourself.

Phase 3: ElevenLabs Conversational AI Integration

This simplified approach leverages ElevenLabs' built-in capabilities:

Step 1: Create ElevenLabs Agent

  1. In ElevenLabs dashboard, go to Conversational AI → Agents
  2. Click "Create Agent"
  3. Configure the agent:

System Prompt:

You are a helpful AI receptionist for a dental practice. Your name is Sarah.

Your capabilities:
- Schedule appointments for existing and new patients
- Answer questions about services and pricing
- Provide office hours and location information
- Handle emergencies by transferring to the on-call dentist

When scheduling appointments:
1. Collect patient's full name
2. Confirm phone number
3. Ask for preferred date and time
4. Ask for reason for visit (checkup, cleaning, etc.)
5. Confirm all details before booking

Always be warm, professional, and patient. If you don't know something, 
offer to have a human representative call them back.

Voice Settings:

  • Select a warm, professional voice
  • Adjust stability (0.5) and clarity (0.75)

LLM Configuration:

  • Model: GPT-4o or Claude 3.5 Sonnet
  • Temperature: 0.7
  • Max tokens: 300

Step 2: Configure Tool Calls

Set up tools that the AI can invoke via webhooks to n8n:

Tool 1: Check Availability

{
  "name": "check_availability",
  "description": "Check available appointment slots for a given date",
  "parameters": {
    "type": "object",
    "properties": {
      "date": {
        "type": "string",
        "format": "date",
        "description": "Date to check (YYYY-MM-DD)"
      },
      "appointment_type": {
        "type": "string",
        "enum": ["cleaning", "checkup", "consultation", "emergency"],
        "description": "Type of appointment"
      }
    },
    "required": ["date", "appointment_type"]
  }
}

Tool 2: Book Appointment

{
  "name": "book_appointment",
  "description": "Book an appointment for a patient",
  "parameters": {
    "type": "object",
    "properties": {
      "patient_name": {
        "type": "string",
        "description": "Patient's full name"
      },
      "phone": {
        "type": "string",
        "description": "Patient's phone number"
      },
      "date": {
        "type": "string",
        "format": "date"
      },
      "time": {
        "type": "string",
        "pattern": "^([0-1]?[0-9]|2[0-3]):[0-5][0-9]$"
      },
      "appointment_type": {
        "type": "string"
      },
      "notes": {
        "type": "string"
      }
    },
    "required": ["patient_name", "phone", "date", "time", "appointment_type"]
  }
}
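ElevenLabs passes these parameters through from the LLM, so treat them as untrusted input and validate them before touching your calendar. A minimal sketch of checks mirroring the schema above (date format, the same time pattern, required fields):

```javascript
// Minimal validation for the book_appointment tool payload.
// Mirrors the JSON schema above; run before booking, since
// LLM-produced arguments can be malformed or incomplete.
const TIME_RE = /^([0-1]?[0-9]|2[0-3]):[0-5][0-9]$/; // same pattern as the schema
const DATE_RE = /^\d{4}-\d{2}-\d{2}$/;
const REQUIRED = ['patient_name', 'phone', 'date', 'time', 'appointment_type'];

function validateBooking(params) {
  const errors = [];
  for (const field of REQUIRED) {
    if (!params[field]) errors.push(`missing ${field}`);
  }
  if (params.date && !DATE_RE.test(params.date)) errors.push('date must be YYYY-MM-DD');
  if (params.time && !TIME_RE.test(params.time)) errors.push('time must be HH:MM (24h)');
  return { valid: errors.length === 0, errors };
}

console.log(validateBooking({
  patient_name: 'Jane Doe', phone: '+15550001111',
  date: '2026-03-14', time: '14:30', appointment_type: 'cleaning'
})); // { valid: true, errors: [] }
```

When validation fails, return the error list as the tool result so the agent can ask the caller to repeat the detail rather than booking bad data.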

Tool 3: Send Summary

{
  "name": "send_summary",
  "description": "Send call summary to reception staff via Slack",
  "parameters": {
    "type": "object",
    "properties": {
      "summary": {
        "type": "string",
        "description": "Summary of the call"
      },
      "urgency": {
        "type": "string",
        "enum": ["low", "medium", "high"]
      }
    },
    "required": ["summary"]
  }
}

Step 3: Create n8n Tool Execution Workflow

Create a workflow that handles tool calls from ElevenLabs:

Webhook Trigger:

  • Method: POST
  • Path: elevenlabs-tools
  • Authentication: Header auth with ElevenLabs secret

Parse Tool Call:

// Node: Parse Tool Request
const body = $input.first().json.body;

return [{
  json: {
    toolName: body.tool_name,
    parameters: body.parameters,
    callId: body.call_id,
    conversationId: body.conversation_id
  }
}];

Route to Handler:

Use a Switch node based on toolName:

Case: check_availability

Connect to your calendar/CRM system:

// Node: Check Calendar
const { date, appointment_type } = $input.first().json.parameters;

// Query your booking system (example with Cal.com API)
const availability = await $httpRequest({
  method: 'GET',
  url: `https://api.cal.com/v1/availability`,
  qs: {
    apiKey: $env.CAL_API_KEY,
    dateFrom: date,
    dateTo: date,
    eventTypeId: getEventTypeId(appointment_type)
  }
});

// Format available slots
const slots = availability.slots.map(slot => 
  new Date(slot.time).toLocaleTimeString('en-US', { 
    hour: '2-digit', 
    minute: '2-digit',
    hour12: true 
  })
);

return [{
  json: {
    result: {
      available: slots.length > 0,
      slots: slots.slice(0, 5), // Return top 5 slots
      date: date
    }
  }
}];

function getEventTypeId(type) {
  const map = {
    'cleaning': 123,
    'checkup': 124,
    'consultation': 125,
    'emergency': 126
  };
  return map[type] || 124;
}

Case: book_appointment

// Node: Book Appointment
const params = $input.first().json.parameters;

// Create booking
const booking = await $httpRequest({
  method: 'POST',
  url: 'https://api.cal.com/v1/bookings',
  body: {
    apiKey: $env.CAL_API_KEY,
    eventTypeId: getEventTypeId(params.appointment_type),
    start: `${params.date}T${params.time}`,
    name: params.patient_name,
    email: `temp@${params.phone}.placeholder`, // Placeholder
    location: 'Phone consultation',
    metadata: {
      phone: params.phone,
      source: 'ai_voice_agent',
      notes: params.notes || ''
    }
  }
});

// Store in CRM
await $httpRequest({
  method: 'POST',
  url: 'https://your-crm.com/api/appointments',
  body: {
    patient_name: params.patient_name,
    phone: params.phone,
    date: params.date,
    time: params.time,
    type: params.appointment_type,
    booking_id: booking.id,
    source: 'voice_agent'
  },
  headers: {
    'Authorization': `Bearer ${$env.CRM_API_KEY}`
  }
});

// Send confirmation SMS
await $httpRequest({
  method: 'POST',
  url: `https://api.twilio.com/2010-04-01/Accounts/${$env.TWILIO_ACCOUNT_SID}/Messages.json`,
  body: {
    To: params.phone,
    From: $env.TWILIO_PHONE_NUMBER,
    Body: `Hi ${params.patient_name}, your appointment is confirmed for ${params.date} at ${params.time}. Reply CANCEL to reschedule.`
  },
  auth: {
    username: $env.TWILIO_ACCOUNT_SID,
    password: $env.TWILIO_AUTH_TOKEN
  }
});

return [{
  json: {
    result: {
      success: true,
      booking_id: booking.id,
      confirmation_sent: true
    }
  }
}];
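The booking flow above chains three external calls (Cal.com, your CRM, Twilio SMS), and any of them can fail transiently mid-call. A small retry-with-backoff wrapper keeps one flaky request from losing a booking; the attempt count and delays below are illustrative defaults, not tuned values:

```javascript
// Retry an async operation with exponential backoff.
// attempts and baseDelayMs are illustrative defaults.
async function withRetry(fn, { attempts = 3, baseDelayMs = 500 } = {}) {
  let lastError;
  for (let i = 0; i < attempts; i++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      if (i < attempts - 1) {
        // waits 500 ms, 1000 ms, 2000 ms, ... between attempts
        await new Promise(r => setTimeout(r, baseDelayMs * 2 ** i));
      }
    }
  }
  throw lastError;
}
```

Usage: wrap each external call, e.g. `const booking = await withRetry(() => $httpRequest({ ... }));`. Keep total retry time well under the caller's patience — a voice agent that goes silent for ten seconds has effectively failed anyway.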

Case: send_summary

// Node: Send to Slack
const { summary, urgency = 'low' } = $input.first().json.parameters;

const urgencyEmoji = {
  'high': '🔴',
  'medium': '🟡',
  'low': '🟢'
};

await $httpRequest({
  method: 'POST',
  url: $env.SLACK_WEBHOOK_URL,
  body: {
    text: `${urgencyEmoji[urgency]} *AI Voice Agent Summary*`,
    blocks: [
      {
        type: 'header',
        text: {
          type: 'plain_text',
          text: 'AI Voice Agent Call Summary'
        }
      },
      {
        type: 'section',
        fields: [
          {
            type: 'mrkdwn',
            text: `*Urgency:*\n${urgency}`
          },
          {
            type: 'mrkdwn',
            text: `*Time:*\n${new Date().toLocaleString()}`
          }
        ]
      },
      {
        type: 'section',
        text: {
          type: 'mrkdwn',
          text: `*Summary:*\n${summary}`
        }
      }
    ]
  }
});

return [{
  json: {
    result: {
      sent: true,
      channel: 'reception-team'
    }
  }
}];

Format Response:

// Node: Return to ElevenLabs
const toolResult = $input.first().json.result;

return [{
  json: {
    statusCode: 200,
    body: {
      result: toolResult
    }
  }
}];

Step 4: Connect Twilio to ElevenLabs

Configure Twilio to forward calls to ElevenLabs:

Method 1: Direct Integration (Recommended)

  1. In ElevenLabs, get your agent's phone number integration URL
  2. In Twilio, set your number's webhook to:
    https://api.elevenlabs.io/v1/convai/twilio/outbound/{AGENT_ID}
    

Method 2: Via n8n Proxy

If you need custom logic before connecting:

// Node: Proxy to ElevenLabs
const AGENT_ID = $env.ELEVENLABS_AGENT_ID;
const callSid = $input.first().json.body.CallSid;
const from = $input.first().json.body.From;

// Custom logic: check if the caller is blocked
// (checkBlockedList is your own lookup — e.g. a database or sheet query)
const isBlocked = await checkBlockedList(from);

if (isBlocked) {
  return [{
    json: {
      headers: { 'Content-Type': 'text/xml' },
      body: `<?xml version="1.0"?>
<Response>
  <Say>This number has been blocked.</Say>
  <Hangup/>
</Response>`
    }
  }];
}

// Forward to ElevenLabs
return [{
  json: {
    headers: { 'Content-Type': 'text/xml' },
    body: `<?xml version="1.0"?>
<Response>
  <Connect>
    <Stream url="wss://api.elevenlabs.io/v1/convai/twilio/connect/${AGENT_ID}" />
  </Connect>
</Response>`
  }
}];

Phase 4: Advanced Features

Feature 1: Call Transfer to Human

Add the ability to transfer calls to human agents:

// Node: Transfer Call
const { callSid, transferTo } = $input.first().json.parameters;

// Twilio transfer using Dial
const twiml = `<?xml version="1.0"?>
<Response>
  <Say>I'm transferring you to a team member now. Please hold.</Say>
  <Dial>
    <Number>${transferTo}</Number>
  </Dial>
</Response>`;

// Update call via Twilio API
await $httpRequest({
  method: 'POST',
  url: `https://api.twilio.com/2010-04-01/Accounts/${$env.TWILIO_ACCOUNT_SID}/Calls/${callSid}.json`,
  body: {
    Twiml: twiml
  },
  auth: {
    username: $env.TWILIO_ACCOUNT_SID,
    password: $env.TWILIO_AUTH_TOKEN
  }
});

return [{
  json: {
    result: {
      transferred: true,
      to: transferTo
    }
  }
}];

Add to ElevenLabs tools:

{
  "name": "transfer_to_human",
  "description": "Transfer the call to a human representative",
  "parameters": {
    "type": "object",
    "properties": {
      "department": {
        "type": "string",
        "enum": ["sales", "support", "billing", "emergency"]
      },
      "reason": {
        "type": "string",
        "description": "Reason for transfer"
      }
    },
    "required": ["department"]
  }
}

Feature 2: Multi-Language Support

Configure the agent to detect and respond in multiple languages:

// Node: Language Detection
const message = $input.first().json.body.message;

// Use OpenAI to detect language
const detection = await $httpRequest({
  method: 'POST',
  url: 'https://api.openai.com/v1/chat/completions',
  body: {
    model: 'gpt-4o-mini',
    messages: [
      {
        role: 'system',
        content: 'Detect the language of the input message. Return ONLY the ISO 639-1 code (e.g., "en", "es", "de", "th").'
      },
      {
        role: 'user',
        content: message
      }
    ]
  },
  headers: {
    'Authorization': `Bearer ${$env.OPENAI_API_KEY}`
  }
});

const language = detection.choices[0].message.content.trim();

// Map to ElevenLabs voice
const voiceMap = {
  'en': 'XB0fDUnXU5powFXDhCwa',  // English
  'es': 'TX3AEvVoIzMeN6jBfAjZ',  // Spanish
  'de': 'Hh1cR3w8I9fHNkEn11VW',  // German
  'th': 'custom-thai-voice-id'     // Thai
};

return [{
  json: {
    language: language,
    voice_id: voiceMap[language] || voiceMap['en'],
    system_prompt: getLocalizedPrompt(language)
  }
}];

function getLocalizedPrompt(lang) {
  const prompts = {
    'en': 'You are a helpful assistant...',
    'es': 'Eres un asistente servicial...',
    'de': 'Sie sind ein hilfsbereiter Assistent...',
    'th': 'คุณเป็นผู้ช่วยที่เป็นประโยชน์...'
  };
  return prompts[lang] || prompts['en'];
}

Feature 3: Post-Call Analytics

Store and analyze call data for insights:

// Node: Store Call Data
const callData = $input.first().json.body;

// Store in database
await $httpRequest({
  method: 'POST',
  url: 'https://your-analytics-db.com/api/calls',
  body: {
    call_id: callData.call_id,
    duration_seconds: callData.duration,
    transcript: callData.transcript,
    tools_used: callData.tools_used,
    sentiment: await analyzeSentiment(callData.transcript),
    outcome: callData.outcome, // 'appointment_booked', 'transferred', 'completed', etc.
    timestamp: new Date().toISOString()
  }
});

// Generate insights
const insights = {
  total_calls: await getTotalCallsToday(),
  avg_duration: await getAverageDuration(),
  booking_rate: await getBookingRate(),
  transfer_rate: await getTransferRate()
};

// Send daily report
if (isEndOfDay()) {
  await $httpRequest({
    method: 'POST',
    url: $env.SLACK_WEBHOOK_URL,
    body: {
      text: `📊 Daily Voice Agent Report\n\n` +
            `Total Calls: ${insights.total_calls}\n` +
            `Avg Duration: ${insights.avg_duration}s\n` +
            `Booking Rate: ${insights.booking_rate}%\n` +
            `Transfer Rate: ${insights.transfer_rate}%`
    }
  });
}

return [{
  json: {
    stored: true,
    insights: insights
  }
}];

async function analyzeSentiment(text) {
  const response = await $httpRequest({
    method: 'POST',
    url: 'https://api.openai.com/v1/chat/completions',
    body: {
      model: 'gpt-4o-mini',
      messages: [
        {
          role: 'system',
          content: 'Analyze sentiment. Return ONLY: POSITIVE, NEUTRAL, or NEGATIVE'
        },
        {
          role: 'user',
          content: text
        }
      ]
    },
    headers: {
      'Authorization': `Bearer ${$env.OPENAI_API_KEY}`
    }
  });
  
  return response.choices[0].message.content.trim();
}

Feature 4: Appointment Reminders (Outbound Calls)

Set up automated outbound calls for appointment reminders:

// Node: Schedule Outbound Calls
// Trigger: Daily at 9 AM (Cron)

// Get tomorrow's appointments
const tomorrow = new Date();
tomorrow.setDate(tomorrow.getDate() + 1);
const dateStr = tomorrow.toISOString().split('T')[0];

const appointments = await $httpRequest({
  method: 'GET',
  url: 'https://your-crm.com/api/appointments',
  qs: {
    date: dateStr,
    status: 'confirmed'
  },
  headers: {
    'Authorization': `Bearer ${$env.CRM_API_KEY}`
  }
});

// Queue reminder calls
for (const appt of appointments) {
  await $httpRequest({
    method: 'POST',
    url: `https://api.twilio.com/2010-04-01/Accounts/${$env.TWILIO_ACCOUNT_SID}/Calls.json`,
    body: {
      To: appt.phone,
      From: $env.TWILIO_PHONE_NUMBER,
      Url: `https://your-n8n-instance.com/webhook/twilio-reminder?appt_id=${appt.id}`,
      StatusCallback: `https://your-n8n-instance.com/webhook/twilio-status`
    },
    auth: {
      username: $env.TWILIO_ACCOUNT_SID,
      password: $env.TWILIO_AUTH_TOKEN
    }
  });
}

return [{
  json: {
    calls_scheduled: appointments.length,
    for_date: dateStr
  }
}];

Reminder Call Handler:

// Node: Reminder Call TwiML
const apptId = $input.first().json.query.appt_id;

// Get appointment details
const appt = await $httpRequest({
  method: 'GET',
  url: `https://your-crm.com/api/appointments/${apptId}`,
  headers: {
    'Authorization': `Bearer ${$env.CRM_API_KEY}`
  }
});

// Use ElevenLabs for natural reminder
const greeting = await generateSpeech(
  `Hello ${appt.patient_name}, this is a reminder from Tropical Dental. ` +
  `You have an appointment scheduled for tomorrow at ${appt.time}. ` +
  `Press 1 to confirm, 2 to cancel, or 3 to reschedule.`
);

const twiml = `<?xml version="1.0" encoding="UTF-8"?>
<Response>
  <Play>${greeting}</Play>
  <Gather numDigits="1" action="/webhook/twilio-reminder-response?appt_id=${apptId}" method="POST">
    <Say>Please press 1 to confirm, 2 to cancel, or 3 to reschedule.</Say>
  </Gather>
  <Say>We didn't receive a response. Goodbye.</Say>
</Response>`;

return [{
  json: {
    headers: { 'Content-Type': 'text/xml' },
    body: twiml
  }
}];

Production Considerations

Security Best Practices

1. Webhook Authentication:

Always verify webhook authenticity:

// Node: Verify Twilio Signature
const crypto = require('crypto');

const authToken = $env.TWILIO_AUTH_TOKEN;
const twilioSignature = $input.first().json.headers['x-twilio-signature'];
const url = $input.first().json.originalUrl;
const params = $input.first().json.body;

// Build string to sign
let data = url;
Object.keys(params).sort().forEach(key => {
  data += key + params[key];
});

// Calculate expected signature
const expectedSignature = crypto
  .createHmac('sha1', authToken)
  .update(Buffer.from(data, 'utf8'))
  .digest('base64');

// Verify with a constant-time comparison to avoid timing attacks
const sigBuf = Buffer.from(twilioSignature || '');
const expBuf = Buffer.from(expectedSignature);
if (sigBuf.length !== expBuf.length || !crypto.timingSafeEqual(sigBuf, expBuf)) {
  return [{
    json: {
      statusCode: 403,
      body: { error: 'Invalid signature' }
    }
  }];
}

// Continue processing
return $input.all();

2. Rate Limiting:

Prevent abuse by implementing rate limits:

// Node: Rate Limit Check
const caller = $input.first().json.body.From;
const cacheKey = `rate_limit:${caller}`;

// Check Redis or similar cache
const callCount = parseInt(await cache.get(cacheKey), 10) || 0; // Redis returns strings

if (callCount >= 10) { // Max 10 calls per hour
  return [{
    json: {
      statusCode: 429,
      body: { error: 'Rate limit exceeded' }
    }
  }];
}

// Increment counter
await cache.set(cacheKey, callCount + 1, 'EX', 3600); // 1 hour TTL

return $input.all();

3. Data Encryption:

  • Use HTTPS for all webhooks
  • Encrypt sensitive data at rest
  • Implement field-level encryption for PII
  • Regularly rotate API keys

Monitoring and Logging

Call Logging:

// Node: Log Call
const callData = $input.first().json;

await $httpRequest({
  method: 'POST',
  url: 'https://your-logging-service.com/api/logs',
  body: {
    timestamp: new Date().toISOString(),
    level: 'info',
    service: 'voice-agent',
    call_sid: callData.CallSid,
    from: callData.From,
    to: callData.To,
    duration: callData.CallDuration,
    status: callData.CallStatus,
    // Don't log full transcripts to public logs
    transcript_hash: hashTranscript(callData.Transcript)
  }
});

Health Checks:

Monitor system health with periodic checks:

// Workflow: Health Check (runs every 5 minutes)
// Trigger: Cron - */5 * * * *

const checks = [
  checkTwilioConnectivity(),
  checkElevenLabsAPI(),
  checkDatabaseConnection(),
  checkCRMConnection()
];

const results = await Promise.all(checks);

const allHealthy = results.every(r => r.healthy);

if (!allHealthy) {
  await $httpRequest({
    method: 'POST',
    url: $env.PAGERDUTY_WEBHOOK,
    body: {
      severity: 'warning',
      summary: 'Voice Agent System Degraded',
      details: results.filter(r => !r.healthy)
    }
  });
}

return [{
  json: {
    healthy: allHealthy,
    checks: results,
    timestamp: new Date().toISOString()
  }
}];

Cost Optimization

Cost Breakdown (Approximate):

| Service    | Cost Component       | Rate              |
|------------|----------------------|-------------------|
| Twilio     | Per-minute inbound   | $0.0085/min       |
| Twilio     | Per-minute outbound  | $0.013/min        |
| Twilio     | Phone number         | $1.15/month       |
| ElevenLabs | TTS (Turbo v2.5)     | $0.10/1000 chars  |
| ElevenLabs | Conv. AI sessions    | $0.05/min         |
| OpenAI     | GPT-4o               | $0.005/1K tokens  |
| Deepgram   | Nova-2 transcription | $0.0043/min       |
| n8n        | Self-hosted          | Server costs only |

Cost Example (100 calls/month, 5 min avg):

Twilio: 100 calls × 5 min × $0.0085 = $4.25
Phone number: $1.15
ElevenLabs: 500 min × $0.05 = $25.00
OpenAI: ~$5.00
Deepgram: 500 min × $0.0043 = $2.15
----------------------------------------
Total: ~$37.55/month

vs. Human receptionist: $2,500+/month
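The arithmetic above can be wrapped in a small cost model (rates copied from the table; the `llmEstimate` line is the same rough LLM figure, and all rates are approximate):

```javascript
// Rough monthly cost model using the per-unit rates from the table above
// (all rates approximate; check current vendor pricing before budgeting).
function monthlyCost({ calls, avgMinutes, llmEstimate = 5.0 }) {
  const minutes = calls * avgMinutes;
  const lines = {
    twilioInbound: minutes * 0.0085,
    phoneNumber: 1.15,
    elevenLabs: minutes * 0.05, // Conversational AI per-minute rate
    openai: llmEstimate,        // rough LLM spend for this volume
    deepgram: minutes * 0.0043
  };
  const total = Object.values(lines).reduce((a, b) => a + b, 0);
  return { ...lines, total };
}

// Reproduces the ~$37.55 figure from the example above
console.log(monthlyCost({ calls: 100, avgMinutes: 5 }).total.toFixed(2));
```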

Cost Optimization Tips:

  1. Use ElevenLabs' Turbo v2.5 for lower latency and cost
  2. Cache common responses to reduce TTS costs
  3. Implement call timeouts (5-10 min max)
  4. Use shorter system prompts to reduce LLM tokens
  5. Batch webhook processing when possible

Scaling Considerations

Horizontal Scaling:

# docker-compose.prod.yml
version: '3.8'

services:
  n8n:
    image: n8nio/n8n:latest
    deploy:
      replicas: 3  # note: replicas require Docker Swarm (docker stack deploy)
      resources:
        limits:
          cpus: '2'
          memory: 4G
    environment:
      - EXECUTIONS_MODE=queue
      - QUEUE_BULL_REDIS_HOST=redis  # n8n queue mode reads this variable
    # queue mode also expects separate n8n worker containers (command: worker)
    networks:
      - voice-agent-network

  redis:
    image: redis:alpine
    networks:
      - voice-agent-network

  nginx:
    image: nginx:alpine
    ports:
      - "80:80"
      - "443:443"
    volumes:
      - ./nginx.conf:/etc/nginx/nginx.conf
    networks:
      - voice-agent-network

networks:
  voice-agent-network:

Database Scaling:

  • Use PostgreSQL for production n8n
  • Implement read replicas for analytics queries
  • Archive old call data to cold storage
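The PostgreSQL point above can be sketched as a compose fragment; the `DB_POSTGRESDB_*` variables are how n8n is pointed at Postgres instead of its default SQLite (host names and credentials here are placeholders):

```yaml
# Fragment: run n8n against PostgreSQL instead of the default SQLite
services:
  n8n:
    environment:
      - DB_TYPE=postgresdb
      - DB_POSTGRESDB_HOST=postgres
      - DB_POSTGRESDB_PORT=5432
      - DB_POSTGRESDB_DATABASE=n8n
      - DB_POSTGRESDB_USER=n8n
      - DB_POSTGRESDB_PASSWORD=${POSTGRES_PASSWORD}

  postgres:
    image: postgres:16
    environment:
      - POSTGRES_DB=n8n
      - POSTGRES_USER=n8n
      - POSTGRES_PASSWORD=${POSTGRES_PASSWORD}
    volumes:
      - pg_data:/var/lib/postgresql/data

volumes:
  pg_data:
```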

Real-World Case Studies

Case Study 1: Dental Practice Transformation

Company: Sunrise Dental (4-location dental practice)

Challenge:

  • 400+ incoming calls daily
  • Missed calls during lunch and after hours
  • 3 full-time receptionists ($120K/year combined)
  • 15% no-show rate for appointments

Solution:

  • AI voice agent for call handling
  • Appointment scheduling integration
  • Automated reminder calls
  • Emergency triage routing

Results:

  • 95% of calls answered (vs. 70% before)
  • After-hours booking increased 40%
  • No-show rate dropped to 8%
  • Reduced to 1.5 FTE receptionists
  • Annual savings: $75,000
  • ROI achieved in 3 months

Technical Implementation:

  • n8n self-hosted on existing server
  • ElevenLabs with custom voice clone
  • Integration with Dentrix (dental software)
  • Twilio for telephony
  • Cal.com for online scheduling

Case Study 2: Real Estate Lead Qualification

Company: Metro Properties (residential real estate)

Challenge:

  • 200+ inquiries daily from Zillow, Realtor.com
  • Agents wasting time on unqualified leads
  • Slow response times losing hot leads
  • No 24/7 availability

Solution:

  • AI voice agent for initial inquiry handling
  • Lead qualification scoring
  • Automatic property information delivery
  • Hot lead escalation to agents

Results:

  • Response time: hours → seconds
  • Agent productivity up 60%
  • Qualified lead conversion +35%
  • $2M in additional sales attributed to voice agent
  • Cost: $500/month vs. 2 full-time ISA ($120K/year)

Integration Points:

  • Follow Up Boss CRM
  • MLS property database
  • Agent scheduling calendars
  • SMS follow-up sequences

Case Study 3: SaaS Company Support Automation

Company: CloudSync (B2B SaaS, 5,000 customers)

Challenge:

  • Support team overwhelmed with L1 tickets
  • Long hold times during peak hours
  • Repetitive password reset and billing questions
  • Support costs growing faster than revenue

Solution:

  • AI voice agent for tier-1 support
  • Self-service password resets
  • Billing inquiry handling
  • Technical issue triage
  • Human escalation for complex issues

Results:

  • 70% of calls resolved without human
  • Average handle time reduced 40%
  • CSAT scores maintained at 4.5/5
  • Support team able to focus on complex issues
  • Monthly support cost reduced from $45K to $28K

Troubleshooting Common Issues

Issue: High Latency in Responses

Symptoms: Long pauses between caller speech and AI response

Solutions:

  1. Use ElevenLabs Turbo v2.5 instead of standard models
  2. Enable streaming responses from OpenAI
  3. Implement connection pooling for API calls
  4. Use WebSocket binary frames instead of base64
  5. Deploy geographically close to users

// Optimization: Streaming OpenAI response
const response = await $httpRequest({
  method: 'POST',
  url: 'https://api.openai.com/v1/chat/completions',
  body: {
    model: 'gpt-4o',
    messages: conversationHistory,
    stream: true  // Enable streaming
  },
  headers: {
    'Authorization': `Bearer ${$env.OPENAI_API_KEY}`,
    'Accept': 'text/event-stream'
  }
});

// Process streamed chunks as they arrive (illustrative: real SSE parsing
// depends on your HTTP client's streaming support)
for await (const chunk of response) {
  const content = parseChunk(chunk);
  if (content) {
    // Generate and send audio incrementally
    await streamAudio(content);
  }
}

Issue: Poor Audio Quality

Symptoms: Robotic voice, audio dropouts, distortion

Solutions:

  1. Adjust ElevenLabs voice settings:
    • Stability: 0.5 (higher = more consistent, less expressive)
    • Similarity boost: 0.75 (higher = closer to original)
  2. Use appropriate audio format:
    • Twilio requires: 8kHz, mono, μ-law (PCMU) codec
    • Convert ElevenLabs output (MP3) to μ-law
  3. Check network latency:
    • WebSocket ping/pong should be <100ms
    • Use Twilio's region-specific endpoints
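For point 2, ElevenLabs' API can return `ulaw_8000` audio directly via its `output_format` parameter, which avoids conversion entirely. If you do need to convert PCM yourself, a minimal G.711 μ-law encoder looks like this (a sketch; it assumes the input is already mono, 16-bit, and downsampled to 8 kHz):

```javascript
// G.711 μ-law encoding of 16-bit PCM samples, as Twilio media streams expect.
const BIAS = 0x84;  // 132, added before the segment search
const CLIP = 32635; // max magnitude so the biased value fits the encoding

function linearToMulaw(sample) {
  let sign = (sample >> 8) & 0x80;  // keep the sign bit
  if (sign) sample = -sample;       // work with the magnitude
  if (sample > CLIP) sample = CLIP;
  sample += BIAS;

  // Find the segment (exponent): position of the highest set bit
  let exponent = 7;
  for (let mask = 0x4000; (sample & mask) === 0 && exponent > 0; mask >>= 1) {
    exponent--;
  }

  const mantissa = (sample >> (exponent + 3)) & 0x0f;
  return ~(sign | (exponent << 4) | mantissa) & 0xff; // μ-law bytes are inverted
}

// Encode a whole Int16Array of samples into a μ-law Buffer
function encodePcmToMulaw(pcm) {
  const out = Buffer.alloc(pcm.length);
  for (let i = 0; i < pcm.length; i++) out[i] = linearToMulaw(pcm[i]);
  return out;
}
```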

Issue: Calls Not Connecting

Symptoms: Call goes to voicemail, webhook not triggered

Solutions:

  1. Verify webhook URL is accessible:
# Test from outside your network
curl -X POST https://your-webhook-url.com/webhook/twilio-voice \
  -d "CallSid=test" \
  -d "From=+1234567890"
  2. Check Twilio error logs:
    • Console → Monitor → Logs → Errors
  3. Verify SSL certificate:
    • Self-signed certs won't work with Twilio
    • Use valid CA-issued certificates
  4. Check firewall rules:
    • Whitelist Twilio IP ranges
    • Allow HTTPS (443) inbound

Issue: Tool Calls Not Executing

Symptoms: AI says it will perform action but nothing happens

Solutions:

  1. Verify webhook URL in ElevenLabs:
    • Must be HTTPS
    • Must respond within 10 seconds
  2. Check n8n execution logs:
    • Look for errors in tool execution workflow
  3. Test tool independently:
curl -X POST https://your-n8n-instance.com/webhook/elevenlabs-tools \
  -H "Content-Type: application/json" \
  -d '{
    "tool_name": "check_availability",
    "parameters": {"date": "2026-04-10", "appointment_type": "cleaning"}
  }'
  4. Verify API credentials:
    • Check CRM/calendar API keys are valid
    • Ensure proper permissions granted

Q2-Q4 2026 Developments

Multimodal Voice Agents:

  • Integration with visual data ("Show me product X")
  • Screen sharing capabilities
  • Document collaboration during calls

Emotion Detection:

  • Real-time sentiment analysis
  • Adaptive responses based on caller mood
  • Automatic escalation for frustrated callers
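Adaptive responses of this kind can reuse the sentiment labels (`POSITIVE` / `NEUTRAL` / `NEGATIVE`) produced by the `analyzeSentiment` helper from the analytics section. A hypothetical escalation policy, as a sketch:

```javascript
// Hypothetical policy: decide the agent's next move from the latest
// sentiment label and how many negative turns have occurred in a row.
function nextAction(sentiment, consecutiveNegative) {
  if (sentiment === 'NEGATIVE' && consecutiveNegative >= 2) {
    return 'transfer_to_human';   // persistently frustrated caller: escalate
  }
  if (sentiment === 'NEGATIVE') {
    return 'empathetic_reprompt'; // soften tone, acknowledge frustration
  }
  return 'continue';              // normal conversation flow
}
```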

Voice Biometrics:

  • Caller identity verification via voiceprint
  • Fraud detection
  • Personalized greetings for known callers

Agent Collaboration:

  • Multiple AI agents conferring on complex issues
  • Seamless handoffs between specialized agents
  • Human-AI hybrid teams

Integration Opportunities

n8n Ecosystem:

  • Native ElevenLabs nodes (coming Q2 2026)
  • Built-in Twilio voice workflows
  • Pre-built voice agent templates

Third-Party Platforms:

  • Salesforce Service Cloud Voice
  • HubSpot Calling integration
  • Zendesk Talk compatibility
  • Microsoft Teams Phone

Conclusion

AI voice agents represent one of the most practical and impactful applications of artificial intelligence for businesses in 2026. By combining n8n's powerful automation capabilities with Twilio's telephony infrastructure and ElevenLabs' natural voice synthesis, you can build sophisticated voice assistants that handle real business scenarios—from appointment scheduling to lead qualification to customer support.

Key Takeaways:

  1. Start Simple: Begin with basic call handling and gradually add AI capabilities
  2. Integrate Deeply: Connect to your existing CRM, calendar, and business systems
  3. Monitor and Optimize: Track metrics and continuously improve conversation flows
  4. Maintain Human Touch: Always offer human escalation for complex or sensitive situations
  5. Scale Gradually: Start with one use case, prove ROI, then expand

Implementation Roadmap:

  • Week 1: Set up infrastructure (Twilio, ElevenLabs, n8n)
  • Week 2: Build basic inbound call handler
  • Week 3: Add AI conversation capabilities
  • Week 4: Integrate with business systems (CRM, calendar)
  • Week 5: Test, refine, and soft launch
  • Week 6: Full deployment with monitoring

The businesses that thrive in 2026 will be those that successfully blend AI efficiency with human expertise. Voice agents are not about replacing human connection—they're about ensuring that human time is spent where it matters most, while routine interactions are handled instantly and consistently, 24 hours a day.


Ready to build your AI voice agent? Contact Tropical Media for expert consultation on implementing voice AI for your business. We specialize in n8n automation, Twilio integration, and custom AI agent development.


Tags: AI Voice Agents, Conversational AI, n8n Automation, Twilio, ElevenLabs, Business Automation, Customer Service, Appointment Scheduling, Voice AI, Phone Automation, Self-Hosted AI, LLM Integration, Workflow Automation