π Quick Start
Get your Gemini Live voice agent running in under 5 minutes with this comprehensive setup guide.π Prerequisites
Before starting, ensure you have:Google AI API Key
Access to Googleβs Generative AI API with Gemini Live enabled
Twilio Account
For phone call integration (optional for web-only)
Node.js Environment
Node.js 18+ for running the voice service
TixAE Account
Your TixAE workspace with voice features enabled
π Step 1: Google AI API Setup
Get Your API Key
- Visit the Google AI Studio
- Create a new API key or use existing one
- Important: Ensure your key has access to Gemini Live models
API Key Requirements: Your Google AI API key must have access to the latest Gemini models. Some keys may not have Live API access by default.
Verify API Access
Test your API key with this quick verification:βοΈ Step 2: Configure Your Agent
1. Create Your Agent
In your TixAE dashboard:- Navigate to Create Agent β Voice Agent
- Choose βGoogle Gemini Liveβ as your voice provider
- Set up your basic agent configuration
2. Model Selection
Choose the correct Gemini Live model:Model Recommendations:
gemini-2.0-flash-exp
: Fastest, most reliable (Recommended)gemini-2.0-flash-live-001
: Stable alternative- Avoid:
gemini-2.5-flash-exp-native-audio-thinking-dialog
(tool calling issues)
ποΈ Step 3: Voice Configuration
System Instructions
Configure your agentβs personality and instructions:System Prompt Example
Voice Settings
Configure these essential voice parameters:π§ Step 4: Environment Setup
Environment Variables
Add these to your.env
file:
Package Dependencies
Ensure these packages are installed:package.json
π Step 5: Phone Integration (Optional)
Twilio Setup
If you want phone call capabilities:- Purchase a phone number in Twilio Console
- Configure webhook URL to point to your TixAE endpoint:
- Set HTTP method to
POST
WebRTC Setup
For browser-based calling:- Enable WebRTC in your agent settings
- Configure STUN/TURN servers if behind NAT
- Test browser permissions for microphone access
π§ͺ Step 6: Testing & Verification
Test Your Setup
1
API Key Test
Verify your Google AI API key works with Gemini Live models
2
Voice Test
Test basic voice input/output functionality
3
Tool Calling Test
Verify tools can be called during conversation
4
Phone Test
Make a test call to verify end-to-end functionality
Debug Commands
Use these commands to troubleshoot:β‘ Performance Optimization
Ultra-Fast Audio Processing
Our implementation includes cutting-edge optimizations:- 20ms chunk processing for minimal latency
- Loop-unrolled resampling (6x speed improvement)
- Direct memory operations using bit shifts
- Minimal validation for maximum throughput
Automatic Optimization: These performance enhancements are automatically applied when using TixAEβs Gemini Live integration. No additional configuration required!
π¨ Troubleshooting
Common Issues
β Connection Failed
β Connection Failed
Problem:
Connection closed: 1008 models/gemini-2.5-flash-exp is not found
Solution: Use the correct model name:β Tool Calls Not Working
β Tool Calls Not Working
Problem: AI doesnβt execute tools or responds with βI canβt help with thatβSolutions:
- Ensure
responseModalities
is set to["AUDIO"]
(not["TEXT"]
) - Verify tool schemas are properly formatted
- Check that function responses include the required
id
field
β Audio Quality Issues
β Audio Quality Issues
Problem: Choppy, garbled, or delayed audioSolutions:
- Verify sample rate conversions (8kHz β 48kHz β 16kHz)
- Check network latency to Googleβs servers
- Enable ultra-fast audio processing mode
β Reconnection Problems
β Reconnection Problems
Problem: Agent doesnβt recover from network disconnectionsSolution: Our implementation includes automatic reconnection with:
- 3 retry attempts with exponential backoff
- Context preservation across reconnections
- Multi-language apology messages
π― Next Steps
Add Tool Integration
Integrate external APIs and function calling
Performance Optimization
Ultra-fast audio processing and latency tuning
Advanced Configuration
Reconnection logic, language detection, and more
Production Deployment
Deploy your voice agent to production
π Need Help?
Having trouble with setup?
Check out our troubleshooting guide or contact support for personalized assistance.