# Agent Tester

> Automatically test your AI agents with AI-driven conversations

The Agent Tester automatically tests your AI agents with AI-driven conversations. Instead of testing your agent by hand, it simulates realistic customer interactions and then analyzes how your agent performed.

<Note>
  The Agent Tester uses AI to generate realistic customer messages, simulating real-world interactions to thoroughly evaluate your agent's capabilities.
</Note>

## Getting Started

Navigate to your agent's **Tester** tab from the agent dashboard. You'll see the test configuration panel on the left and the test results on the right.

## Test Configuration

### Test Mode

Select a test mode based on what you want to evaluate:

<img src="https://mintcdn.com/convocore/75y4x3q_C9jC2e3p/images/agent-tester/test-mode.png?fit=max&auto=format&n=75y4x3q_C9jC2e3p&q=85&s=ac4fd17c770dbe5d82dd7bbeb12daf48" alt="Test Mode Selection" width="420" height="421" data-path="images/agent-tester/test-mode.png" />

| Mode               | Description                                                         |
| ------------------ | ------------------------------------------------------------------- |
| **Full Test**      | Test with all features enabled - prompts, tools, and knowledge base |
| **Prompt Only**    | Test only the AI prompt without tools or KB                         |
| **Prompt + Tools** | Test the prompt with selected tools enabled                         |
| **Prompt + KB**    | Test the prompt with knowledge base enabled                         |

<Tip>
  Use **Full Test** for comprehensive evaluation, or use specific modes to isolate and debug particular aspects of your agent.
</Tip>
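
As a rough mental model, each mode in the table above can be read as a combination of three switches: prompt, tools, and knowledge base. The sketch below only restates that table in code. The names `TesterMode`, `EnabledFeatures`, and `features_for` are hypothetical and are not part of any Convocore API.

```python
from dataclasses import dataclass
from enum import Enum


class TesterMode(Enum):
    """The four test modes listed in the table (labels as shown in the UI)."""
    FULL_TEST = "Full Test"
    PROMPT_ONLY = "Prompt Only"
    PROMPT_TOOLS = "Prompt + Tools"
    PROMPT_KB = "Prompt + KB"


@dataclass(frozen=True)
class EnabledFeatures:
    """Hypothetical container: which parts of the agent a mode exercises."""
    prompt: bool
    tools: bool
    knowledge_base: bool


# Illustrative mapping that mirrors the mode table; not an official schema.
MODE_FEATURES = {
    TesterMode.FULL_TEST: EnabledFeatures(prompt=True, tools=True, knowledge_base=True),
    TesterMode.PROMPT_ONLY: EnabledFeatures(prompt=True, tools=False, knowledge_base=False),
    TesterMode.PROMPT_TOOLS: EnabledFeatures(prompt=True, tools=True, knowledge_base=False),
    TesterMode.PROMPT_KB: EnabledFeatures(prompt=True, tools=False, knowledge_base=True),
}


def features_for(mode: TesterMode) -> EnabledFeatures:
    """Look up which features a given mode turns on."""
    return MODE_FEATURES[mode]
```

Reading the modes this way makes it easier to pick one when debugging: if a problem appears in **Full Test** but not in **Prompt Only**, the tools or the knowledge base are the likely cause.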

### Tools Configuration

When testing with tools, you can select which tools to include in the test:

<img src="https://mintcdn.com/convocore/75y4x3q_C9jC2e3p/images/agent-tester/tools-config.png?fit=max&auto=format&n=75y4x3q_C9jC2e3p&q=85&s=80bd15c4bbdfb915770a00789266731a" alt="Tools Configuration" width="408" height="399" data-path="images/agent-tester/tools-config.png" />

* **Select All / Deselect All**: Quickly toggle all tools
* **Individual Tool Toggle**: Enable/disable specific tools for targeted testing
* **Flask Icon (🧪)**: Click to test a tool individually with AI-generated data
* **Knowledge Base Toggle**: Enable or disable KB access during the test

<Warning>
  Only tools assigned to the agent will appear in this list. Make sure to configure your agent's tools before testing.
</Warning>

### Test Scenarios

Provide context for what the test should focus on:

<img src="https://mintcdn.com/convocore/75y4x3q_C9jC2e3p/images/agent-tester/test-scenarios.png?fit=max&auto=format&n=75y4x3q_C9jC2e3p&q=85&s=399a165d2319ff689736db32d299d972" alt="Test Scenarios" width="401" height="479" data-path="images/agent-tester/test-scenarios.png" />

Describe the situation in the **Test Scenario Context** field. The AI tester generates customer messages that fit this scenario.

**Example scenarios:**

* "Customer wants to book an appointment for next week"
* "User asking about pricing tiers"
* "Customer needs help with a product return"

**Quick Scenario Buttons:**

* **General Inquiry**: Basic questions about your service
* **Booking Scenario**: Test appointment/booking flows
* **Pricing Questions**: Test pricing-related conversations

<Info>
  The scenario describes the situation, not the literal message to send. For example, "Customer wants to book a meeting" makes the AI tester act as a customer who asks for an appointment in their own words, rather than sending the text "I want to book a meeting" verbatim.
</Info>

### Conversation Length

Control how many exchanges the test will run:

<img src="https://mintcdn.com/convocore/75y4x3q_C9jC2e3p/images/agent-tester/conversation-length.png?fit=max&auto=format&n=75y4x3q_C9jC2e3p&q=85&s=19dc2d724bfc99975281764e010575f3" alt="Conversation Length" width="414" height="311" data-path="images/agent-tester/conversation-length.png" />

* **Fixed exchanges toggle**: When enabled, the conversation will run for exactly the specified number of exchanges
* **Maximum Conversation Exchanges**: Set from 2 to 15 exchanges (one exchange is one User→Bot pair)
* **Slider**: Quickly adjust the conversation length

<Tip>
  For thorough testing, we recommend at least **5 exchanges** to properly evaluate your agent's capabilities across multiple turns.
</Tip>
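
The length settings above can be pictured as a small rule: the value must fall in the 2 to 15 range, and the fixed toggle decides whether the test may stop before that limit. The following is a minimal sketch of that rule, not the tester's actual implementation. `ConversationLength` and `should_continue` are hypothetical names, and it is an assumption that a fixed-length test ignores a natural conversation end.

```python
from dataclasses import dataclass

# Range taken from the "Maximum Conversation Exchanges" setting.
MIN_EXCHANGES = 2
MAX_EXCHANGES = 15


@dataclass(frozen=True)
class ConversationLength:
    """Hypothetical model of the conversation length settings."""
    max_exchanges: int
    fixed: bool = False

    def __post_init__(self) -> None:
        # Reject values outside the range the UI allows.
        if not MIN_EXCHANGES <= self.max_exchanges <= MAX_EXCHANGES:
            raise ValueError(
                f"max_exchanges must be between {MIN_EXCHANGES} and {MAX_EXCHANGES}"
            )

    def should_continue(self, completed: int, agent_ended: bool) -> bool:
        """Decide whether another User->Bot exchange should run."""
        if completed >= self.max_exchanges:
            return False  # the configured limit is always respected
        if self.fixed:
            return True  # assumption: fixed mode runs to the exact count
        return not agent_ended  # otherwise a natural end stops the test


settings = ConversationLength(max_exchanges=5, fixed=True)
```

With `fixed=True`, `settings.should_continue(3, agent_ended=True)` stays true until five exchanges are done. With the toggle off, the same call returns false, which matches a test that ends early after the agent wraps up the conversation.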

## Running a Test

1. Configure your test settings (mode, tools, scenario, length)
2. Click the **Run Test** button
3. Watch the conversation unfold in real time in the logs
4. Review the comprehensive analysis when complete

## Test Results & Analysis

After the test completes, you'll receive a detailed analysis:

<img src="https://mintcdn.com/convocore/75y4x3q_C9jC2e3p/images/agent-tester/full-analysis.png?fit=max&auto=format&n=75y4x3q_C9jC2e3p&q=85&s=9a60ae3183a96072b71cab2a71f323fb" alt="Test Analysis Results" width="1695" height="798" data-path="images/agent-tester/full-analysis.png" />

### Quality Score

A score out of 10 indicating overall agent performance.

### Test Results Summary

| Category          | Status     | Description                            |
| ----------------- | ---------- | -------------------------------------- |
| Response Quality  | ✅/⚠️/❌     | How well the agent responds            |
| Tool Usage        | ✅/⚠️/❌/N/A | Whether tools were triggered correctly |
| KB Accuracy       | ✅/⚠️/❌/N/A | Knowledge base retrieval accuracy      |
| Conversation Flow | ✅/⚠️/❌     | Natural conversation progression       |
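
One way to read the summary table is as a checklist: every category marked ⚠️ or ❌ deserves a closer look, and only Tool Usage and KB Accuracy can be N/A. The sketch below encodes that reading. `Status`, `CATEGORIES`, and `needs_attention` are hypothetical names used for illustration, and the tester is not known to expose its results in this form.

```python
from enum import Enum


class Status(Enum):
    """The status symbols used in the summary table."""
    PASS = "✅"
    WARNING = "⚠️"
    FAIL = "❌"
    NOT_APPLICABLE = "N/A"


# Category names as they appear in the table, in the same order.
CATEGORIES = ("Response Quality", "Tool Usage", "KB Accuracy", "Conversation Flow")

# Per the table, only these two categories may be reported as N/A.
MAY_BE_NOT_APPLICABLE = {"Tool Usage", "KB Accuracy"}


def needs_attention(summary: dict[str, Status]) -> list[str]:
    """Return the categories whose status is a warning or a failure."""
    flagged = []
    for category in CATEGORIES:
        status = summary[category]
        if status is Status.NOT_APPLICABLE and category not in MAY_BE_NOT_APPLICABLE:
            raise ValueError(f"{category} cannot be N/A")
        if status in (Status.WARNING, Status.FAIL):
            flagged.append(category)
    return flagged


# Example: a Prompt + Tools run, so the knowledge base was not exercised.
example = {
    "Response Quality": Status.PASS,
    "Tool Usage": Status.WARNING,
    "KB Accuracy": Status.NOT_APPLICABLE,
    "Conversation Flow": Status.PASS,
}
```

Here `needs_attention(example)` flags only Tool Usage, which is the category to look up first in the analysis sections and the logs.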

### Analysis Sections

* **Agent Strengths**: What your agent does well
* **Areas for Improvement**: Specific recommendations
* **Tools/Capabilities Analysis**: How tools were used
* **Knowledge Base Analysis**: KB retrieval performance
* **Customer Journey**: End-to-end experience assessment
* **Recommendations**: Actionable improvement suggestions
* **Final Verdict**: Executive summary

## Viewing Logs

Click the **Logs** tab to see the detailed conversation:

* **Sent messages**: What the AI tester sent to your agent
* **Received messages**: Your agent's responses
* **Info messages**: System events and status updates
* **Error messages**: Any issues that occurred

## Tips for Effective Testing

<CardGroup cols={2}>
  <Card title="Use Specific Scenarios" icon="crosshairs">
    The more specific your test scenario, the more realistic and useful the test will be.
  </Card>

  <Card title="Test Different Modes" icon="layer-group">
    Run multiple tests with different modes to isolate issues.
  </Card>

  <Card title="Check Tool Triggers" icon="wrench">
    If testing tools, verify they were actually triggered in the logs.
  </Card>

  <Card title="Review the Journey" icon="route">
    Pay attention to the Customer Journey section for UX insights.
  </Card>
</CardGroup>

## Credit Usage

<Note>
  The Agent Tester consumes credits based on actual token usage, using the same pricing as the gemini-2.5-flash model. Credits are charged at the end of each test session.
</Note>

The credit calculation includes:

* Tokens used for generating test prompts
* Tokens used for follow-up questions
* Tokens used for the final analysis
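
To get a feel for how the three token sources add up, the calculation can be sketched as a simple sum multiplied by a rate. This is an illustration only: `TokenUsage`, `estimate_credits`, and the `credits_per_1k_tokens` parameter are hypothetical, the rate used in the example is a placeholder rather than a real price, and actual billing may treat input and output tokens differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenUsage:
    """Hypothetical breakdown of tokens consumed by one test session."""
    test_prompts: int    # generating the tester's opening messages
    follow_ups: int      # generating follow-up questions
    final_analysis: int  # producing the analysis report

    @property
    def total(self) -> int:
        return self.test_prompts + self.follow_ups + self.final_analysis


def estimate_credits(usage: TokenUsage, credits_per_1k_tokens: float) -> float:
    """Estimate credits for a session, given an assumed flat per-1k-token rate."""
    return usage.total / 1000 * credits_per_1k_tokens


# Placeholder numbers for illustration; not real usage or pricing.
usage = TokenUsage(test_prompts=1200, follow_ups=800, final_analysis=2000)
estimated = estimate_credits(usage, credits_per_1k_tokens=0.5)
```

Because longer conversations generate more follow-up questions and a longer transcript to analyze, raising the number of exchanges is the setting most likely to increase the total.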

## Troubleshooting

<AccordionGroup>
  <Accordion title="Test ends too early">
    If your test ends before the configured number of exchanges, check whether:

    * Your agent's response triggered a natural conversation end
    * There was a timeout (agent didn't respond within 30 seconds)
    * An error occurred during the test
  </Accordion>

  <Accordion title="Tools not triggering">
    Ensure you've:

    * Selected the tools in the Tools Configuration
    * Used a test scenario that would naturally require the tool
    * Configured the tool correctly in your agent
  </Accordion>

  <Accordion title="Low quality scores">
    Review the analysis for specific recommendations. Common issues include:

    * Vague or generic responses
    * Not using available tools when appropriate
    * Poor conversation flow or context retention
  </Accordion>
</AccordionGroup>

## Related Features

<CardGroup cols={2}>
  <Card title="Tools" icon="wrench" href="/features/tools">
    Learn how to create and configure tools for your agents.
  </Card>

  <Card title="Knowledge Base" icon="brain" href="/agent-creation/knowledgebase/about-the-knowledgebase">
    Set up a knowledge base for your agent.
  </Card>

  <Card title="Canvas" icon="diagram-project" href="/canvas/introduction">
    Build complex conversation flows with the visual canvas.
  </Card>

  <Card title="Analytics" icon="chart-line" href="/features/analytics">
    Track your agent's performance over time.
  </Card>
</CardGroup>
