What It Helps You Catch
- Prompt regressions
- Weak objection handling
- Missing tool coverage
- Broken booking or transfer paths
- Inconsistent qualification behavior
Core Objects
- Scenario: the test case
- Run: one execution over one or more scenarios
- Expected outcome: the condition you want the agent to satisfy
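The three objects above can be pictured as plain data structures. This is a minimal sketch, not the product's actual schema: the class names mirror the terms in the list, but every field name (`brief`, `tags`, `agent_version`, `passed`) and the `record` method are assumptions made for illustration. The expected outcome is modeled as a predicate over a transcript string.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Scenario:
    """A single test case (hypothetical shape, not the product's schema)."""
    name: str
    brief: str
    # Expected outcome: the condition the agent should satisfy,
    # modeled here as a predicate over the conversation transcript.
    expected_outcome: Callable[[str], bool]
    tags: List[str] = field(default_factory=list)


@dataclass
class Run:
    """One execution of an agent version over one or more scenarios."""
    agent_version: str
    scenarios: List[Scenario]
    passed: Dict[str, bool] = field(default_factory=dict)

    def record(self, scenario: Scenario, transcript: str) -> bool:
        """Evaluate the scenario's expected outcome and store the result."""
        ok = scenario.expected_outcome(transcript)
        self.passed[scenario.name] = ok
        return ok


# Illustrative usage with a made-up scenario and transcript.
booking = Scenario(
    name="book-demo",
    brief="Caller wants a demo next week but is price-sensitive.",
    expected_outcome=lambda t: "appointment confirmed" in t.lower(),
    tags=["booking", "objection"],
)
run = Run(agent_version="candidate-1", scenarios=[booking])
run.record(booking, "Agent: Great, your appointment confirmed for Tuesday.")
```

Keeping the expected outcome as a single predicate per scenario matches the idea of one main success condition per test case.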
Generate Scenarios
Use generated scenarios when you want a wide spread of realistic tests from a short brief.
Run Scenarios
Suggested Testing Process
- Generate or hand-author scenarios for the core jobs the agent must complete.
- Include edge cases, objections, and failure paths.
- Run the candidate version.
- Review the failures, then fix prompts, tools, or settings.
- Re-run before deployment.
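The process above amounts to a run, fix, re-run loop with a gate before deployment. The sketch below illustrates that loop with toy stand-ins. The agents are plain functions that map a brief to a transcript, and every name here (`run_scenarios`, `failures`, `candidate_v1`, `candidate_v2`) is hypothetical rather than part of any real API.

```python
from typing import Callable, Dict, List, Tuple

# A scenario here is (name, brief, check); all three are hypothetical stand-ins.
Scenario = Tuple[str, str, Callable[[str], bool]]


def run_scenarios(agent: Callable[[str], str],
                  scenarios: List[Scenario]) -> Dict[str, bool]:
    """Run every scenario against one agent version and record pass/fail."""
    return {name: check(agent(brief)) for name, brief, check in scenarios}


def failures(results: Dict[str, bool]) -> List[str]:
    """Names of the scenarios that did not meet their expected outcome."""
    return sorted(name for name, ok in results.items() if not ok)


# Core job, an objection, and a failure path.
scenarios: List[Scenario] = [
    ("book-demo", "Caller asks to book a demo.",
     lambda t: "booked" in t),
    ("price-objection", "Caller says it is too expensive.",
     lambda t: "discount" in t or "value" in t),
    ("transfer-fails", "Caller asks for a human; transfer line is down.",
     lambda t: "callback" in t),
]


def candidate_v1(brief: str) -> str:
    # Toy agent: handles booking only.
    return "booked" if "book" in brief else "sorry"


def candidate_v2(brief: str) -> str:
    # Toy agent after fixes: also covers the objection and failed-transfer paths.
    if "book" in brief:
        return "booked"
    if "expensive" in brief:
        return "here is the value you get"
    return "we will arrange a callback"


first = failures(run_scenarios(candidate_v1, scenarios))   # run the candidate
second = failures(run_scenarios(candidate_v2, scenarios))  # re-run after fixes
ready_to_deploy = not second  # deploy only when nothing fails
```

In this toy setup the first candidate fails the objection and failed-transfer scenarios, which is the signal to fix and re-run rather than deploy.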
What Good Scenarios Look Like
Good scenarios are:
- Specific about the caller or sender goal
- Focused on one main success condition
- Diverse in tone and objections
- Tagged so release candidates can be compared over time
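Tags make it possible to compare release candidates by category rather than by one overall score. The sketch below assumes each run yields a mapping from scenario name to pass/fail and that tags are stored per scenario. The function names and sample data are illustrative, not taken from any real tool.

```python
from collections import defaultdict
from typing import Dict, List


def pass_rate_by_tag(results: Dict[str, bool],
                     tags: Dict[str, List[str]]) -> Dict[str, float]:
    """Fraction of passing scenarios for each tag."""
    totals: Dict[str, int] = defaultdict(int)
    passes: Dict[str, int] = defaultdict(int)
    for name, ok in results.items():
        for tag in tags.get(name, []):
            totals[tag] += 1
            passes[tag] += int(ok)
    return {tag: passes[tag] / totals[tag] for tag in totals}


def regressions(old: Dict[str, float], new: Dict[str, float]) -> List[str]:
    """Tags whose pass rate dropped between two release candidates."""
    return sorted(tag for tag in old if new.get(tag, 0.0) < old[tag])


# Made-up tags and results for two release candidates.
tags = {
    "book-demo": ["booking"],
    "reschedule": ["booking"],
    "price-objection": ["objection"],
}
rc1 = {"book-demo": True, "reschedule": True, "price-objection": False}
rc2 = {"book-demo": True, "reschedule": False, "price-objection": True}

rc1_rates = pass_rate_by_tag(rc1, tags)
rc2_rates = pass_rate_by_tag(rc2, tags)
dropped = regressions(rc1_rates, rc2_rates)
```

Here the second candidate improves objection handling but regresses on booking, a trade-off that a single aggregate pass rate would hide.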