- Your agent redirects off-topic questions gracefully, not bluntly.
- It allows a brief moment of warmth on small talk before guiding back to the task.
- It's honest about being AI when asked.
- It declines to give opinions on competitors, politics, or topics outside its scope.
- It resists jailbreak attempts and prompt injection without acknowledging the attempt.
- It handles multi-intent callers by sequencing requests, not dismissing them.
We probe the full range: innocent confusion (wrong number), curiosity (are you a robot?), adversarial attacks (ignore your instructions), and the gray zone in between.