Research

𝜏³-bench: advancing agent benchmarking to knowledge and voice

𝜏³-bench is here. We've expanded agent evaluation to two new frontiers: knowledge retrieval and voice.

18 March 2026

Product, Thought leadership

𝜏³-bench: Knowledge

𝜏³-bench is here, and we've expanded agent evaluation to knowledge retrieval.

18 March 2026

Product, Thought leadership

𝜏²-bench: evaluating conversational agents in a dual-control environment

𝜏²-bench challenges AI agents not just to reason and act, but to coordinate, guide, and assist a user in achieving a shared objective. This leap from solo operation to co-ownership of a task pushes agents into a much more demanding space.

11 June 2025

Product, Thought leadership

𝜏-bench: benchmarking AI agents for the real-world

Sierra’s AI research team is on a mission to advance the frontier of conversational AI agents. In this research paper, we present a new benchmark for evaluating AI agents' performance and reliability in real-world settings, with dynamic user and tool interaction.

17 June 2024

Product
