Agentic Evaluation Vocabulary

Practice vocabulary for evaluating agentic AI systems: trajectory evaluation, task completion rate, tool call accuracy, and benchmarks.

0 / 5 completed
1 / 5
A researcher says 'We use agent trajectory evaluation.' What does 'trajectory' refer to in this context?