Benchmark Metric

From GM-RKB
(Redirected from Agent Benchmark)
Jump to navigation Jump to search

A Benchmark Metric is a performance metric that can measure AI agent performance (including win rates against human performance).