Benchmark Metric

From GM-RKB

A Benchmark Metric is a performance metric that can measure AI agent performance on a benchmark task (for example, the agent's win rate against a human performance baseline).
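
As an illustration of the win-rate case mentioned above, the following is a minimal sketch, not a standard definition. It assumes each benchmark task yields one agent score and one human score where higher is better, and it counts ties as non-wins. The function name `win_rate` and the sample scores are hypothetical.

```python
from typing import Sequence


def win_rate(agent_scores: Sequence[float], human_scores: Sequence[float]) -> float:
    """Return the fraction of paired tasks on which the agent strictly beats the human.

    Assumptions (not from the source): scores are paired per task,
    higher is better, and a tie does not count as a win.
    """
    if len(agent_scores) != len(human_scores):
        raise ValueError("agent and human scores must be paired per task")
    if not agent_scores:
        raise ValueError("at least one task is required")
    # Count tasks where the agent's score is strictly greater.
    wins = sum(a > h for a, h in zip(agent_scores, human_scores))
    return wins / len(agent_scores)


# Hypothetical scores on four tasks: the agent wins tasks 1 and 4, loses task 2, ties task 3.
example = win_rate([0.9, 0.5, 0.7, 0.6], [0.8, 0.6, 0.7, 0.4])
```

Other tie-handling conventions (such as counting a tie as half a win) would give a different value on the same data, so a reported win rate should state which convention it uses.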