Agentic System Golden Set Dataset

From GM-RKB
(Redirected from Agent Benchmark Dataset)
Jump to navigation Jump to search

An Agentic System Golden Set Dataset is an evaluation dataset that contains representative task snapshots and expected trajectorys for agentic system regression testing.