Agentic System Golden Set Dataset

From GM-RKB
(Redirected from Canonical Agent Test Set)

An Agentic System Golden Set Dataset is an evaluation dataset that contains representative task snapshots and expected trajectories for agentic system regression testing.
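
As a rough illustration of the definition above, the following minimal Python sketch shows one way such a dataset could be represented and used in a regression check. The schema (`GoldenCase`, `task_snapshot`, `expected_trajectory`), the `find_regressions` helper, the stub agent, and the exact-match comparison are all assumptions made for illustration; they are not taken from the source or from any particular framework.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GoldenCase:
    """One golden-set entry (hypothetical schema, assumed for illustration)."""
    task_id: str
    task_snapshot: dict          # captured task input/state handed to the agent
    expected_trajectory: tuple   # ordered action names the agent is expected to take


def find_regressions(golden_set, run_agent):
    """Return the task_ids whose observed trajectory differs from the expected one."""
    regressions = []
    for case in golden_set:
        observed = tuple(run_agent(case.task_snapshot))
        # Exact match is the simplest possible check; real suites may be more lenient.
        if observed != case.expected_trajectory:
            regressions.append(case.task_id)
    return regressions


# Two illustrative cases with made-up task content.
golden_set = [
    GoldenCase("refund-001", {"user_request": "Refund order 123"},
               ("lookup_order", "issue_refund")),
    GoldenCase("status-002", {"user_request": "Where is order 456?"},
               ("lookup_order", "report_status")),
]


def stub_agent(snapshot):
    # Stand-in for a real agentic system: always looks up the order, then refunds.
    return ["lookup_order", "issue_refund"]


print(find_regressions(golden_set, stub_agent))  # prints ['status-002']
```

Exact trajectory matching is used here only because it is the simplest comparison to show; a practical regression suite might instead compare final outcomes or allow equivalent action orderings.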