Agentic System Golden Set Dataset

From GM-RKB
Jump to navigation Jump to search

An Agentic System Golden Set Dataset is an evaluation dataset that contains representative task snapshots and expected trajectorys for agentic system regression testing.