Terminal-Bench Benchmark

From GM-RKB
Jump to navigation Jump to search

A Terminal-Bench Benchmark is an AI terminal-focused software development benchmark that can be implemented by a terminal-bench evaluation system to solve terminal-based AI tool evaluation tasks.