MT-Bench

From GM-RKB
(Redirected from LMSYS MT-Bench)
Jump to navigation Jump to search

A MT-Bench is a multi-turn LLM inference evaluation task that assesses multi-turn conversational capabilities and instruction-following capabilities of large language models through curated multi-turn prompt sets and automated LLM-based grading by strong LLM judges.



References

2023a

2023a

2023c

2023d