LLM as Judge Performance Metric

From GM-RKB
Jump to navigation Jump to search

A LLM as Judge Performance Metric is a performance metric that quantifies the effectiveness, accuracy, and reliability of large language models when performing evaluation and judgment tasks.