Output-Centric Safety Training Method

From GM-RKB
Jump to navigation Jump to search

An Output-Centric Safety Training Method is an AI safety training method that can support safe completion tasks by generating contextually appropriate responses rather than hard refusals.