Reinforcement Learning Prompt Optimization Method

From GM-RKB
Jump to navigation Jump to search

A Reinforcement Learning Prompt Optimization Method is a prompt optimization method that formulates prompt optimization as an RL problem with policy networks and reward functions.