25 May 2023 주안점 self-provided feedbackgenerate an initial output using an LLM; then, the same LLM provides feedback for its output and uses it to refine itself, iteratively until a stopping condition is met. The stopping condition stop(f bt, t) either stops at a specified timestep t, or extracts a stopping indicator (e.g. a scalar stop score) from the feedback. To inform the model about the pre..