Self-Refine: Iterative Refinement with Self-Feedback (paper review)


jinuklee 2024. 7. 14. 01:39

25 May 2023

Key point: self-provided feedback

Generate an initial output using an LLM; then the same LLM provides feedback on its own output and uses that feedback to refine the output, iterating until a stopping condition is met. The stopping condition stop(fb_t, t) either stops at a specified timestep t or extracts a stopping indicator (e.g., a scalar stop score) from the feedback fb_t.

To inform the model about the previous iterations, we retain the history of previous feedback and outputs by appending them to the prompt.
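The loop described above can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: `self_refine`, the `llm` callable (prompt string in, completion string out), the prompt wording, and the `STOP` marker used as the stopping indicator are all assumptions made for this example.

```python
from typing import Callable, List, Tuple


def self_refine(
    task: str,
    llm: Callable[[str], str],   # hypothetical: any function mapping prompt -> text
    max_steps: int = 4,          # stop at a fixed timestep t
    stop_marker: str = "STOP",   # assumed stop indicator looked for in the feedback
) -> Tuple[str, List[Tuple[str, str]]]:
    """Generate, then alternate feedback and refinement with the same LLM."""
    # Initial generation.
    output = llm(f"Task: {task}\nAnswer:")
    history: List[Tuple[str, str]] = []  # (output, feedback) pairs from earlier steps

    for _ in range(max_steps):
        # The same model critiques its own output.
        feedback = llm(
            f"Task: {task}\nAnswer: {output}\n"
            f"Feedback (reply {stop_marker} if no changes are needed):"
        )
        # stop(fb_t, t): either the feedback signals completion, or the loop
        # runs out of steps.
        if stop_marker in feedback:
            break

        # Keep earlier outputs and feedback by appending them to the prompt.
        history.append((output, feedback))
        past = "".join(
            f"Previous answer: {o}\nFeedback: {f}\n" for o, f in history
        )
        output = llm(f"Task: {task}\n{past}Refined answer:")

    return output, history
```

A plain substring check is the simplest possible stopping indicator; a scalar stop score parsed from the feedback would slot into the same place in the loop.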
