forest of thought 논문 요약

inference-time, RLHF/search (language)

forest of thought 논문 요약

jinuklee 2025. 2. 10. 00:56

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning

Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning Zhenni Bi Kai Han Chuanjian Liu Yehui Tang Yunhe Wang Abstract Large Language Models (LLMs) have shown remarkable abilities across various language tasks,

arxiv.org

benchmark : GSM 8k , MATH

3.1 FoT framework

suppose n개의 트리

inital root represents initial state or input problem

Sparse Activation.

most relevant tree 가 선택됨

tree 마다 layer의 node 들 중 high score node만을 선택해 expand 하는 구조

If the nodes at a certain level of the tree cannot produce valid outputs, the tree’s splitting process will terminate early, and the activation indicator value will be set to 0.

valid 경우에는 특정 깊이까지 expand되다 종료, indicator value 는 1

문제해결과 이해를 돕기 위해 model’s extensive knowledge base 에서 그에 맞는 텍스트를 가져옴 그런 다음 concatonation

3.2 Dynamic Self-Correction Strategy

흔히 있는 점수 낮은 trajectory에 대한 correction mechanism

3.3 Decision Making Strategy

Consensus-Guided Expert Decision (CGED) strategy

voting 한 정답에 대한 math expert의 accuracy

만약 결과들이 inconsistent 할시 LLM expert가 이에 대한 error examine

'inference-time, RLHF > search (language)' 카테고리의 다른 글

Tree of Thoughts: Deliberate Problem Solvingwith Large Language Models 논문리뷰 (0)	2024.08.29
Beyond a*: Better planning with transformers via search dynamics bootstrapping, (0)	2024.08.20
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning 논문리뷰 (0)	2024.08.17
Agent Q 논문리뷰: Advanced Reasoning and Learningfor Autonomous AI Agents (0)	2024.08.17
M* 논문리뷰 MindStar: Enhancing Math Reasoning in Pre-trainedLLMs at Inference Time (0)	2024.08.17

현재글forest of thought 논문 요약

이진욱님의 블로그 ai research memo for reference

이진욱님의 블로그

ai research memo for reference

Today :
Yesterday :

일	월	화	수	목	금	토
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

이진욱님의 블로그