inference-time, RLHF/search (language)

M* 논문리뷰 MindStar: Enhancing Math Reasoning in Pre-trainedLLMs at Inference Time

jinuklee 2024. 8. 17. 11:59

https://arxiv.org/pdf/2405.16265