inference-time, RLHF/search (language)

Beyond a*: Better planning with transformers via search dynamics bootstrapping,

jinuklee 2024. 8. 20. 20:53

https://arxiv.org/abs/2402.14083.