inference-time, RLHF/search (language) Beyond a*: Better planning with transformers via search dynamics bootstrapping, jinuklee 2024. 8. 20. 20:53 https://arxiv.org/abs/2402.14083.