VisVM : Scaling Inference-Time Search with Vision Value Modelfor Improved Visual Comprehension https://arxiv.org/pdf/2412.03704v2 inference-time, RLHF/search (multimodal) 2025.01.24
CoMCTS 논문제목 Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search inference-time, RLHF/search (multimodal) 2025.01.22