inference-time, RLHF/STaR, ResT - LMM

REVISIT LARGE-SCALE IMAGE-CAPTION DATA IN PRETRAINING MULTIMODAL FOUNDATION MODELS 논문리뷰

jinuklee 2024. 10. 9. 14:11

https://arxiv.org/pdf/2410.02740