카테고리 없음

PaliGemma: A versatile 3B VLM for transfer 논문리뷰

jinuklee 2024. 9. 15. 17:07

https://arxiv.org/pdf/2407.07726

PaliGemma is an open Vision-Language Model (VLM) that is based on the SigLIP-So400m vision encoder and the Gemma-2B language model