
MMHAL-BENCH: ALIGNING LARGE MULTIMODAL MODELS WITH FACTUALLY AUGMENTED RLHF (Paper Review)

https://arxiv.org/pdf/2309.14525 Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalities can result in “hallucination”, generating textual outputs that are not grounded by the multimodal information in context. To address the multimodal misalignment issue, we adapt the Reinforcement Learning from Human Feedback (RLHF) from the text domain to the ta..

Dataset 2024.09.30

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

https://arxiv.org/pdf/2406.01297 Self-correction is an approach to improving responses from large language models (LLMs) by refining the responses using LLMs during inference. Prior work has proposed various self-correction frameworks using different sources of feedback, including self-evaluation and external feedback. However, there is still no consensus on the question of when LLMs can correct ..

Uncategorized 2024.09.21
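
As a rough illustration of the self-correction setup the survey examines (generate an answer, critique it with some feedback source, then refine it during inference), here is a minimal Python sketch. It is illustrative only: the `llm` callable, the prompts, and the stub model are assumed placeholders, not code or an API from the paper.

```python
# Minimal sketch of an intrinsic self-correction loop (illustrative only):
# draft an answer, ask the model to critique it, and revise using that feedback.
# `llm` is a placeholder for any text-completion callable, NOT an API from the paper.

from typing import Callable

def self_correct(question: str, llm: Callable[[str], str], rounds: int = 2) -> str:
    """Generate an answer, then iteratively critique and refine it at inference time."""
    answer = llm(f"Question: {question}\nAnswer:")
    for _ in range(rounds):
        critique = llm(
            f"Question: {question}\nProposed answer: {answer}\n"
            "List any mistakes in the proposed answer, or reply 'NONE'."
        )
        if critique.strip().upper().startswith("NONE"):
            break  # the model reports no mistakes; stop refining
        answer = llm(
            f"Question: {question}\nPrevious answer: {answer}\n"
            f"Feedback: {critique}\nRevised answer:"
        )
    return answer

if __name__ == "__main__":
    # Stub LLM so the sketch runs without any external service.
    stub = lambda prompt: "NONE" if "mistakes" in prompt else "42"
    print(self_correct("What is 6 x 7?", stub))
```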