카테고리 없음

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-GrainedVideo Reasoning via Core Frame Selection

jinuklee 2024. 11. 27. 01:58

https://arxiv.org/pdf/2411.14794