2024/11 Post List

2024/11 45

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

https://arxiv.org/abs/2410.09008

Uncategorized 2024.11.29

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

https://arxiv.org/pdf/2410.13413

Uncategorized 2024.11.29

INTERLEAVED SCENE GRAPH FOR INTERLEAVED TEXT-AND-IMAGE GENERATION ASSESSMENT

https://arxiv.org/abs/2411.17188

Uncategorized 2024.11.29

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

https://arxiv.org/pdf/2411.16594

Assessment and evaluation have long been critical challenges in artificial intelligence (AI) and natural language processing (NLP). However, traditional methods, whether matching-based or embedding-based, often fall short of judging subtle attributes and delivering satisfactory results. Recent advancements in Large Language Models (LLMs) inspire the "LLM-as-a-judge..

Uncategorized 2024.11.27

Evaluating the role of ‘Constitutions’ for learning from AI feedback

https://github.com/saskia-rr/Evaluating-Constitutions

https://arxiv.org/pdf/2411.10168

The growing capabilities of large language models (LLMs) have led to their use as substitutes for human feedback for training and assessing other LLMs. These method..

Uncategorized 2024.11.27

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

https://arxiv.org/pdf/2411.14794

Uncategorized 2024.11.27

Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks

https://arxiv.org/pdf/2410.04055

Uncategorized 2024.11.24

SEALONG: Large Language Models Can Self-Improve in Long-context Reasoning

https://arxiv.org/pdf/2411.08147

Uncategorized 2024.11.24

Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning

https://arxiv.org/pdf/2410.05928