이진욱님의 블로그

  • 홈
  • 태그
  • 방명록
  • 빅테크 리포트
  • LLM
  • 멀티모달
  • 디퓨전 모델

2024/11/03 2

Style over Substance: Failure Modes of LLM Judges in Alignment Benchmarking 논문리뷰

The release of ChatGPT in November 2022 sparked an explosion of interest in post-training and an avalanche of new preference optimization (PO) methods.These methods claim superior alignment by virtue of better correspondence with human pairwise preferences, often measured by LLM-judges.In this work, we attempt to answer the following question – do LLM-judge preferences translate to progress on o..

카테고리 없음 2024.11.03

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style 논문리뷰

https://huggingface.co/papers/2410.16184https://arxiv.org/pdf/2410.16184Reward models are critical in techniques like Reinforcement Learning from Human Feedback (RLHF) and Inference Scaling Laws, where they guide language model alignment and select optimal responses. Despite their importance, existing reward model benchmarks often evaluate models by asking them to distinguish between responses g..

카테고리 없음 2024.11.03
이전
1
다음
더보기
프로필사진

이진욱님의 블로그

ai research memo for reference

  • 분류 전체보기 (286)
    • inference-time, RLHF (41)
      • STaR, ReST (4)
      • STaR, ResT - LMM (17)
      • search (language) (10)
      • search (multimodal) (2)
      • Process reward model (6)
      • scalable oversight (1)
      • red-team (1)
    • VLM (5)
    • RLFH (2)
    • 프롬프팅 (3)
    • interpretability (2)
    • agent (23)
      • on-device agent (1)
      • multi - agent (17)
      • 멀티 에이젼트 결과 (2)
    • PEFT (1)
      • LoRA (1)
    • multi-step reasoning(수학, 코딩.. (7)
      • 멀티모달 cot (5)
    • 한계 limitation (1)
    • 데이터셋 (3)
      • 합성데이터 (1)
    • 3D, real world, game, VR (2)

Tag

최근글과 인기글

  • 최근글
  • 인기글

최근댓글

공지사항

페이스북 트위터 플러그인

  • Facebook
  • Twitter

Archives

Calendar

«   2024/11   »
일 월 화 수 목 금 토
1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30

방문자수Total

  • Today :
  • Yesterday :

Copyright © Kakao Corp. All rights reserved.

티스토리툴바