카테고리 없음

LLM-as-a-judge의 문제 관련 reference

jinuklee 2025. 2. 4. 17:25

https://aclanthology.org/2024.tacl-1.78/

 

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

Ryo Kamoi, Yusen Zhang, Nan Zhang, Jiawei Han, Rui Zhang. Transactions of the Association for Computational Linguistics, Volume 12. 2024.

aclanthology.org

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

 

https://arxiv.org/abs/2502.01534

Preference Leakage: A Contamination Problem in LLM-as-a-judge