카테고리 없음

RL on Incorrect Synthetic Data Scales theEfficiency of LLM Math Reasoning by Eight-Fold 논문리뷰

jinuklee 2024. 8. 18. 01:16

https://arxiv.org/pdf/2406.14532