DeepMind AI Expert
14.1K subscribers
1.17K photos
328 videos
111 files
2.07K links
مقالات کاربردی هوش مصنوعی در پایتون، علوم پزشکی، علوم انسانی، علوم اعصاب و...
دوره های آموزشی از دانشگاه های بزرگ و موسسات انلاین

پژوهشگران هوش مصنوعی ایران

تبادلات پیام بدید
加入频道
Self-Taught Evaluators
🔄 Self-Taught Evaluators

This research paper explores the development of self-taught language model evaluators. Instead of relying on costly human annotations, this approach utilizes synthetic data generated by the model itself. The method iteratively trains an LLM-as-a-Judge by creating contrasting response pairs, generating reasoning traces, and fine-tuning the model on this synthetic data. The research demonstrates that this method significantly improves the accuracy of the evaluator on benchmarks like RewardBench, achieving performance comparable to reward models trained with labeled examples. The authors also explore various data sources, ablations, and analyses to understand the effectiveness of the proposed approach.

📎 Link to paper
🌐 Link to their tweet

#LLM_Evaluation #Syntethic_Data #Reward_Model

@LlamaCast
👍4👌1