On scalable oversight with weak LLMs judging strong LLMs
Zachary Kenton and Noah Y Siegel and János Kramár and Jonah Brown-Cohen and Samuel Albanie and Jannis Bulian and Rishabh Agarwal and David Lindner and Yunhao Tang and Noah D Goodman and Rohin Shah, Advances in Neural Information Processing Systems (NeurIPS), 2024
Featured in quanta magazine.
paper
@article{Kenton2024scalable, author = "Kenton, Zachary and Siegel, Noah Y and Kram{\'a}r, J{\'a}nos and Brown-Cohen, Jonah and Albanie, Samuel and Bulian, Jannis and Agarwal, Rishabh and Lindner, David and Tang, Yunhao and Goodman, Noah D and Shah, Rohin", title = "On scalable oversight with weak LLMs judging strong LLMs", year = "2024", month = "December", booktitle = "Advances in Neural Information Processing Systems (NeurIPS)" }