On scalable oversight with weak LLMs judging strong LLMs

Zachary Kenton and Noah Y Siegel and János Kramár and Jonah Brown-Cohen and Samuel Albanie and Jannis Bulian and Rishabh Agarwal and David Lindner and Yunhao Tang and Noah D Goodman and Rohin Shah, Advances in Neural Information Processing Systems (NeurIPS), 2024
Featured in Quanta Magazine.

@inproceedings{Kenton2024scalable,
    author = "Kenton, Zachary and Siegel, Noah Y and Kram{\'a}r, J{\'a}nos and Brown-Cohen, Jonah and Albanie, Samuel and Bulian, Jannis and Agarwal, Rishabh and Lindner, David and Tang, Yunhao and Goodman, Noah D and Shah, Rohin",
    title = "On scalable oversight with weak {LLMs} judging strong {LLMs}",
    year = "2024",
    month = "December",
    booktitle = "Advances in Neural Information Processing Systems (NeurIPS)"
}