Research

For a potentially more up-to-date list of publications, see Semantic Scholar.

An approach to technical AGI safety and security

Rohin Shah and Alex Irpan and Alexander Matt Turner and Anna Wang and Arthur Conmy and David Lindner and Jonah Brown-Cohen and Lewis Ho and Neel Nanda and Raluca Ada Popa and Rishub Jain and Rory Greig and Samuel Albanie and Scott Emmons and Sebastian Farquhar and Sébastien Krier and Senthooran Rajamanoharan and Sophie Bridgers and Tobi Ijitoye and Tom Everitt and Victoria Krakovna and Vikrant Varma and Vladimir Mikulik and Zachary Kenton and Dave Orr and Shane Legg and Noah Goodman and Allan Dafoe and Four Flynn and Anca Dragan
arXiv preprint arXiv:2504.01849, 2025
paper

ZeroBench: An impossible visual benchmark for contemporary large multimodal models

Jonathan Roberts and Mohammad Reza Taesiri and Ansh Sharma and Akash Gupta and Samuel Roberts and Ioana Croitoru and Simion-Vlad Bogolin and Jialu Tang and Florian Langer and Vyas Raina and Vatsal Raina and Hanyi Xiong and Vishaal Udandarao and Jingyi Lu and Shiyang Chen and Sam Purkis and Tianshuo Yan and Wenye Lin and Gyungin Shin and Qiaochu Yang and Anh Totti Nguyen and David I. Atkinson and Aaditya Baranwal and Alexandru Coca and Mikah Dang and Sebastian Dziadzio and Jakob D. Kunz and Kaiqu Liang and Alexander Lo and Brian Pulfer and Steven Walton and Charig Yang and Kai Han and Samuel Albanie
arXiv preprint arXiv:2502.09696, 2025
paper, project page

Humanity's Last Exam

Long Phan and Alice Gatti and Ziwen Han and Nathaniel Li and Josephina Hu and Hugh Zhang and Chen Bo Calvin Zhang and Mohamed Shaaban and John Ling and Sean Shi and others
arXiv preprint arXiv:2501.14249, 2025
paper

On scalable oversight with weak LLMs judging strong LLMs

Zachary Kenton and Noah Y Siegel and János Kramár and Jonah Brown-Cohen and Samuel Albanie and Jannis Bulian and Rishabh Agarwal and David Lindner and Yunhao Tang and Noah D Goodman and Rohin Shah
Advances in Neural Information Processing Systems (NeurIPS), 2024
paper, coverage
Featured in Quanta Magazine.

Moment Detection in Long Tutorial Videos

Ioana Croitoru and Simion-Vlad Bogolin and Samuel Albanie and Yang Liu and Zhaowen Wang and Seunghyun Yoon and Hailin Jin and Trung Bui
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
paper, code

Crosslingual Generalization through Multitask Finetuning

Niklas Muennighoff and Thomas Wang and Lintang Sutawika and Adam Roberts and Stella Biderman and Teven Le Scao and M Saiful Bari and Sheng Shen and Zheng-Xin Yong and Hailey Schoelkopf and Xiangru Tang and Dragomir Radev and Alham Fikri Aji and Khalid Almubarak and Samuel Albanie and Zaid Alyafeai and Albert Webson and Edward Raff and Colin Raffel
arXiv preprint arXiv:2211.01786, 2022
paper, code, video summary