ZeroBench: An impossible visual benchmark for contemporary large multimodal models

Jonathan Roberts and Mohammad Reza Taesiri and Ansh Sharma and Akash Gupta and Samuel Roberts and Ioana Croitoru and Simion-Vlad Bogolin and Jialu Tang and Florian Langer and Vyas Raina and Vatsal Raina and Hanyi Xiong and Vishaal Udandarao and Jingyi Lu and Shiyang Chen and Sam Purkis and Tianshuo Yan and Wenye Lin and Gyungin Shin and Qiaochu Yang and Anh Totti Nguyen and David I. Atkinson and Aaditya Baranwal and Alexandru Coca and Mikah Dang and Sebastian Dziadzio and Jakob D. Kunz and Kaiqu Liang and Alexander Lo and Brian Pulfer and Steven Walton and Charig Yang and Kai Han and Samuel Albanie, arXiv preprint arXiv:2502.09696, 2025

@article{roberts2025zerobench,
    author = "Roberts, Jonathan and Taesiri, Mohammad Reza and Sharma, Ansh and Gupta, Akash and Roberts, Samuel and Croitoru, Ioana and Bogolin, Simion-Vlad and Tang, Jialu and Langer, Florian and Raina, Vyas and Raina, Vatsal and Xiong, Hanyi and Udandarao, Vishaal and Lu, Jingyi and Chen, Shiyang and Purkis, Sam and Yan, Tianshuo and Lin, Wenye and Shin, Gyungin and Yang, Qiaochu and Nguyen, Anh Totti and Atkinson, David I. and Baranwal, Aaditya and Coca, Alexandru and Dang, Mikah and Dziadzio, Sebastian and Kunz, Jakob D. and Liang, Kaiqu and Lo, Alexander and Pulfer, Brian and Walton, Steven and Yang, Charig and Han, Kai and Albanie, Samuel",
    title = "{ZeroBench}: An impossible visual benchmark for contemporary large multimodal models",
    year = "2025",
    month = "February",
    journal = "arXiv preprint arXiv:2502.09696"
}