\
  The most prestigious law school admissions discussion board in the world.
BackRefresh Options Favorite

gpt 5.5 beat fable 5 on new "agents last exam" benchmark

Yuuuge upset I hear but honestly not surprising to me. Anthr...
The Penis
  06/19/26
but which one will help MiG self-actualize and reunite with ...
Howard Nutlick's demonic giggle
  06/19/26


Poast new message in this thread



Reply Favorite

Date: June 19th, 2026 7:19 PM
Author: The Penis

Yuuuge upset I hear but honestly not surprising to me. Anthropic benchmarkmaxxes so hard especially overfitting to SWE benchmarks, and then additionally relies on religious capture of the reddit demographic. I already thought gpt 5.5 codex extra high seemed far beyond claude code on opus models in ways not fully captured by current benchmarks

(http://www.autoadmit.com/thread.php?thread_id=5875728&forum_id=2]#49950088)



Reply Favorite

Date: June 19th, 2026 7:24 PM
Author: Howard Nutlick's demonic giggle

but which one will help MiG self-actualize and reunite with Christ

(http://www.autoadmit.com/thread.php?thread_id=5875728&forum_id=2]#49950095)