GPT-5: Overdue, overhyped and underwhelming. And that’s not the worst of it.
Date: August 9th, 2025 8:02 PM Author: Bearded lodge skinny woman
"Well, I'd like to see the frontier models wriggle out of THIS jam!"
*frontier models wriggle their way out of the jam easily*
"Ah! Well. Nevertheless,"
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2).#49170610) |
Date: August 10th, 2025 12:16 PM Author: .,,,.,.,.,.,.,,,,..,,..,.,.,.,
The problem is he won’t precisely define a class of problems that the models won’t be able to solve with more training. He finds new ones with every model iteration and insists it’s a flawed approach even as the overall error rate goes down significantly. I don’t think the current approach will yield AGI but there is very likely an ML approach that will.
GPT-5 is in some ways underwhelming (if you expected a GPT-3 to GPT-4 level leap), but it’s roughly consistent with known training capacity and the short time since o3 was released. The model’s training compute is likely around 10x that of GPT-4, rather than the 100x jump from GPT-3 to GPT-4. As the larger data centers come online, further model progress is inevitable even without architectural improvements.
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2).#49171733) |
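A rough way to read the 10x vs. 100x compute comparison in the post above: if loss followed a simple power law in training compute, every 10x step would buy the same multiplicative improvement, so a 100x jump would equal two such steps. The sketch below only illustrates that arithmetic. The power-law form, the function name `relative_loss`, and the exponent `alpha = 0.05` are assumptions made for illustration, not figures from the thread or from any lab.

```python
def relative_loss(compute_multiplier: float, alpha: float = 0.05) -> float:
    """Loss after scaling compute, as a fraction of the loss before scaling.

    Assumes loss is proportional to compute ** (-alpha). Both the power-law
    form and the default alpha are hypothetical, chosen only to illustrate.
    """
    if compute_multiplier <= 0:
        raise ValueError("compute_multiplier must be positive")
    return compute_multiplier ** (-alpha)


if __name__ == "__main__":
    for multiplier in (10, 100):
        fraction = relative_loss(multiplier)
        print(f"{multiplier}x compute -> loss is {fraction:.3f} of baseline")
```

Under this assumption, `relative_loss(100)` equals `relative_loss(10) ** 2`, which is one way to see why a 10x generation can feel like a smaller leap than a 100x one even if the underlying trend has not changed.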
Date: August 9th, 2025 7:59 PM Author: Bearded lodge skinny woman
It's actually quite good. They appear to have fixed the issue where you got aggressively relegated to older shittier models.
It's not way better than the other newest reasoning models, but it's definitely better. It burns a lot of tokens though. Its answers are longer than they should be in a lot of cases.
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2).#49170598) |
Date: August 10th, 2025 12:06 PM Author: Smoker
If you have to give it an exhaustive list of considerations and instructions, then it starts to lose its value relatively quickly. How many of us have just done something ourselves vs. giving it to a junior for that exact reason? And that’s about how it feels: like a really good junior, but one that autistically stumbles into a brilliant observation every now and then and that has encyclopedia-level powers, etc.
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2).#49171710)
Date: August 10th, 2025 1:44 PM Author: ,.,....,...,,,..,..,.,..,.,.,.,.
it turns out diffusion models are significantly more data efficient than the autoregressive models that are being used by all the major labs. >3x better data efficiency and this is only one possible improvement. the problem with the "AI is hitting a wall" theory is that there are many ways to use additional compute to improve model performance and the field is still too new to make strong conclusions of this sort.
https://jinjieni.notion.site/Diffusion-Language-Models-are-Super-Data-Learners-239d8f03a866800ab196e49928c019ac
(http://www.autoadmit.com/thread.php?thread_id=5760544&forum_id=2).#49171990) |