or is it just bean counters optimizing enshittification and monetization of a previously free product? oh it's certainly the former
Unproven hypothesis seeks to explain ChatGPT's seemingly new reluctance to do hard work.
In late November, some ChatGPT users began to notice that ChatGPT-4 was becoming more "lazy," reportedly refusing to do some tasks or returning simplified results. Since then, OpenAI has admitted that it's an issue, but the company isn't sure why. The answer may be what some are calling the "winter break hypothesis." While unproven, the fact that AI researchers are taking it seriously shows how weird the world of AI language models has become.
On Monday, a developer named Rob Lynch announced on X that he had tested GPT-4 Turbo through the API over the weekend and found shorter completions when the model is fed a December date (4,086 characters) than when fed a May date (4,298 characters). Lynch claimed the results were statistically significant.
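For anyone who wants to sanity-check a claim like this, here is a minimal sketch of one way to compare two sets of completion lengths: a two-sided permutation test on the difference of means. This is not necessarily the test Lynch ran (the excerpt doesn't say which one he used). The function name `permutation_test` and its parameters are made up for illustration, and you would have to collect the completion lengths yourself.

```python
import random
from statistics import mean


def permutation_test(a, b, n_resamples=10_000, seed=0):
    """Two-sided permutation test on the difference of means.

    a, b: sequences of numbers, e.g. completion lengths in characters
    (hypothetical inputs; nothing here talks to any API).
    Returns (observed difference of means, estimated p-value).
    """
    rng = random.Random(seed)  # fixed seed so reruns give the same estimate
    observed = mean(a) - mean(b)
    pooled = list(a) + list(b)
    n_a = len(a)
    extreme = 0
    for _ in range(n_resamples):
        # Shuffle the group labels and recompute the difference.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if abs(diff) >= abs(observed):
            extreme += 1
    # Add-one correction so the estimate is never exactly zero.
    p_value = (extreme + 1) / (n_resamples + 1)
    return observed, p_value
```

You would call it as `permutation_test(may_lengths, december_lengths)`, where both lists hold one length per completion. A small p-value only says the gap is unlikely under random relabeling; it says nothing about why the gap exists.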
AI researchers are taking it seriously
Half these guys are religious fruitcakes worshipping the mean scary future computer and the other half are pulling their hair out trying to get their colleagues to stop deifying the random number generator.
found shorter completions when the model is fed a December date (4,086 characters) than when fed a May date (4,298 characters).
Duh, the longer you let it run the more data it has. Why wouldn’t the newer version be better? /s
Me, shaking, terrified: COMPUTER! I COMMAND YOU!! DO AS I SAY!!
The demon in the box:
Large language model AIs are so volatile and unreliable that a previous random update made the model unlearn simple math, the one thing computers are supposed to be good at.
It's almost like if you feed an algorithm garbage data in, it gets garbage data out, but that couldn't be it, no way, they're techbro geniuses, far too smart to make that mistake!