Article gives the AI game away - generative models require human creativity to train on, and produce inferior approximations of their own.

AbbysMuscles [she/her] · 1 year ago

Article gives the AI game away - generative models require human creativity to train on, and produce inferior approximations of their own.

frankfurt_schoolgirl [she/her] · 1 year ago

I wonder how close GPT-4 and company are to having used every bit of writing in the English language as training data. Like, assuming you downloaded the entire content of social media sites, used every e-book you could find, and pulled in all the wikis, forums, news sites, and blogs that a web crawler could reach, the only real volume of writing that's left is private communications. At some point, Google or MS or another company that handles lots of communications will use every message ever sent through their systems as training data. But once you do that, you've run out.

More training data probably brings diminishing returns, so if GPT-4 has already used, like, 10% of all available writing, then maybe even with the other 90% it won't be good enough to do what people want. Maybe in the future companies will hire vast numbers of writers just to make good content that can be used to train the LLMs.

The AI feedback loop: Researchers warn of ‘model collapse’ as AI trains on AI-generated content