AI COMPANIES RUNNING OUT OF TRAINING DATA AFTER BURNING THROUGH ENTIRE INTERNET

While there are some companies, such as Dataology, which was formed by ex-Meta and Google DeepMind researcher Ari Morcos, looking into ways to train larger and smarter models with less data and resources, most big companies are looking into novel — and controversial — means of data training.
OpenAI, for instance, has per the WSJ’s sources discussed training GPT-5 on transcriptions from public YouTube videos — even as its own chief technology officer, Mira Murati, struggles to answer questions about whether its Sora video generator was trained using YouTube data.

[Read More…]

Skip to content