What's Hot

    Why these strategists say 45% of portfolios needs to be invested in gold, metals and bitcoin | Invesloan.com

    April 22, 2026

    Why Lockheed Martin’s $2 Trillion F-35 Program Is so Expensive | Invesloan.com

    April 22, 2026

    Walz indicators invoice permitting Minnesota nursing houses to serve alcohol sans license | Invesloan.com

    April 21, 2026
    Facebook Twitter Instagram
    Finance Pro
    Facebook Twitter Instagram
    invesloan.cominvesloan.com
    Subscribe for Alerts
    • Home
    • News
    • Politics
    • Money
    • Personal Finance
    • Business
    • Economy
    • Investing
    • Markets
      • Stocks
      • Futures & Commodities
      • Crypto
      • Forex
    • Technology
    invesloan.cominvesloan.com
    Home » AI Has Already Run Out of Training Data, Goldman’s Data Chief Says | Invesloan.com
    Money

    AI Has Already Run Out of Training Data, Goldman’s Data Chief Says | Invesloan.com

    October 1, 2025
    Share
    Facebook Twitter LinkedIn Pinterest Email

    The meteoric rise of artificial intelligence may appear unstoppable — but it’s facing a shortage of training data.

    “We’ve already run out of data,” Neema Raphael, Goldman Sachs’ chief data officer and head of data engineering, said on the bank’s “Exchanges” podcast published on Tuesday.

    Raphael said that this shortage may already be influencing how new AI systems are built.

    He pointed to China’s DeepSeek as an example, saying one hypothesis for its purported development costs came from training on the outputs of existing models rather than entirely new data.

    “I think the real interesting thing is going to be how previous models then shape what the next iteration of the world is going to look like in this way,” Raphael said.

    With the web tapped out, developers are turning to synthetic data — machine-generated text, images, and code. That approach offers limitless supply, but also risks overwhelming models with low-quality output or AI slop.

    However, Raphael said he doesn’t think the lack of fresh data will be a massive constraint, in part because companies are sitting on untapped reserves of information.

    “I think from a consumer world model, I think it’s interesting we’ve definitely in the synthetic sort of explosion of data. But from an enterprise perspective, I think there’s still a lot of juice I’d say to be squeezed in that,” he said.

    That means the real frontier may not be the open internet, but the proprietary datasets held by corporations. From trading flows to client interactions, firms like Goldman sit on information that could make AI tools far more valuable if harnessed correctly.

    Raphael’s comments come as the industry grapples with “peak data” since the breakout of ChatGPT three years ago.

    Related stories

    Business Insider tells the innovative stories you want to know

    Business Insider tells the innovative stories you want to know

    In January, OpenAI cofounder Ilya Sutskever said at a conference that all the useful data online had already been used to train models, warning that AI’s era of rapid development “will unquestionably end.”

    The next frontier: proprietary data

    For businesses, Raphael stressed, the obstacle isn’t just finding more data — it’s ensuring that the data is usable.

    “The challenge is understanding the data, understanding the business context of the data, and then being able to normalize it in a way that makes sense for the business to consume it,” he said.

    Still, Raphael suggested that heavy reliance on synthetic data raises a deeper question about AI’s trajectory. “I think what might be interesting is people might think there might be a creative plateau,” he said.

    He wondered what would happen if models keep training only on machine-generated content.

    “If all of the data is synthetically generated, then how much human data could then be incorporated?” he said.

    “I think that’ll be an interesting thing to watch from a philosophical perspective,” he added.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Keep Reading

    Why Lockheed Martin’s $2 Trillion F-35 Program Is so Expensive | Invesloan.com

    Zoe Saldaña, 47, Says Her Beauty Routine Has Gotten Simpler With Age | Invesloan.com

    Inside the AI Coding Startup Elon Musk Is Betting $60 Billion on | Invesloan.com

    Demi Moore Says She’s Become ‘Intentional’ About Her Nighttime Routine | Invesloan.com

    Jump Trading Quant Yiming Zhang to Get External Deal From Millennium | Invesloan.com

    OpenAI’s Unveils a ChatGPT Image Generator That Can Crawl the Web | Invesloan.com

    Sam Altman Opens up About the Molotov Cocktail Attack on His Home | Invesloan.com

    How FedEx, UPS, and DHL Plan to Refund Customers for Tariffs | Invesloan.com

    Tim Cook Didn’t Invent the iPhone. however He Made Me Love It. | Invesloan.com

    LATEST NEWS

    Why these strategists say 45% of portfolios needs to be invested in gold, metals and bitcoin | Invesloan.com

    April 22, 2026

    Why Lockheed Martin’s $2 Trillion F-35 Program Is so Expensive | Invesloan.com

    April 22, 2026

    Walz indicators invoice permitting Minnesota nursing houses to serve alcohol sans license | Invesloan.com

    April 21, 2026

    Zoe Saldaña, 47, Says Her Beauty Routine Has Gotten Simpler With Age | Invesloan.com

    April 21, 2026
    POPULAR

    China’s first passenger jet completes maiden commercial flight

    May 28, 2023

    Numbers taking US accountancy exams drop to lowest level in 17 years

    May 29, 2023

    Toyota chair faces removal vote over governance issues

    May 29, 2023
    Advertisement
    Load WordPress Sites in as fast as 37ms!
    Facebook Twitter Pinterest WhatsApp Instagram
    © 2007-2023 Invesloan.com All Rights Reserved.
    • Privacy
    • Terms
    • Press Release
    • Advertise
    • Contact

    Type above and press Enter to search. Press Esc to cancel.

    invesloan.com
    Manage Cookie Consent
    To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}