What's Hot

    I’m promoting my $1 million residence. Will my agent actually cost lower than a 6% fee? | Invesloan.com

    May 21, 2026

    SpaceX’s Investor Pitch Reads Like a Sci-Fi Manifesto | Invesloan.com

    May 21, 2026

    Japan Economic Update: Exports drive shock commerce surplus;core order | Invesloan.com

    May 20, 2026
    Facebook Twitter Instagram
    Finance Pro
    Facebook Twitter Instagram
    invesloan.cominvesloan.com
    Subscribe for Alerts
    • Home
    • News
    • Politics
    • Money
    • Personal Finance
    • Business
    • Economy
    • Investing
    • Markets
      • Stocks
      • Futures & Commodities
      • Crypto
      • Forex
    • Technology
    invesloan.cominvesloan.com
    Home » AI Has Already Run Out of Training Data, Goldman’s Data Chief Says | Invesloan.com
    Money

    AI Has Already Run Out of Training Data, Goldman’s Data Chief Says | Invesloan.com

    October 1, 2025
    Share
    Facebook Twitter LinkedIn Pinterest Email

    The meteoric rise of artificial intelligence may appear unstoppable — but it’s facing a shortage of training data.

    “We’ve already run out of data,” Neema Raphael, Goldman Sachs’ chief data officer and head of data engineering, said on the bank’s “Exchanges” podcast published on Tuesday.

    Raphael said that this shortage may already be influencing how new AI systems are built.

    He pointed to China’s DeepSeek as an example, saying one hypothesis for its purported development costs came from training on the outputs of existing models rather than entirely new data.

    “I think the real interesting thing is going to be how previous models then shape what the next iteration of the world is going to look like in this way,” Raphael said.

    With the web tapped out, developers are turning to synthetic data — machine-generated text, images, and code. That approach offers limitless supply, but also risks overwhelming models with low-quality output or AI slop.

    However, Raphael said he doesn’t think the lack of fresh data will be a massive constraint, in part because companies are sitting on untapped reserves of information.

    “I think from a consumer world model, I think it’s interesting we’ve definitely in the synthetic sort of explosion of data. But from an enterprise perspective, I think there’s still a lot of juice I’d say to be squeezed in that,” he said.

    That means the real frontier may not be the open internet, but the proprietary datasets held by corporations. From trading flows to client interactions, firms like Goldman sit on information that could make AI tools far more valuable if harnessed correctly.

    Raphael’s comments come as the industry grapples with “peak data” since the breakout of ChatGPT three years ago.

    Related stories

    Business Insider tells the innovative stories you want to know

    Business Insider tells the innovative stories you want to know

    In January, OpenAI cofounder Ilya Sutskever said at a conference that all the useful data online had already been used to train models, warning that AI’s era of rapid development “will unquestionably end.”

    The next frontier: proprietary data

    For businesses, Raphael stressed, the obstacle isn’t just finding more data — it’s ensuring that the data is usable.

    “The challenge is understanding the data, understanding the business context of the data, and then being able to normalize it in a way that makes sense for the business to consume it,” he said.

    Still, Raphael suggested that heavy reliance on synthetic data raises a deeper question about AI’s trajectory. “I think what might be interesting is people might think there might be a creative plateau,” he said.

    He wondered what would happen if models keep training only on machine-generated content.

    “If all of the data is synthetically generated, then how much human data could then be incorporated?” he said.

    “I think that’ll be an interesting thing to watch from a philosophical perspective,” he added.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Keep Reading

    SpaceX’s Investor Pitch Reads Like a Sci-Fi Manifesto | Invesloan.com

    Elon Musk’s Plan to Keep Complete Control of SpaceX After Its Public | Invesloan.com

    SpaceX Warns That Grok’s NSFW AI Is Risky Business | Invesloan.com

    SpaceX IPO’s Could Create One of Venture Capital’s Largest Fortunes | Invesloan.com

    Anthropic Is Paying SpaceX $1.25 Billion a Month for AI Compute | Invesloan.com

    Vox Media CEO Tells Why He Sold Podcast Business to James Murdoch | Invesloan.com

    JetBlue Is Cutting These 11 Routes so It Can Focus More on Florida | Invesloan.com

    New Jersey City at Center of AI Data Center Boom Votes to Ban Them | Invesloan.com

    Countries With the Highest and Lowest Birth Rates, Ranked | Invesloan.com

    LATEST NEWS

    I’m promoting my $1 million residence. Will my agent actually cost lower than a 6% fee? | Invesloan.com

    May 21, 2026

    SpaceX’s Investor Pitch Reads Like a Sci-Fi Manifesto | Invesloan.com

    May 21, 2026

    Japan Economic Update: Exports drive shock commerce surplus;core order | Invesloan.com

    May 20, 2026

    Colorado Democrats censure Gov. Polis over Tina Peters sentence commutation | Invesloan.com

    May 20, 2026
    POPULAR

    China’s first passenger jet completes maiden commercial flight

    May 28, 2023

    Numbers taking US accountancy exams drop to lowest level in 17 years

    May 29, 2023

    Toyota chair faces removal vote over governance issues

    May 29, 2023
    Advertisement
    Load WordPress Sites in as fast as 37ms!
    Facebook Twitter Pinterest WhatsApp Instagram
    © 2007-2023 Invesloan.com All Rights Reserved.
    • Privacy
    • Terms
    • Press Release
    • Advertise
    • Contact

    Type above and press Enter to search. Press Esc to cancel.

    invesloan.com
    Manage Cookie Consent
    To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
    Functional Always active
    The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
    Preferences
    The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
    Statistics
    The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
    Marketing
    The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
    • Manage options
    • Manage services
    • Manage {vendor_count} vendors
    • Read more about these purposes
    View preferences
    • {title}
    • {title}
    • {title}