US chip-maker Nvidia led a rout in tech stocks Monday after the emergence of a low-cost Chinese generative AI model that could threaten US dominance in the fast-growing industry.

The chatbot developed by DeepSeek, a startup based in the eastern Chinese city of Hangzhou, has apparently shown the ability to match the capacity of US AI pace-setters for a fraction of the investments made by American companies.

Shares in Nvidia, whose semiconductors power the AI industry, fell more than 15 percent in midday deals on Wall Street, erasing more than $500 billion of its market value.

The tech-rich Nasdaq index fell more than three percent.

AI players Microsoft and Google parent Alphabet were firmly in the red while Meta bucked the trend to trade in the green.

DeepSeek, whose chatbot became the top-rated free application on Apple’s US App Store, said it spent only $5.6 million developing its model – peanuts when compared with the billions US tech giants have poured into AI.

US “tech dominance is being challenged by China,” said Kathleen Brooks, research director at trading platform XTB.

The focus is now on whether China can do it better, quicker and more cost effectively than the US, and if they could win the AI race,” she said.

US venture capitalist Marc Andreessen has described DeepSeek’s emergence as a “Sputnik moment” – when the Soviet Union shocked Washington with its 1957 launch of a satellite into orbit.

As DeepSeek rattled markets, the startup on Monday said it was limiting the registration of new users due to “large-scale malicious attacks” on its services.

Meta and Microsoft are among the tech giants scheduled to report earnings later this week, offering opportunity for comment on the emergence of the Chinese company.

Shares in another US chip-maker, Broadcom, fell 16 percent while Dutch firm ASML, which makes the machines used to build semiconductors, saw its stock tumble 6.7 percent.

Investors have been forced to reconsider the outlook for capital expenditure and valuations given the threat of discount Chinese AI models,” David Morrison, senior market analyst at Trade Nation.

These appear to be as good, if not better, than US versions.”

Wall Street’s broad-based S&P 500 index shed 1.7 percent while the Dow was flat at midday.

In Europe, the Frankfurt and Paris stock exchanges closed in the red while London finish flat.

Asian stock markets mostly slid.

Just last week following his inauguration, Trump announced a $500 billion venture to build infrastructure for AI in the United States led by Japanese giant SoftBank and ChatGPT-maker OpenAI.

SoftBank tumbled more than eight percent in Tokyo on Monday while Japanese semiconductor firm Advantest was also down more than eight percent and Tokyo Electron off almost five percent.

  • Womble@lemmy.world
    link
    fedilink
    English
    arrow-up
    19
    ·
    edit-2
    3 days ago

    https://www.analyticsvidhya.com/blog/2024/12/deepseek-v3/

    Huh I guess 6 million USD is not millions eh? The innovation is it’s comparatively cheap to train, compared to the billions OpenAI et al are spending (and that is with acquiring thousands of H800s not included in the cost).

    Edit: just realised that was for the wrong model! but r1 was trained in the same budget https://x.com/GavinSBaker/status/1883891311473782995?mx=2

    • UnderpantsWeevil@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      16
      ·
      edit-2
      3 days ago

      The innovation is it’s comparatively cheap to train, compared to the billions

      Smaller builds with less comprehensive datasets take less time and money. Again, this doesn’t have to be encyclopedic. You can train your model entirely on a small sample of material detailing historical events in and around Beijing in 1989 if you are exclusively fixated on getting results back about Tienanmen Square.

      • Womble@lemmy.world
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        1
        ·
        edit-2
        3 days ago

        Oh, by the way, as to your theory of “maybe it just doesnt know about Tiananmen, its not an encyclopedia”…

        • Dhs92@programming.dev
          link
          fedilink
          arrow-up
          1
          ·
          3 days ago

          I don’t think I’ve seen that internal dialog before with LLMs. Do you get that with most models when running using ollama?

          • Womble@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            ·
            edit-2
            3 days ago

            No it’s not a feature of ollama, thats the innovation of the “chain of thought” models like OpenAI’s o1 and now this deepseek model, it narrates an internal dialogue first in order to try and create more consistent answers. It isnt perfect but it helps it do things like logical reasoning at the cost of taking a lot longer to get to the answer.

      • Womble@lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        ·
        3 days ago

        Ok sure, as I said before I am grateful that they have done this and open sourced it. But it is still deliberately politically censored, and no “Just train your own bro” is not a reasonable reply to that.

        • Rai@lemmy.dbzer0.com
          link
          fedilink
          arrow-up
          4
          ·
          3 days ago

          They know less than I do about LLMs of that’s something they think you can just DO… and that’s saying a lot.