• brucethemoose@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    1 day ago

    Who the fuck buys a consumer GPU for AI?

    Plenty. Consumer GPU + CPU offloading is a pretty common way to run MoEs these days, and not everyone will drop $40K just to run Deepseek in CUDA instead of hitting an API or something.

    I can (just barely) run GLM-4.5 on a single 3090 desktop.

    • MystikIncarnate@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      20 hours ago

      … Yeah, for yourself.

      I’m referring to anyone running an LLM for commercial purposes.

      Y’know, 80% of Nvidia’s business?

      • brucethemoose@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        18 hours ago

        I’ve kinda lost this thread, but what does that have to do with consumer GPU market share? The servers are a totally separate category.

        I guess my original point was agreement: the 5000 series is not great for ‘AI’, not like everyone makes it out to be, to the point where folks who can’t drop $10K for a GPU are picking up older cards instead. But if you look at download stats for these models, there is interest in running stuff locally instead of ChatGPT, just like people are interested in internet free games, or Lemmy instead of Reddit.

        • MystikIncarnate@lemmy.ca
          link
          fedilink
          English
          arrow-up
          1
          ·
          12 hours ago

          The original post is about Nvidia’s domination of discrete GPUs, not consumer GPUs.

          So I’m not limiting myself to people running an LLM on their personal desktop.

          That’s what I was trying to get across.

          And it’s right on point for the original material.

          • brucethemoose@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            12 hours ago

            I’m not sure the bulk of datacenter cards count as ‘discrete GPUs’ anymore, and they aren’t counted in that survey. They’re generally sold socketed into 8P servers with crazy interconnects, hyper specialized to what they do. Nvidia does sell some repurposed gaming silicon as a ‘low end’ PCIe server card, but these don’t get a ton of use compared to the big silicon sales.

            • MystikIncarnate@lemmy.ca
              link
              fedilink
              English
              arrow-up
              1
              ·
              10 hours ago

              I wouldn’t be surprised in the slightest if they are included in the list. I dunno, I’m not the statistician who crunched the numbers here. I didn’t collect the data, and that source material is not available for me to examine.

              What I can say is that the article defines “discrete” GPUs instead of just “GPUs” to eliminate all the iGPUs. Because Intel dominates that space with AMD, but it’s hard to make an iGPU when you don’t make CPUs, and the two largest CPU manufacturers make their own iGPUs.

              The overall landscape of the GPU market is very different than what this data implies.

              • brucethemoose@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                edit-2
                8 hours ago

                Well, it’s no mystery:

                https://www.jonpeddie.com/news/q225-pc-graphics-add-in-board-shipments-increased-27-0-from-last-quarter/

                It’s specifically desktop addin boards:

                AMD’s RX 9070 XT and RX 9070 represent AMD’s new RDNA 4 architecture, competing with Nvidia’s midrange offerings. Nvidia introduced two new Blackwell-series AIBs: the GeForce RTX 5080 Super and the RTX 5070. The company also announced the RTX 500 workstation AIB. Rumors have persisted about two new AIBs from Intel, including a dual-GPU model.

                It is including workstation cards like the Blackwell Pro. But this is clearly not including server silicon like the B200, H200, MI325X and so on, otherwise they would have mentioned updates. They are not AIBs.

                I hate to obsess over such a distinction, but it’s important: server sales are not skewing this data, and workstation sales volumes are pretty low. It’s probably a accurate chart for gaming GPUs.