• tal@lemmy.today
    link
    fedilink
    English
    arrow-up
    5
    ·
    2 hours ago

    GPU prices are coming to earth

    https://lemmy.today/post/42588975

    Nvidia reportedly no longer supplying VRAM to its GPU board partners in response to memory crunch — rumor claims vendors will only get the die, forced to source memory on their own

    If that’s true, I doubt that they’re going to be coming to earth for long.

  • ffhein@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    3 hours ago

    … I was thinking about buying a 96GB DDR5 kit from the local computer store a few weeks ago, but wasn’t sure it was actually worth €700. Checked again now and the exact same product costs €1500. I guess that settles it, 32GB will have to be enough for the next couple of years then.

    • panda_abyss@lemmy.ca
      link
      fedilink
      English
      arrow-up
      3
      ·
      6 hours ago

      I hope this is the beginning of the end for the cuda monopoly. I just want good gpgpu support for numerical code.

  • Jeena@piefed.jeena.net
    link
    fedilink
    English
    arrow-up
    23
    arrow-down
    1
    ·
    10 hours ago

    This is very unfortunate, about a year ago I built my PC and only put in 32 GB of Ram, It was double I had on my laptop so I thought it should be enough for the beginning and I could buy more later.

    Already after 2 months I realizes I can do so much more because of the fast CPU in parallel but suddenly the amount of RAM became the bottleneck. When I looked at the RAM prices it didn’t seem quite worth it and I waited. But that backfired because since then the prices never went down, only up.

    • NotSteve_@piefed.ca
      link
      fedilink
      English
      arrow-up
      24
      ·
      8 hours ago

      What are you running that needs more than 32Gb? I’m only just barely being bottlenecked by my 24Gb when running games at 4k

      • Jeena@piefed.jeena.net
        link
        fedilink
        English
        arrow-up
        5
        ·
        7 hours ago

        Two browsers full of tabs but that is not a problem, but once I start compiling AOSP (which I sometimes want to do for work at home instead in the cloud because it’s easier and faster to debugg) then it eats up all the RAM imediatelly and I have to give it 40 more GB or swap and then this swapping is the bottleneck. Once that is running the computer can’t really do anything else, even the browser struggles.

        • usernamesAreTricky@lemmy.ml
          link
          fedilink
          English
          arrow-up
          5
          ·
          4 hours ago

          Have you tried just compiling it with fewer threads? Would almost certainly reduce the RAM usage, and might even make the compile go faster if it you’re needing to swap that heavily

      • hoshikarakitaridia@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        8 hours ago

        AI or servers probably. I have 40gb and that’s what I would need more ram for.

        I’m still salty because I had the idea of going cpu & ram sticks for AI inference literally days before the big AI companies. And my stupid ass didn’t buy them in time before the prices skyrocketed. Fuck me I guess.

        • NotMyOldRedditName@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          edit-2
          8 hours ago

          It does work, but it’s not really fast. I upgraded to 96gb ddr4 from 32gb a year or so ago, and being able to play with the bigger models was fun, but it’s not something I could do anything productive with it was so slow.

          • tal@lemmy.today
            link
            fedilink
            English
            arrow-up
            3
            ·
            edit-2
            8 hours ago

            You can have applications where wall clock tine time is not all that critical but large model size is valuable, or where a model is very sparse, so does little computation relative to the size of the model, but for the major applications, like today’s generative AI chatbots, I think that that’s correct.

            • NotMyOldRedditName@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              ·
              edit-2
              7 hours ago

              Ya, that’s fair. If I was doing something I didn’t care about time on, it did work. And we weren’t talking hours, it it could be many minutes though.

        • panda_abyss@lemmy.ca
          link
          fedilink
          English
          arrow-up
          1
          ·
          6 hours ago

          I’m often using 100gb of cram for ai.

          Earlier this year I was going to buy a bunch of 1tb ram used servers and I wish I had.

    • Jeena@piefed.jeena.net
      link
      fedilink
      English
      arrow-up
      7
      ·
      7 hours ago

      I just had a look, 2nd of April I payed 67,000 KRW for one 16 GB stick, now the same one (XPG DDR5 PC5-48000 CL30 LANCER BLADE White), they only sell them in pairs, a pair costs 470,000 KRW in the same shop, so 235,000 KRW per 16 GB stick. That is a price increase of 250%, god damn.

    • tal@lemmy.today
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      8 hours ago

      Last I looked, a few days ago on Google Shopping, you could still find some retailers that had stock of DDR5 (I was looking at 2x16GB, and you may want more than that) and hadn’t jacked their prices up, but if you’re going to buy, I would not wait longer, because if they haven’t been cleaned out by now, I expect that they will be soon.