I want to host some LLMs locally and use more advanced models. Since new hardware is out of the question, I think I should be able to pull something off by buying some yesteryear equipment on eBay etc. Has anybody attempted such a project? Does it scale horizontally? (I.e. can I connect two boxes to overcome single-box slowness?)

  • Mika@piefed.ca
    23 hours ago

    I’ve just checked the Mac Studio on the site and lmao, they first ran out of the 512 GB unified RAM (uram) config, then the 256 GB one, and now they only sell the 96 GB version.

    • B0rax@feddit.org
      3 hours ago

      Which is quite impressive, to be honest. These machines were fucking expensive, and yet they sold out completely.

      • Mika@piefed.ca
        1 hour ago

        512 GB would future-proof you to run any local LLM for quite a while. The speed at which these machines run them isn’t exactly bad either, afaik, because the uram is so fast. Dunno what other setup would compete here for the price.
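
For anyone who wants to sanity-check the memory sizes mentioned above, here is a rough back-of-the-envelope sketch. It only uses the standard estimate that weights take roughly (parameter count × bits per weight / 8) bytes. The function names, the 1.2× overhead factor for KV cache and runtime, and the 70B / 405B parameter counts are all assumptions for illustration, not measurements of any particular model or machine.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes).

    params_billion * 1e9 weights * (bits / 8) bytes each, divided by 1e9.
    """
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float, overhead: float = 1.2) -> bool:
    """Guess whether a model fits in memory_gb.

    overhead is an assumed fudge factor for KV cache and runtime buffers;
    real usage depends on context length and the inference software.
    """
    return weights_gb(params_billion, bits_per_weight) * overhead <= memory_gb


if __name__ == "__main__":
    # Hypothetical model sizes (billions of parameters) and quantization levels.
    for params in (70, 405):
        for bits in (4, 8, 16):
            size = weights_gb(params, bits)
            for mem in (96, 256, 512):
                verdict = "fits" if fits(params, bits, mem) else "too big"
                print(f"{params}B @ {bits}-bit ~ {size:.0f} GB weights, {mem} GB box: {verdict}")
```

Under these assumptions a 4-bit 70B model needs about 35 GB for weights and sits comfortably in the 96 GB version, while a 405B model at 8-bit only fits in the 512 GB configuration.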