I want to host some LLM’s locally and use more advanced models. Since new hardware is out of the question, I think I should be able to pull something off buying some yesteryear equipment on ebay etc. Did anybody attempt such a project? Does it scale horizontally? (I.e. can I connext two boxes to overcome single box slowness?)

  • B0rax@feddit.org
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 hours ago

    Which is quite impressive to be honest, these machines were fucking expensive, and yet sold out completely

    • Mika@piefed.ca
      link
      fedilink
      English
      arrow-up
      2
      ·
      3 hours ago

      512 gb would future proof you to run any local LLMs for quite a while. The speed at which they did it wasn’t exactly bad too afaik due to uram being so fast. Dunno what other setup would compete here for the price.