I want to host some LLM’s locally and use more advanced models. Since new hardware is out of the question, I think I should be able to pull something off buying some yesteryear equipment on ebay etc. Did anybody attempt such a project? Does it scale horizontally? (I.e. can I connext two boxes to overcome single box slowness?)


Which is quite impressive to be honest, these machines were fucking expensive, and yet sold out completely
512 gb would future proof you to run any local LLMs for quite a while. The speed at which they did it wasn’t exactly bad too afaik due to uram being so fast. Dunno what other setup would compete here for the price.