Hi all, I’d like to hear some suggestions on self-hosting LLMs on a remote server and accessing said LLM via a client app or a convenient website. I’d love to hear about your setups, or about products that left a good impression on you.

I’ve hosted Ollama before, but I don’t think it’s intended for remote use. Then again, I’m not really an expert; maybe there are add-ons or other ways to make it work.
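(For what it’s worth, Ollama can serve remote clients; it just binds to localhost by default. A minimal sketch, assuming a server named `my-server`, which is hypothetical here; the `OLLAMA_HOST` variable and default port 11434 are documented. Note that Ollama has no built-in authentication, so you’d want a reverse proxy with auth in front of it:)

```shell
# By default Ollama only listens on 127.0.0.1. Bind to all interfaces
# so other machines can reach it (11434 is the default API port):
OLLAMA_HOST=0.0.0.0:11434 ollama serve

# From a client machine, point the CLI at the remote server:
#   OLLAMA_HOST=http://my-server:11434 ollama run llama3
```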

Thanks in advance!

  • hendrik@palaver.p3x.de · 28 days ago

    What’s the difference for this task? You can rent it 24/7 as a crude webserver, or run a Linux desktop inside it; pretty much anything you could do with other kinds of servers. I don’t think the exact technology matters. It could be a VPS, a KVM virtual machine, or a container. And for AI workloads, containers have several advantages: you can spin them up within seconds, scale them, and so on. I mean, you’re right, this isn’t a bare-metal server that you’re renting, but I think it aligns well with OP’s requirements.
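    (As a concrete example of the container route: Ollama publishes an official Docker image, so the whole server boils down to one command. The GPU flag is an assumption about your host having the NVIDIA container toolkit; adjust for your runtime:)

    ```shell
    # Run Ollama in a container, persisting models in a named volume
    # and exposing its API port to the host.
    # --gpus=all requires the NVIDIA container toolkit on the host.
    docker run -d --gpus=all -v ollama:/root/.ollama \
        -p 11434:11434 --name ollama ollama/ollama
    ```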

      • ddh@lemmy.sdf.org · 24 days ago

        Running an LLM can certainly be an on-demand service. Apart from training, which I don’t think we are discussing, GPU compute is only used while responding to prompts.
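        (To make the on-demand point concrete: once Ollama is reachable over the network, any client can post to its `/api/generate` endpoint, and the GPU is only busy while a request is in flight. A minimal sketch; the hostname is hypothetical, but the endpoint and request fields are from Ollama’s documented HTTP API:)

        ```python
        import json

        # Hypothetical hostname; Ollama's HTTP API listens on port 11434 by default.
        OLLAMA_URL = "http://my-gpu-server:11434/api/generate"

        def build_generate_body(model: str, prompt: str) -> str:
            """JSON body for a one-shot (non-streaming) Ollama /api/generate call."""
            return json.dumps({"model": model, "prompt": prompt, "stream": False})

        body = build_generate_body("llama3", "Why is the sky blue?")
        print(body)
        ```

        (You can send that body with any HTTP client, e.g. `curl "$OLLAMA_URL" -d "$body"`; between requests, the GPU sits idle.)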