You would need to run the LLM on the system that has the GPU (your main PC). The front-end (typically a WebUI) could run in a Docker container and make API calls to your LLM system. Unfortunately, that requires the model to stay loaded in VRAM on your main PC, severely limiting what else you can do with that GPU.
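For a sense of what those API calls look like, here's a minimal sketch in Python, assuming the GPU machine runs an OpenAI-compatible server (e.g. llama.cpp's llama-server or Ollama) — the host address, port, and model name below are placeholders for whatever your own setup uses:

```python
import requests

# Address of the GPU machine running the LLM server (placeholder values --
# substitute your own host, port, and model name).
LLM_HOST = "http://192.168.1.50:8080"

def ask(prompt: str) -> str:
    """Send one chat request to the OpenAI-compatible endpoint and return the reply."""
    resp = requests.post(
        f"{LLM_HOST}/v1/chat/completions",
        json={
            "model": "llama-3-8b-instruct",  # whatever model the server has loaded
            "messages": [{"role": "user", "content": prompt}],
        },
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]

if __name__ == "__main__":
    print(ask("Hello from the WebUI container!"))
```

The WebUI container would make essentially the same request over the LAN, which is why the model has to stay resident in VRAM on the GPU box the whole time.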
Pretty cool. I wonder if this could be scaled up to a life-sized print? Maybe go-kart sized???
The STL files are $27 - not free, but I’m sure the designer put a ton of hours into this.