Wondering about services to test on either a 16 GB RAM “AI capable” ARM64 board or on a laptop with a modern RTX GPU. Only looking for open-source options, but curious to hear what people say. Cheers!

  • y0shi@lemm.ee · 14 hours ago

    I’ve got an old gaming PC with a decent GPU lying around and I’ve thought of doing that (I currently use it for Linux gaming and GPU-related tasks like photo editing). However, I’m currently stuck running LLMs on demand locally with Ollama. The energy cost of keeping it powered on all the time for on-demand queries seems a bit overkill to me…
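
    One way around that idle power draw might be Wake-on-LAN: leave the box powered off and wake it right before sending a query. A minimal sketch, assuming WoL is enabled in the BIOS/NIC (the MAC address is a placeholder):

    ```python
    # Broadcast a Wake-on-LAN "magic packet": 6 bytes of 0xFF followed by
    # the target MAC address repeated 16 times, sent over UDP broadcast.
    import socket

    def wake(mac: str, broadcast: str = "255.255.255.255", port: int = 9) -> None:
        payload = bytes.fromhex("FF" * 6 + mac.replace(":", "") * 16)
        with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as s:
            s.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
            s.sendto(payload, (broadcast, port))

    wake("aa:bb:cc:dd:ee:ff")  # placeholder: the gaming PC's real MAC goes here
    ```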

    • pezhore@infosec.pub · 13 hours ago

      I put my Plex media server to work running Ollama; it has a GPU for transcoding that’s not awful for simple LLMs.
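
      If you want other machines on the network to use it, Ollama exposes an HTTP API on port 11434 by default. A rough sketch with Python’s stdlib (the hostname and model name are placeholders for whatever you actually run):

      ```python
      # Send one prompt to Ollama's /api/generate endpoint and print the reply.
      import json
      import urllib.request

      req = urllib.request.Request(
          "http://plex-box:11434/api/generate",  # placeholder hostname
          data=json.dumps({
              "model": "llama3.2",  # any model previously fetched with `ollama pull`
              "prompt": "Why are transcoding GPUs adequate for small LLMs?",
              "stream": False,      # return one JSON object instead of a token stream
          }).encode(),
          headers={"Content-Type": "application/json"},
      )
      with urllib.request.urlopen(req) as resp:
          print(json.loads(resp.read())["response"])
      ```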

      • y0shi@lemm.ee · 13 hours ago

        That sounds like a great way of leveraging existing infrastructure! I host Plex together with other services on a server with an Intel CPU capable of hardware transcoding. I’m quite sure I would get much better performance on the GPU machine, so I might end up following this path!

    • kiol@lemmy.world (OP) · 13 hours ago

      Have to agree on that. It certainly only makes sense to have it powered up when you’re actually using it.
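
      A rough way to automate that, assuming an NVIDIA GPU and systemd (the thresholds here are arbitrary): watch GPU utilization and suspend once it has been idle for a while, then rely on Wake-on-LAN to bring the box back for the next query.

      ```python
      # Idle watchdog sketch: suspend the machine after the GPU has been
      # idle for IDLE_LIMIT seconds. Polls nvidia-smi once a minute.
      import subprocess
      import time

      IDLE_LIMIT = 15 * 60  # seconds of continuous idleness before suspending
      idle_since = time.monotonic()

      while True:
          util = int(subprocess.check_output(
              ["nvidia-smi", "--query-gpu=utilization.gpu",
               "--format=csv,noheader,nounits"]).split()[0])
          if util > 5:  # any real GPU work resets the idle timer
              idle_since = time.monotonic()
          elif time.monotonic() - idle_since > IDLE_LIMIT:
              subprocess.run(["systemctl", "suspend"])
              break
          time.sleep(60)
      ```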