• morrowind@lemmy.mlOP
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      11
      ·
      8 个月前

      The MS implementations won’t, but once they build the capability, we can make our own

          • Possibly linux@lemmy.zip
            link
            fedilink
            English
            arrow-up
            2
            ·
            edit-2
            8 个月前

            On which platform?

            Basically you need three things. You need the ollama software, a LLM model such as mistral and a front end like openwebui.

            Ollama is pretty much just a daemon that has a web api apps can use to query LLMs.

            • kaboom36@ani.social
              link
              fedilink
              English
              arrow-up
              1
              ·
              8 个月前

              Linux, specifically nobara (a gaming focused fedora distro) for me

              Do you have any guides you would recommend?

              • Possibly linux@lemmy.zip
                link
                fedilink
                English
                arrow-up
                2
                ·
                8 个月前

                Actually it is pretty easy. You can either run it in a VM or you can run it in podman.

                For a VM, you could install virtual manager and then Debian. From there you need to of course do the normal setup of SSH and disable the root login.

                Once you have a Debian VM you can install ollama and pull down llava and mistral. Make sure you give the VM plenty of resources including almost all cores and 8gb of ram. To setup ollama you can follow the guides

                Once you have ollama working you can then setup openwebui. I had to use network: host with the ollama environment variable pointed to 127.0.0.1 (loopback)

                Once that’s done you should be able to access it at the IP of the VM port 8080. The first time it runs you need to click create account.

                Keep in mind that a blank screen means that it can’t reach ollama.

                The alternative setup to this would be podman. You theoretically could create a ollama container and a openwebui container. They would need to be attached to the same internal network. It probably would be simpler to run but I haven’t tried it.