PR 3313 has been merged with this commit
This is pretty great for those looking for performant llama.cpp/falcon/mpt offloading, and just in general a good CPU inference tool, wanting to use text-generation-webui
For those unaware, this is the ctransformers repo
And for anyone looking for an updated docker image, I have provided an image here on dockerhub
and as always my git repo with instructions can be found here on github
Happy inferencing :)
You must log in or register to comment.