Leaderboard scores can be misleading, since they leave out other factors worth considering:

  • Censorship: Is the model censored?
  • Verbosity: How concise is the output?
  • Intelligence: Does the model know what it is talking about?
  • Hallucination: How often does the model make up facts?
  • Domain Knowledge: What does the model specialize in?
  • Size: Which models are best at each size (70b, 30b, 7b)?

And much more! What models do you use, and which would you recommend to everyone?

The model that has caught my attention the most is the original 65b Llama. It seems genuine and truly has a personality. Everyone should chat with the original non-fine-tuned version if they get the chance. It’s a unique experience in the sea of “As an AI language model” OpenAI tunes.

  • @Kerfuffle
    111 months ago

    I thought its mode of operation is always “continue the provided text”

    I haven’t played with using it for conversation-like stuff, so I can’t say whether it’s “particularly good” or not. However, “continue the provided text” doesn’t preclude conversational stuff. If you give it enough of an example of the “conversation”, even models that aren’t conversation-tuned will complete it. They’ll write both sides of the conversation if you let them, but you can use stuff like reverse prompts to return control when it’s your “turn”.

    I’d guess the chat-tuned models are also aimed more at question/answer, and specifically at providing accurate and helpful answers, rather than at dialog in general.
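
The reverse-prompt idea mentioned above can be sketched roughly like this. It is only an illustration under stated assumptions: `build_prompt`, `cut_at_reverse_prompt`, and `fake_base_model` are hypothetical names made up for this example, the speaker labels "User" and "Bot" are arbitrary, and the stub stands in for a real base model that would keep writing both sides of the transcript.

```python
def build_prompt(turns, bot_name="Bot"):
    """Format (speaker, text) turns as a transcript that ends on the bot's cue."""
    lines = [f"{speaker}: {text}" for speaker, text in turns]
    lines.append(f"{bot_name}:")  # leave the bot's line open for the model to continue
    return "\n".join(lines)


def cut_at_reverse_prompt(completion, reverse_prompt):
    """Return (reply, hit): the text before the reverse prompt, and whether it appeared."""
    idx = completion.find(reverse_prompt)
    if idx == -1:
        return completion.strip(), False
    return completion[:idx].strip(), True


def fake_base_model(prompt):
    # Hypothetical stand-in for a base model: it happily writes the user's side too.
    return " Pretty well, thanks.\nUser: Good to hear.\nBot: Indeed."


turns = [("User", "Hello!"), ("Bot", "Hi there."), ("User", "How are you?")]
prompt = build_prompt(turns)
raw = fake_base_model(prompt)

# "\nUser:" marks the point where the model starts speaking for the user.
reply, stopped = cut_at_reverse_prompt(raw, "\nUser:")
print(reply)
```

Here the trimming happens after the fact to keep the sketch simple; a real setup would more likely stop generating as soon as the reverse prompt shows up, so no tokens are wasted on the user's side of the conversation.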