Willison: “No model has beaten GPT-4 on a range of widely used benchmarks like this.”

  • kakes
    link
    fedilink
    English
    arrow-up
    10
    ·
    9 months ago

    They all claim to have “near-human” abilities.

  • nnullzz@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    9 months ago

    Just gave Claude a try and although there are similarities with the other AI and how it “feels”, it responds with a certain depth and “open-mindedness” that I don’t think I’ve experienced with the other ones. Planning on playing around with it for a couple days to see the range of its helpfulness.

    • Dizzy Devil Ducky@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      9 months ago

      I don’t have access to whatever their latest public model is and don’t know if the one on their website has updated in the past few months, but it’s by far my favorite AI model for generated text. Out of the stories I’ve had it generate, it’s by far the best compared to Perplexity and ChatGPT, at least for my standards.

  • orphiebaby@lemm.ee
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    1
    ·
    edit-2
    9 months ago

    The devs and shills for Claude also claimed it to be able to analyze a full document and give results, but it can’t, it just lies to you and says it can and then posts no download link for the results it said it wrote out.