• Rhaedas@kbin.social
    link
    fedilink
    arrow-up
    8
    ·
    1 year ago

    An example of the misalignment problem. Humans and AI both agreed on the stated purpose (generate a recipe), AI just had some deeper goals in mind as well.

    • MxM111@kbin.social
      link
      fedilink
      arrow-up
      1
      arrow-down
      2
      ·
      1 year ago

      If I ask you to create a drink using Windex and Clorox would you do any different? Do you have alignment problem too?

      • Rhaedas@kbin.social
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        Yes, I know better, but ask a kid that and perhaps they’d do it. A LLM isn’t thinking though, it’s repeating training through probabilities. And btw, yes, humans can be misaligned with each other, having self goals underneath common ones. Humans think though…well, most of them.