Until AI is allowed to vote perhaps they sit the fuck down.

  • @Voroxpete
    link
    1610 months ago

    In fact, since AI in theory should be much better at fact checking than humans, the standards of information quality should be much higher

    What we’re all calling “AI” right now has basically zero ability to fact check.

    Large Language Models are essentially just a form of autocomplete. They predict valid outputs based on statistical analysis of their training data. This makes them quite good at passing the Turing test (ie, convincing the average user that they have something approximating intelligence), but what they completely lack is the ability to evaluate source for reliability. That’s why it’s so easy to deliberately trick them into repeating false information.

    Real fact checking is a lot more than just googling something and finding a source that agrees with you. I can find sources claiming that the Earth is flat, aliens rule the world and Hillary Clinton is a baby eating lizard person. But none of those sources are in any way credible. However explaining why they’re not credible is a much more difficult question. Media literacy is a conplex skill, and it’s one that involves evaluating a huge number of different criteria, using a large number of different metrics, and it often involves making difficult judgement calls. Even people who are good at media literacy can be fooled, or just get it wrong. The entire study of history is basically about evaluating sources, and there’s often serious disagreements over the veracity of a piece of information. Good journalists have to be very careful over exactly how they frame information to disambiguate the exact degree of confidence they have about it (ie, I can say with absolute certainty that this person told me this thing, but I can’t say with absolute certainty that what they told me is true)… And that’s the good journalists. There are a LOT of bad journalists out there.

    It’s possible that some hypothetical future generation of AI will be better at fact checking them humans, but that’s not what we have today. The only way to get modern LLMs to produce factual information is to be extremely careful about what data they are fed; and even then, they will often just make shit up out of whole cloth from that data. Any output has to be verified by a human operator to avoid situations like Microsoft recommending the Ottawa food bank as a must see tourist attraction.

    • @[email protected]
      link
      fedilink
      110 months ago

      No, I know that modern AI has no real ability to fact check, but the reason is because they’ve never been built that way, nor do they have the resources to do it properly. They have no way to know what is a reliable source, nor how to interpret the data in a meaningful way if it needs to be used in an abstract manner.

      But I do believe that modern AI technology should be able to do so if given the resources. Create an AI that only references from a list of credible sources, and is able to compare them to what is said elsewhere.

      I’m no AI specialist or anything, so maybe I’m completely wrong and such a method wouldn’t work. But at the very least, I haven’t even heard of any real attempt at making a fact checking AI yet. All the existing ones are shit and only adapt normal language learning models to reference other internet sources regardless of their legitimacy.

      • @Voroxpete
        link
        210 months ago

        The problem is that for any of what you’re describing to work, AI has to be capable of comprehension and interpretation, neither of which are capabilities that LLMs have. This would be a quantum leap forward in terms of AI technology.

        That’s the key thing that has to be understood about “AI”; it fundamentally does not understand any of the words that it’s saying. It’s engaged in nothing more than extremely complex mimicry. Even a dog has more comprehension of human language than an LLM, and you wouldn’t trust a dog to fact check political ads. Remember, even when working from accurate training data, LLMs will still cheerfully invent entirely fictitious data that just happens to fit the pattern of the training data, because that’s all they are; pattern matchers.

        If I present an AI with the statements “Mike Harris sold our LTC care system to corporate profiteers” and “Mike Harris sold your grandma’s house to corporate profiteers” it has no way of accurately determining if the latter statement is true or false, because it fits the pattern of the first statement. A human can instantly distinguish between the concept of a long term care home and a person’s privately owned house. An AI doesn’t know what a person is, what a long term care home is, what ownership is, what the difference between private and public ownership are, what a house is and how that’s different from a long term care home even though both are referred to as homes, what it means to sell something, what profiteering is and whether or not that term accurately describes the actions taken by the corporations that bought most of Ontario’s LTC system. And then you have to get into the complex legalities of whether or not you’re allowed to use the term “profiteers” in a political ad… It’s a nightmare of complexity.

        If there’s a way to get to what you’re describing, from where we are now, no one has come up with it yet and the first company that does will be rich beyond their wildest dreams. We’re just not even remotely close to that kind of technology.