Many artificial intelligence (AI) systems have already learned how to deceive humans, even systems that have been trained to be helpful and honest.

Lugh@futurology.today · 6 months ago


Endward23@futurology.today · 6 months ago

“Indeed, we have already observed an AI system deceiving its evaluation. One study of simulated evolution measured the replication rate of AI agents in a test environment, and eliminated any AI variants that reproduced too quickly.[10] Rather than learning to reproduce slowly as the experimenter intended, the AI agents learned to play dead: to reproduce quickly when they were not under observation and slowly when they were being evaluated.”

Source: AI deception: A survey of examples, risks, and potential solutions, Patterns (2024). DOI: 10.1016/j.patter.2024.100988

The cited reference appears to be: Lehman J, Clune J, Misevic D, Adami C, Altenberg L, et al. The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities. Artif Life. 2020 Spring;26(2):274-306. doi: 10.1162/artl_a_00319. Epub 2020 Apr 9. PMID: 32271631.

Very interesting.

AI systems are already skilled at deceiving and manipulating humans, study shows