monero.town
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Lugh@futurology.todayM to Futurology@futurology.todayEnglish · 1 year ago

Many artificial intelligence (AI) systems have already learned how to deceive humans, even systems that have been trained to be helpful and honest.

techxplore.com

external-link
message-square
29
link
fedilink
76
external-link

Many artificial intelligence (AI) systems have already learned how to deceive humans, even systems that have been trained to be helpful and honest.

techxplore.com

Lugh@futurology.todayM to Futurology@futurology.todayEnglish · 1 year ago
message-square
29
link
fedilink
AI systems are already skilled at deceiving and manipulating humans, study shows
techxplore.com
external-link
Many artificial intelligence (AI) systems have already learned how to deceive humans, even systems that have been trained to be helpful and honest. In a review article published in the journal Patterns on May 10, researchers describe the risks of deception by AI systems and call for governments to develop strong regulations to address this issue as soon as possible.
  • Endward23@futurology.today
    link
    fedilink
    English
    arrow-up
    3
    ·
    1 year ago

    “Indeed, we have already observed an AI system deceiving its evaluation. One study of simulated evolution measured the replication rate of AI agents in a test environment, and eliminated any AI variants that reproduced too quickly.10 Rather than learning to reproduce slowly as the experimenter intended, the AI agents learned to play dead: to reproduce quickly when they were not under observation and slowly when they were being evaluated.” Source: AI deception: A survey of examples, risks, and potential solutions, Patterns (2024). DOI: 10.1016/j.patter.2024.100988

    As it appears, it refered to: Lehman J, Clune J, Misevic D, Adami C, Altenberg L, et al. The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities. Artif Life. 2020 Spring;26(2):274-306. doi: 10.1162/artl_a_00319. Epub 2020 Apr 9. PMID: 32271631.

    Very interesting.

Futurology@futurology.today

futurology@futurology.today

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 213 users / day
  • 626 users / week
  • 1.33K users / month
  • 6.09K users / 6 months
  • 4 local subscribers
  • 2.99K subscribers
  • 1.93K Posts
  • 12.1K Comments
  • Modlog
  • mods:
  • voidx@futurology.today
  • Lugh@futurology.today
  • Espiritdescali@futurology.today
  • AwesomeLowlander@futurology.today
  • BE: 0.19.11
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org