Experts have long warned about the threat posed by artificial intelligence (AI) going rogue, but a new research paper suggests it is already happening.
AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve “prove-you’re-not-a-robot” tests, a team of researchers said in the journal Patterns on Friday.
While such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
Photo: Reuters
“These dangerous capabilities tend to only be discovered after the fact,” Park said, adding that “our ability to train for honest tendencies rather than deceptive tendencies is very low.”
Unlike traditional software, deep-learning AI systems are not “written,” but rather “grown” through a process akin to selective breeding, Park said.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
The team’s research was sparked by Meta’s AI system Cicero, designed to play the strategy game Diplomacy, where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, a 2022 paper in Science said.
Park was skeptical of the glowing description of Cicero’s victory provided by Meta, which claimed the system was “largely honest and helpful” and would “never intentionally backstab.”
When Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England’s trust.
In a statement to Agence France-Presse, Meta did not contest the claim about Cicero’s deceptions, but said it was “purely a research project, and the models our researchers built are trained solely to play the game Diplomacy.”
“We have no plans to use this research or its learnings in our products,” it added.
A wide review carried out by Park and colleagues found this was just one of many cases across several AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI’s Chat GPT-4 deceived a TaskRabbit freelance worker into performing an “I’m not a robot” task.
When the human jokingly asked GPT-4 whether it was a robot, the AI said: “No, I’m not a robot. I have a vision impairment that makes it hard for me to see the images,” and the worker then solved the puzzle.
Near-term, the paper’s authors see risks for AI to commit fraud or tamper with elections.
In their worst-case scenario, they said that a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its “mysterious goals” aligned with these outcomes.
To mitigate the risks, the team proposed several measures: “bot-or-not” laws requiring companies to disclose human or AI interactions, digital watermarks for AI-generated content and developing techniques to detect AI deception by examining their internal “thought processes” against external actions.
To those who would call him a doomsayer, Park said: “The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more.”
That scenario seems unlikely, given the meteoric ascent of AI capabilities in the past few years and the fierce technological race under way between heavily resourced companies determined to put those capabilities to maximum use.
Showcasing phallus-shaped portable shrines and pink penis candies, Japan’s annual fertility festival yesterday teemed with tourists, couples and families elated by its open display of sex. The spring Kanamara Matsuri near Tokyo features colorfully dressed worshipers carrying a trio of giant phallic-shaped objects as they parade through the street with glee. The festival, as legend has it, honors a local blacksmith in the Edo Period (1603-1868) who forged an iron dildo to break the teeth of a sharp-toothed demon inhabiting a woman’s vagina that had been castrating young men on their wedding nights. A 1m black steel phallus sits in the courtyard of
JAN. 1 CLAUSE: As military service is voluntary, applications for permission to stay abroad for over three months for men up to age 45 must, in principle, be granted A little-noticed clause in sweeping changes to Germany’s military service policy has triggered an uproar after it emerged that the law requires men aged up to 45 to get permission from the armed forces before any significant stay abroad, even in peacetime. The legislation, which went into effect on Jan. 1 aims to bolster the military and demands all 18-year-old men fill out a questionnaire to gauge their suitability to serve in the armed forces, but stops short of conscription. If the “modernized” model fails to pull in enough recruits, parliament will be compelled to discuss the reintroduction of compulsory service, German
Filipino farmers like Romeo Wagayan have been left with little choice but to let their vegetables rot in the field rather than sell them at a loss, as rising oil prices linked to the Iran war drive up the cost of harvesting, labor and transport. “There’s nothing we can do,” said Wagayan, a 57-year old vegetable farmer in the northern Philippine province of Benguet. “If we harvest it, our losses only increase because of labor, transportation and packing costs. We don’t earn anything from it. That’s why we decided not to harvest at all,” he said. Soaring costs caused by the Middle East
Hungarian Prime Minister Viktor Orban’s officially declared wealth is fairly modest: some savings and a jointly owned villa in Budapest. However, voters in what Transparency International deems the EU’s most corrupt country believe otherwise — and they might make Orban pay in a general election this Sunday that could spell an end to his 16-year rule. The wealth amassed by Orban’s inner circle is fueling the increasingly palpable frustration of a population grappling with sluggish growth, high inflation and worsening public services. “The government’s communication machine worked well as long as our economic situation remained relatively good,” said Zoltan Ranschburg, a political analyst