In an age of Photoshop, filters and social media, many people are used to seeing manipulated pictures — subjects become slimmer and smoother or, in the case of Snapchat, transformed into puppies.
However, there is a new breed of video and audio manipulation tools, made possible by advances in artificial intelligence and computer graphics, that would allow for the creation of realistic-looking footage of public figures appearing to say, well, anything.
US President Donald Trump declaring his proclivity for water sports. Former US secretary of state Hillary Rodham Clinton describing the stolen children she keeps locked in her wine cellar. Actor Tom Cruise finally admitting what we suspected all along — that he is a brony (a My Little Pony fan.)
This is the future of fake news. People have long been told not to believe everything they read, but soon they will have to question everything they see and hear as well.
For now, there are several research teams working on capturing and synthesizing different visual and audio elements of human behavior.
Software developed at Stanford University is able to manipulate video footage of public figures to allow a second person to put words in their mouth — in real time.
Face2Face captures the second person’s facial expressions as they talk into a webcam and then morphs those movements directly onto the face of the person in the original video.
The research team demonstrated their technology by puppeteering videos of former US president George W. Bush, Russian President Vladimir Putin and Trump.
On its own, Face2Face is a fun plaything for creating memes and entertaining late-night talk show hosts.
However, with the addition of a synthesized voice, it becomes more convincing — not only does the digital puppet look like the politician, but it can also sound like the politician.
A research team at the University of Alabama at Birmingham has been working on voice impersonation.
With three to five minutes of audio of a victim’s voice — taken live or from YouTube videos or radio shows — an attacker can create a synthesized voice that can fool both humans and voice biometric security systems used by some banks and smartphones.
The attacker can then talk into a microphone and the software will convert it so that the words sound like they are being spoken by the victim — whether that is over the telephone or on a radio show.
Canadian start-up Lyrebird has developed similar capabilities, which it says can be used to turn text into on-the-spot audiobooks “read” by famous voices or for characters in video games.
Although their intentions might be well-meaning, voice-morphing technology could be combined with face-morphing technology to create convincing fake statements by public figures.
You only have to look at the University of Washington’s Synthesizing Obama project, where they took the audio from one of former US president Barack Obama’s speeches and used it to animate his face in an entirely different video with incredible accuracy — thanks to training a recurrent neural network with hours of footage — to get a sense of how insidious these adulterations can be.
Beyond fake news, there are many other implications, said Nitesh Saxena, associate professor and research director of the University of Alabama at Birmingham’s department of computer science.
“You could leave fake voice messages posing as someone’s mom. Or defame someone and post the audio samples online,” Saxena said.
These morphing technologies are not yet perfect. The facial expressions in the videos can seem a little distorted or unnatural and the voices can sound a little robotic.
However, given time, they will be able to faithfully recreate the sound or appearance of a person — to the point where it might be very difficult for humans to detect the fraud.
Given the erosion of trust in the media and the rampant spread of hoaxes via social media, it will become even more important for news organizations to scrutinize content that looks and sounds like the real deal.
Telltale signs will be where the video or audio was created, who else was at the event and whether the weather conditions match the records of that day.
People should also be looking at the lighting and shadows in the video, whether all of the elements featured in the frame are the right size and whether the audio is synced perfectly, said Mandy Jenkins, from social news company Storyful, which specializes in verifying news content.
Doctored content might not pass the scrutiny of a rigorous newsroom, but if posted as a grainy video to social media, it could spread virally and trigger a public relations, political or diplomatic disaster. Imagine Trump declaring war on North Korea, for example.
“If someone looks like Trump and speaks like Trump, they will think it’s Trump,” Saxena said.
“We already see it doesn’t even take doctored audio or video to make people believe something that isn’t true,” Jenkins added. “This has the potential to make it worse.”
Two sets of economic data released last week by the Directorate-General of Budget, Accounting and Statistics (DGBAS) have drawn mixed reactions from the public: One on the nation’s economic performance in the first quarter of the year and the other on Taiwan’s household wealth distribution in 2021. GDP growth for the first quarter was faster than expected, at 6.51 percent year-on-year, an acceleration from the previous quarter’s 4.93 percent and higher than the agency’s February estimate of 5.92 percent. It was also the highest growth since the second quarter of 2021, when the economy expanded 8.07 percent, DGBAS data showed. The growth
In the intricate ballet of geopolitics, names signify more than mere identification: They embody history, culture and sovereignty. The recent decision by China to refer to Arunachal Pradesh as “Tsang Nan” or South Tibet, and to rename Tibet as “Xizang,” is a strategic move that extends beyond cartography into the realm of diplomatic signaling. This op-ed explores the implications of these actions and India’s potential response. Names are potent symbols in international relations, encapsulating the essence of a nation’s stance on territorial disputes. China’s choice to rename regions within Indian territory is not merely a linguistic exercise, but a symbolic assertion
More than seven months into the armed conflict in Gaza, the International Court of Justice ordered Israel to take “immediate and effective measures” to protect Palestinians in Gaza from the risk of genocide following a case brought by South Africa regarding Israel’s breaches of the 1948 Genocide Convention. The international community, including Amnesty International, called for an immediate ceasefire by all parties to prevent further loss of civilian lives and to ensure access to life-saving aid. Several protests have been organized around the world, including at the University of California Los Angeles (UCLA) and many other universities in the US.
Every day since Oct. 7 last year, the world has watched an unprecedented wave of violence rain down on Israel and the occupied Palestinian Territories — more than 200 days of constant suffering and death in Gaza with just a seven-day pause. Many of us in the American expatriate community in Taiwan have been watching this tragedy unfold in horror. We know we are implicated with every US-made “dumb” bomb dropped on a civilian target and by the diplomatic cover our government gives to the Israeli government, which has only gotten more extreme with such impunity. Meantime, multicultural coalitions of US