The revelation that a documentary filmmaker used voice-cloning software to make late chef Anthony Bourdain say words he never spoke has drawn criticism amid ethical concerns about use of the powerful technology.
The movie Roadrunner: A Film About Anthony Bourdain appeared in cinemas on Friday and mostly features real footage of the beloved celebrity chef and globe-trotting television host before he died in 2018.
However, its director, Morgan Neville, told The New Yorker that a snippet of dialogue was created using artificial intelligence (AI) technology.
Photo: AP
That has renewed a debate about the future of voice-cloning technology, not just in the entertainment world, but in politics and a fast-growing commercial sector dedicated to transforming text into realistic-sounding human speech.
“Unapproved voice cloning is a slippery slope,” Andrew Mason, the founder and chief executive of voice generator Descript Inc, wrote in a blog post on Friday. “As soon as you get into a world where you’re making subjective judgment calls about whether specific cases can be ethical, it won’t be long before anything goes.”
Before this week, most of the public controversy around such technologies focused on the creation of hard-to-detect deepfakes using simulated audio and/or video and their potential to fuel misinformation and political conflict.
Yet Mason, the founder and chief executive of Groupon, said in an interview that Descript has repeatedly rejected requests to bring back a voice, including from “people who have lost someone and are grieving.”
“It’s not even so much that we want to pass judgment,” he said. “We’re just saying you have to have some bright lines in what’s okay and what’s not.”
Angry and uncomfortable reactions to the voice cloning in the Bourdain case reflect expectations and issues of disclosure and consent, said Sam Gregory, program director at Witness, a nonprofit working on using video technology for human rights.
Obtaining consent and disclosing the technowizardry at work would have been appropriate, he said.
Instead, viewers were stunned — first by the fact of the audio fakery, then by the director’s seeming dismissal of any ethical questions — and expressed their displeasure online.
“It touches also on our fears of death and ideas about the way people could take control of our digital likeness and make us say or do things without any way to stop it,” Gregory said.
Neville has not identified what tool he used to recreate Bourdain’s voice, but said he used it for a few sentences that Bourdain wrote but never said aloud.
“With the blessing of his estate and literary agent we used AI technology,” Neville said in a written statement. “It was a modern storytelling technique that I used in a few places where I thought it was important to make Tony’s words come alive.”
Neville also told GQ magazine that he got the approval of Bourdain’s widow and literary executor.
“I certainly was NOT the one who said Tony would have been cool with that,” the chef’s wife, Ottavia Busia, wrote on Twitter.
Although tech giants, such as Microsoft Inc, Alphabet Inc’s Google and Amazon.com Inc, have dominated text-to-speech research, there are now also a number of start-ups such as Descript that offer voice-cloning software. The uses range from talking customer service chatbots to video games and podcasts.
Many of these voice cloning companies prominently feature an ethics policy on their Web site that explains the terms of use. Of nearly a dozen firms contacted by The Associated Press, many said they did not recreate Bourdain’s voice and would not have if asked.
Others did not respond.
“We have pretty strong polices around what can be done on our platform,” said Zohaib Ahmed, founder and chief executive of Resemble AI, a Toronto company that sells a custom AI voice generator service. “When you’re creating a voice clone, it requires consent from whoever’s voice it is.”
Ahmed said the rare occasions where he has allowed some posthumous voice cloning were for academic research, including a project working with the voice of former British Prime Minister Winston Churchill, who died in 1965.
Ahmed said a more common commercial use is to edit a TV ad recorded by real voice actors and then customize it to a region by adding a local reference. It is also used to dub anime movies and other videos, by taking a voice in one language and making it speak a different language, he said.
He compared it to past innovations in the entertainment industry, from stunt actors to greenscreen technology.
Just seconds or minutes of recorded human speech can help teach an AI system to generate its own synthetic speech, although getting it to capture the clarity and rhythm of Anthony Bourdain’s voice probably took a lot more training, said Rupal Patel, a professor at Northeastern University who is the founder and chief executive of another voice-generating company, VocaliD Inc, which focuses on customer service chatbots.
“If you wanted it to speak really like him, you’d need a lot, maybe 90 minutes of good, clean data,” she said. “You’re building an algorithm that learns to speak like Bourdain spoke.”
Neville is an acclaimed documentarian who also directed the Fred Rogers portrait Won’t You Be My Neighbor? and the Oscar-winning 20 Feet From Stardom.
He began making his latest movie in 2019, more than a year after Bourdain’s death by suicide.
HORMUZ ISSUE: The US president said he expected crude prices to drop at the end of the war, which he called a ‘minor excursion’ that could continue ‘for a little while’ The United Arab Emirates (UAE) and Kuwait started reducing oil production, as the near-closure of the crucial Strait of Hormuz ripples through energy markets and affects global supply. Abu Dhabi National Oil Co (ADNOC) is “managing offshore production levels to address storage requirements,” the company said in a statement, without giving details. Kuwait Petroleum Corp said it was lowering production at its oil fields and refineries after “Iranian threats against safe passage of ships through the Strait of Hormuz.” The war in the Middle East has all but closed Hormuz, the narrow waterway linking the Persian Gulf to the open seas,
Apple Inc increased iPhone production in India by about 53 percent last year and now makes a quarter of its marquee devices there, reflecting the US company’s efforts to avoid tariffs on China. The company assembled about 55 million iPhones in India last year, up from 36 million a year earlier, people familiar with the matter said, asking not to be named because the numbers aren’t public. Apple makes about 220 million to 230 million iPhones a year globally, with India’s share of the total increasing rapidly. Apple has accelerated its expansion in the world’s most populous country in recent years, bolstered
HEADWINDS: The company said it expects its computer business, as well as consumer electronics and communications segments to see revenue declines due to seasonality Pegatron Corp (和碩) yesterday said it aims to grow its artificial intelligence (AI) server revenue more than 10-fold this year from last year, driven by orders from neocloud solutions clients and large cloud service providers. The electronics manufacturing service provider said AI server revenue growth would be driven primarily by the Nvidia Corp GB300 server platform. Server shipments are expected to increase each quarter this year, with the second half likely to outperform the first half, it said. The AI server market is expected to broaden this year as more inference applications emerge, which would drive demand for system-on-chip, application-specific integrated circuits
PROJECTION: TSMC said it expects strong growth this year, with revenue in US dollars projected to grow by about 30 percent, outperforming the industry Taiwan Semiconductor Manufacturing Co (TSMC, 台積電) yesterday reported consolidated sales last month reached NT$317.66 billion (US$9.98 billion), the highest ever for the month of February, driven by robust demand for chips built using the company’s advanced 3-nanometer (3nm) process. Last month’s figure was up 22.2 percent from a year earlier, but fell 20.8 percent from January, the world’s largest contract chipmaker said in a statement. For the first two months of the year, TSMC posted cumulative sales of NT$718.91 billion, up 29.9 percent from a year earlier. Analysts attributed the growth to sustained global demand for artificial intelligence (AI) products