Dotted around the Internet are a multitude of silly images of former US president Donald Trump. Here he is as a superhero, there he is as a cartoon warrior, and here he is playing basketball with a young Michael Jordan. The glossy look gives away that these are AI-generated, but there is no denying the underlying message: Trump holds on to a kind of online cultural power and is still weirdly beloved.
The messenger is Midjourney, a San Francisco-based artificial intelligence (AI) startup that in its 17-month existence has carried out no marketing, raised not one cent from venture capitalists, but is making US$200 million in annual revenue and has become one of the most powerful tools for generating remarkably real AI “photos.”
Fake snaps of Trump getting arrested and Pope Francis in a white puffer jacket confused Internet users and went viral earlier this year — and they were generated on Midjourney. The company has now released a new version that can do this with even more realism.
Having previously insisted that he does not like fake photos, Midjourney founder David Holz finds himself steering a tool created for artists that is also being exploited by propagandists. That is the trajectory of a kind of classic AI innovator, one who could not resist making their system more powerful at the price of their own standards.
Midjourney did not respond to multiple requests for comment or for an interview with Holz.
Holz grew up in Florida, where he carried out advanced science experiments as a kid — shooting paper airplanes at 257.5kph down a homemade wind tunnel, for example — before juggling interests in design and math in higher education. He co-founded Leap Motion in 2008, a startup that made a USB device the size of an iPod that allowed you to control a computer program with hand gestures. Holz was more than a decade too early with his idea, and he stuck it out to his detriment. After spurning a takeover bid from Apple Inc in 2013 for hundreds of millions of dollars, Leap Motion dropped in value. Holz finally sold it to a British firm in 2019 for US$30 million.
Undeterred, he founded Midjourney in 2021, eventually turning it into an independent research lab that would “expand the imaginative powers of the human species.”
In interviews with Forbes and The Register in 2022, Holz described how Midjourney was a space for people to “make beautiful things.”
He built the tool on an algorithm from OpenAI called “CLIP” or Contrastive Language Image Pre-training — a neural network that could learn visual concepts from natural language.
In the spring and summer of 2022, several months before OpenAI wowed the world with DALL-E 2, Holz released his own early versions of Midjourney, a tool that could conjure digital art from text prompts. His customers paid about US$10 to US$60 per month to use the tool, accessing it through public chatrooms on the Discord app.
It could have all ended there, except Holz had the gold dust every Silicon Valley entrepreneur craves: connections. He picked up several renowned advisers, including coding site GitHub’s CEO Nat Friedman, who purchased thousands of AI chips this year for startups to exploit. These chips, known as GPUs, are critical to keeping ahead in the race to build AI. With help from his network, Holz amassed an enormous collection of GPUs (as many as 10,000, roughly the number estimated to have been used to train ChatGPT) to make Midjourney’s models smarter and faster.
All that computing power translated into jaw-dropping improvements in version 5 of Midjourney, released in March last year. When users asked it for photorealistic images of people, everything from skin texture to facial features was much more realistic, while reflections, shadows and lighting were also more true to life.
Version 6, released on Dec. 20, generates faces with even more startling detail, with skin pores and texture that make them virtually indistinguishable from real photos. Midjourney users have been playing with the new software by generating images of Hollywood actors in imaginary movie stills, such as one of Leonardo DiCaprio as former Soviet leader Vladimir Lenin.
Holz seems to have gone quiet since his interviews last year. Back then, the entrepreneur expressed deep discomfort with people using Midjourney to create fake photos. That was “extremely dangerous,” he said. “I don’t really want to be a source of fake photos in the world.”
Yet by making Midjourney’s software more powerful, Holz laid the groundwork for fake images to proliferate. It now takes mere seconds to generate “photos” of celebrities and historic figures that are hyper-realistic, and some have risen to the top of Google search results.
Many Midjourney users are still creating art, but plenty more are creating photorealistic portraits like the one of Israeli Prime Minister Benjamin Netanyahu in Saudi clothing, which was created by a Midjourney user on Dec. 4 last year. The Midjourney user behind the Netanyahu image did not respond to requests for comment.
Many fear that AI photos would worsen misinformation, but in reality, it is hard to see fake images of Netanyahu and Trump tricking most people. AI photos still look a little uncanny, and media attention to “deepfakes” has made everyone more vigilant.
Their real impact would be more nuanced and harder to detect: A flood of fake images promoting certain ideas would act more like advertising campaigns that influence opinion rather than fool people. For instance, fake photos that put Trump in critical historic moments (knocking down the Berlin Wall or fighting in the Vietnam War) were made as a joke on X, formerly known as Twitter, but have found their way onto Make America Great Again forums, where people view them as inspiration.
AI pics could also be a propaganda tool outside of politics. Online forums that encourage eating disorders have been using Midjourney and other similar software to post images of ultra-skinny people to “inspire” their members, an August study by the nonprofit group Center for Countering Digital Hate showed.
Midjourney uses its own AI software to block potentially harmful content and it has about 70 hires monitoring its output, Bloomberg News reporting showed. However, such restrictions are relatively easy to circumvent. For instance, while Midjourney does not let you generate images of Bill and Hillary Clinton with blood on their hands, it does let you put blood-red strawberry syrup on them.
Holz seems aware of the problem. Back in March, he ended Midjourney’s free-trial program citing “abuse” by users.
However, the effects of propaganda are, like advertising, hard to track. As more people use his service, with 9 million users reported to be on Midjourney’s Discord server, all that new content generated to support Trump, eating disorders or any other cultural value becomes more difficult to police.
It took eight months for Midjourney to release Version 6, a noticeably long time compared with the quicker releases of previous versions, which came two to four months apart. Holz might be struggling to get the AI chips he needs, or he could be grappling with the responsibilities of opening such an influential tool to the public mere months before a presidential election. Let us hope it was the latter.
Parmy Olson is a Bloomberg Opinion columnist covering technology. A former reporter for the Wall Street Journal and Forbes, she is author of We Are Anonymous. This column does not necessarily reflect the opinion of the editorial board or Bloomberg LP and its owners.