US microchip export controls imposed last year to freeze China’s development of supercomputers used to develop nuclear weapons and artificial intelligence (AI) systems such as ChatGPT are having only minimal effects on China’s tech sector.
The rules restricted shipments of Nvidia Corp and Advanced Micro Devices Inc chips that have become the global technology industry’s standard for developing chatbots and other AI systems.
However, Nvidia has created variants of its chips for the Chinese market that are slowed down to meet US rules.
Photo: AFP
Industry experts said the newest one — the Nvidia H800, announced in March — is likely to take 10 percent to 30 percent longer to carry out some AI tasks and could double some costs compared with Nvidia’s fastest US chips.
Even the slowed Nvidia chips represent an improvement for Chinese firms. Tencent Holdings Ltd (騰訊), one of China’s largest tech companies, last month estimated that systems using Nvidia’s H800 would cut the time it takes to train its largest AI system by more than half, from 11 days to four days.
“The AI companies that we talk to seem to see the handicap as relatively small and manageable,” said Charlie Chai, a Shanghai-based analyst with 86Research.
The back-and-forth between government and industry exposes the US challenge of slowing China’s progress in the high-tech sector without hurting US companies.
Part of the US strategy in setting the rules was to avoid such a shock that the Chinese would ditch US chips altogether and redouble their own chip development efforts.
“They had to draw the line somewhere, and wherever they drew it, they were going to run into the challenge of how to not be immediately disruptive, but how to also over time degrade China’s capability,” said one chip industry executive who asked to remain anonymous as they were discussing private discussions with regulators.
The export restrictions have two parts. The first puts a ceiling on a chip’s ability to calculate extremely precise numbers, a measure designed to limit supercomputers that can be used in military research.
Chip industry sources said that was an effective action.
However, calculating extremely precise numbers is less relevant in AI work such as large language models where the amount of data the chip can chew through is more important.
Nvidia is selling the H800 to China’s largest technology firms, including Tencent, Alibaba Group Holding Ltd (阿里巴巴) and Baidu Inc (百度), for use in such work, although it has not yet started shipping the chips in high volumes.
“The government isn’t seeking to harm competition or US industry, and allows US firms to supply products for commercial activities, such as providing cloud services for consumers,” Nvidia said in a statement last week.
China is an important customer for US technology, it added.
“The October export controls require that we create products with an expanding gap between the two markets,” Nvidia said last week. “We comply with the regulation while offering as competitive as possible products in each market.”
Nvidia chief scientist Bill Dally said in a separate statement this week that “this gap will grow quickly over time as training requirements continue to double every six to 12 months.”
A spokesperson for the Bureau of Industry and Security, the arm of the US Department of Commerce that oversees the rules, did not return a request for comment.
The second US limit is on chip-to-chip transfer speeds, which does affect AI. The models behind technologies such as ChatGPT are too large to fit onto a single chip. Instead, they must be spread over many chips — often thousands at a time — which all need to communicate with one another.
Nvidia has not disclosed the China-only H800 chip’s performance details, but a specification sheet seen by Reuters shows a chip-to-chip speed of 400 gigabytes per second, less than half the peak speed of 900 gigabytes per second for Nvidia’s flagship H100 chip available outside China.
Some in the AI industry believe that is still plenty of speed.
Naveen Rao, CEO of the start-up MosaicML, which specializes in helping AI models to run better on limited hardware, estimated a 10 to 30 percent system slowdown.
“There are ways to get around all this algorithmically,” he said. “I don’t see this being a boundary for a very long time — like 10 years.”
Money helps. A chip in China that takes twice as long to finish an AI training task than a faster US chip can still get the work done.
“At that point, you’ve got to spend US$20 million instead of US$10 million to train it,” said one industry source, who asked to remain anonymous because of agreements with partners.
“Does that suck? Yes it does, but does that mean this is impossible for Alibaba or Baidu? No, that’s not a problem,” they said.
Moreover, AI researchers are trying to slim down the massive systems they have built to cut the cost of training products similar to ChatGPT and other processes. They would require fewer chips, reducing chip-to-chip communications and lessening the effect of the US speed limits.
Two years ago, the industry was thinking AI models would get bigger and bigger, said Cade Daniel, a software engineer at Anyscale, a San Francisco start-up that provides software to help companies perform AI work.
“If that were still true today, this export restriction would have a lot more impact,” Daniel said. “This export restriction is noticeable, but it’s not quite as devastating as it could have been.”
Starlux Airlines Co (星宇航空) today unveiled a long-haul network expansion plan at a shareholders’ meeting in Taipei, including direct flights to Barcelona, Spain, and Zurich, Switzerland, as well as a service connecting Taipei, Sydney and New Zealand. Starlux is to become the first Taiwanese carrier to offer non-stop services to the two European cities, while the inaugural oceanic route is expected to expand transit opportunities within the Australia-New Zealand market, Starlux said. Flight services to Chicago, Dallas, Washington and New York are under evaluation, the airline added. Prior to the shareholders’ meeting, the airline earlier this year announced that it would be
Cairo’s new monorail slices across the city skyline, running above the familiar chaos of blaring horns and aging buses’ exhaust fumes that mark rush hour below. The US$4.5 billion monorail, opened this month, is among Egypt’s most prominent new transport projects, part of a debt-funded infrastructure drive criticized for sapping state finances while bringing limited benefits to most of the country’s 109 million people. “It feels like you’re in a different country,” said Ramy Sayed, a restaurant manager, aboard a driverless Innovia 300 train. “No noise, no traffic, we’re not used to this.” The eastern line runs 56km from the bustling middle-class
Netherlands-based semiconductor equipment supplier ASML Holding NV yesterday said that it is planning to hire an additional 1,000 people in Taiwan this year in response to growing demand from clients. ASML had previously planned to recruit 600 people this year, but that the plan has been adjusted upward, ASML vice president and ASML Taiwan general manager Grace Wang (汪佳慧) told reporters. ASML has a workforce of more than 4,500 in Taiwan, accounting for about 10 percent of its global total, Wang said. This year’s recruitment campaign would focus on adding people in the customer support, manufacturing and supply chain domains to assist ASML
Nvidia Corp yesterday announced that CEO Jensen Huang (黃仁勳) would attend an employee meeting in Taipei tomorrow to celebrate the launch of the company’s Taiwan headquarters project. Huang would attend a gathering at the site of Nvidia’s planned headquarters in Beitou Shilin Technology Park (北投士林科技園區), the company said in a statement. After arriving in Taiwan on Saturday last week, Huang told reporters that he plans to meet with Quanta Computer Inc (廣達) chairman Barry Lam (林百里) and Taiwan Semiconductor Manufacturing Co (TSMC, 台積電) chairman C.C. Wei (魏哲家), and would attend the groundbreaking ceremony for Nvidia’s Taiwan headquarters tomorrow. Nvidia has not yet applied