The global artificial intelligence (AI) server market is expected to remain dominated by servers based on graphics processing units (GPUs) this year, but application-specific integrated circuits (ASICs)-based models are set to see faster growth in shipments, the Market Intelligence and Consulting Institute (MIC) said.
As major cloud service providers (CSPs) continue to train large language models for machine learning, outlays on AI data centers and high-density GPU server racks remain a priority, MIC said.
CSPs would continue relying on GPU ecosystems when offering computing resources to customers, since most enterprises and developers remain accustomed to working within GPU environments, MIC information and communications technology section deputy director Chris Wei (魏傳虔) said last week.
Photo courtesy of Hon Hai Precision Industry Co
However, demand for servers powered by ASICs is growing as the market’s focus gradually shifts toward inference, Wei said.
While AI training is akin to preparing materials in a centralized factory — capital-intensive and large in scale — inference resembles bringing those materials back on premises, where use is more distributed and steady.
Such a pattern makes shipment growth for ASIC-based servers more likely to outpace that of GPU-based models this year, Wei said.
Shipments of servers used for AI inference are expected to be about four times higher than those used for training this year, highlighting the role ASICs could play.
ASICs are well-suited to inference demand, as they prioritize efficiency and cost control, Wei said.
Inference workloads are relatively fixed and suitable for specialized designs that can complete specific tasks with lower power consumption, he said.
MIC expects Taiwanese manufacturers who already produce GPU servers to expand into the ASIC segment smoothly, Wei said.
The key to future competition for these manufacturers lies not only in securing orders tied to specific chips, but in whether they can build system integration capabilities and flexible production lines for both GPU and ASIC-based platforms, he said.
HORMUZ ISSUE: The US president said he expected crude prices to drop at the end of the war, which he called a ‘minor excursion’ that could continue ‘for a little while’ The United Arab Emirates (UAE) and Kuwait started reducing oil production, as the near-closure of the crucial Strait of Hormuz ripples through energy markets and affects global supply. Abu Dhabi National Oil Co (ADNOC) is “managing offshore production levels to address storage requirements,” the company said in a statement, without giving details. Kuwait Petroleum Corp said it was lowering production at its oil fields and refineries after “Iranian threats against safe passage of ships through the Strait of Hormuz.” The war in the Middle East has all but closed Hormuz, the narrow waterway linking the Persian Gulf to the open seas,
Apple Inc increased iPhone production in India by about 53 percent last year and now makes a quarter of its marquee devices there, reflecting the US company’s efforts to avoid tariffs on China. The company assembled about 55 million iPhones in India last year, up from 36 million a year earlier, people familiar with the matter said, asking not to be named because the numbers aren’t public. Apple makes about 220 million to 230 million iPhones a year globally, with India’s share of the total increasing rapidly. Apple has accelerated its expansion in the world’s most populous country in recent years, bolstered
HEADWINDS: The company said it expects its computer business, as well as consumer electronics and communications segments to see revenue declines due to seasonality Pegatron Corp (和碩) yesterday said it aims to grow its artificial intelligence (AI) server revenue more than 10-fold this year from last year, driven by orders from neocloud solutions clients and large cloud service providers. The electronics manufacturing service provider said AI server revenue growth would be driven primarily by the Nvidia Corp GB300 server platform. Server shipments are expected to increase each quarter this year, with the second half likely to outperform the first half, it said. The AI server market is expected to broaden this year as more inference applications emerge, which would drive demand for system-on-chip, application-specific integrated circuits
PROJECTION: TSMC said it expects strong growth this year, with revenue in US dollars projected to grow by about 30 percent, outperforming the industry Taiwan Semiconductor Manufacturing Co (TSMC, 台積電) yesterday reported consolidated sales last month reached NT$317.66 billion (US$9.98 billion), the highest ever for the month of February, driven by robust demand for chips built using the company’s advanced 3-nanometer (3nm) process. Last month’s figure was up 22.2 percent from a year earlier, but fell 20.8 percent from January, the world’s largest contract chipmaker said in a statement. For the first two months of the year, TSMC posted cumulative sales of NT$718.91 billion, up 29.9 percent from a year earlier. Analysts attributed the growth to sustained global demand for artificial intelligence (AI) products