Alibaba Group Holding Ltd (阿里巴巴) cofounder Jack Ma (馬雲)-backed Ant Group Co (螞蟻集團) used Chinese-made semiconductors to develop techniques for training artificial intelligence (AI) models that would cut costs by 20 percent, people familiar with the matter said.
Ant used domestic chips, including from Alibaba and Huawei Technologies Co (華為), to train models using the so-called “mixture of experts” machine learning approach, the people said.
It got results similar to those from Nvidia Corp chips, such as the H800, they said.
Photo: AFP
Hangzhou-based Ant is still using Nvidia for AI development, but is now relying mostly on alternatives, including from Advanced Micro Devices Inc and Chinese chips for its latest models, one of the people said.
The models mark Ant’s entry into a race between Chinese and US companies that has accelerated since DeepSeek (深度求索) demonstrated how capable models can be trained for far less than the billions invested by OpenAI and Alphabet Inc’s Google.
It underscores how Chinese companies are trying to use local alternatives to the most advanced Nvidia semiconductors. While not the most advanced, the H800 is a relatively powerful processor and is barred by the US from China.
The company published a research paper this month that said its models at times outperformed Meta Platforms Inc in certain benchmarks, which has not been independently verified.
However, if they work as advertised, Ant’s platforms could mark another step forward for Chinese AI development by slashing the cost of inferencing or supporting AI services.
Ant said it cost about 6.35 million yuan (US$875,952) to train 1 trillion tokens using high-performance hardware, but its optimized approach would cut that down to 5.1 million yuan using lower-specification hardware.
Tokens are the fundamental units of text — such as words, characters or parts of words — that a language model breaks down and analyzes to understand context, meaning and structure.
In essence, they are the building blocks that enable the model to interpret human language and produce intelligent output.
The company plans to leverage the recent breakthrough in the large language models it has developed, Ling-Plus and Ling-Lite, for industrial AI solutions including healthcare and finance, the people said.
On English-language understanding, Ant in its paper said that the Ling-Lite model did better in a key benchmark compared with one of Meta’s Llama models.
Ling-Lite and Ling-Plus models outperformed DeepSeek’s equivalents on Chinese-language benchmarks.
Ant has made the Ling models open-source. Ling-Lite contains 16.8 billion parameters, which are adjustable settings that work like knobs and dials to direct the model’s performance.
Ling-Plus has 290 billion parameters, which is considered relatively large in the realm of language models. For comparison, experts estimate that ChatGPT’s GPT-4.5 has 1.8 trillion parameters, MIT Technology Review said. DeepSeek-R1 has 671 billion.
The company faced challenges in some areas of the training, including stability.
Even small changes in the hardware or the model’s structure led to problems, including jumps in the models’ error rate, it said in the paper.
Additonal reporting by staff writer
Taiwan will prioritize the development of silicon photonics by taking advantage of its strength in the semiconductor industry to build another shield to protect the local economy, National Development Council (NDC) Minister Paul Liu (劉鏡清) said yesterday. Speaking at a meeting of the legislature’s Economics Committee, Liu said Taiwan already has the artificial intelligence (AI) industry as a shield, after the semiconductor industry, to safeguard the country, and is looking at new unique fields to build more economic shields. While Taiwan will further strengthen its existing shields, over the longer term, the country is determined to focus on such potential segments as
UNCERTAINTY: Innolux activated a stringent supply chain management mechanism, as it did during the COVID-19 pandemic, to ensure optimal inventory levels for customers Flat-panel display makers AUO Corp (友達) and Innolux Corp (群創) yesterday said that about 12 to 20 percent of their display business is at risk of potential US tariffs and that they would relocate production or shipment destinations to mitigate the levies’ effects. US tariffs would have a direct impact of US$200 million on AUO’s revenue, company chairman Paul Peng (彭雙浪) told reporters on the sidelines of the Touch Taiwan trade show in Taipei yesterday. That would make up about 12 percent of the company’s overall revenue. To cope with the tariff uncertainty, AUO plans to allocate its production to manufacturing facilities in
COLLABORATION: Given Taiwan’s key position in global supply chains, the US firm is discussing strategies with local partners and clients to deal with global uncertainties Advanced Micro Devices Inc (AMD) yesterday said it is meeting with local ecosystem partners, including Taiwan Semiconductor Manufacturing Co (TSMC, 台積電), to discuss strategies, including long-term manufacturing, to navigate uncertainties such as US tariffs, as Taiwan occupies an important position in global supply chains. AMD chief executive officer Lisa Su (蘇姿丰) told reporters that Taiwan is an important part of the chip designer’s ecosystem and she is discussing with partners and customers in Taiwan to forge strong collaborations on different areas during this critical period. AMD has just become the first artificial-intelligence (AI) server chip customer of TSMC to utilize its advanced
Chizuko Kimura has become the first female sushi chef in the world to win a Michelin star, fulfilling a promise she made to her dying husband to continue his legacy. The 54-year-old Japanese chef regained the Michelin star her late husband, Shunei Kimura, won three years ago for their Sushi Shunei restaurant in Paris. For Shunei Kimura, the star was a dream come true. However, the joy was short-lived. He died from cancer just three months later in June 2022. He was 65. The following year, the restaurant in the heart of Montmartre lost its star rating. Chizuko Kimura insisted that the new star is still down