Chinese technology company Alibaba Group Holding Ltd (阿里巴巴) yesterday released a new version of its Qwen 2.5 artificial intelligence (AI) model that it said surpassed the highly acclaimed DeepSeek-V3.
The unusual timing of the Qwen 2.5-Max’s release, on the first day of the Lunar New Year when most Chinese people are off work and with their families, points to the pressure Chinese AI start-up DeepSeek’s (深度求索) meteoric rise in the past three weeks has placed on not just overseas rivals, but also its domestic competition.
“Qwen 2.5-Max outperforms ... almost across the board GPT-4o, DeepSeek-V3 and Llama-3.1-405B,” Alibaba’s cloud unit said in an announcement posted on its official WeChat account, referring to OpenAI and Meta Platforms Inc’s most advanced open-source AI models.
Photo: Bloomberg
The Jan. 10 release of DeepSeek’s AI assistant, powered by the DeepSeek-V3 model, as well as the Jan. 20 release of its R1 model, has shocked Silicon Valley and caused tech shares to plunge, with the Chinese start-up’s purportedly low development and usage costs prompting investors to question huge spending plans by leading AI firms in the US.
However, DeepSeek’s success has also led to a scramble among its domestic competitors to upgrade their own AI models.
Two days after the release of DeepSeek-R1, TikTok owner ByteDance Ltd (字節跳動) released an update to its flagship AI model, which it said outperformed Microsoft Corp-backed OpenAI’s o1 in AIME, a benchmark test that measures how well AI models understand and respond to complex instructions.
This echoed DeepSeek’s claim that its R1 model rivaled OpenAI’s o1 on several performance benchmarks.
The predecessor of DeepSeek’s V3 model, DeepSeek-V2, triggered an AI model price war in China after it was released in May last year.
The fact that DeepSeek-V2 was open-source and unprecedentedly cheap, only 1 yuan (US$0.14) per 1 million tokens — or units of data processed by the AI model — led to Alibaba’s cloud unit announcing price cuts of up to 97 percent on a range of models.
Other Chinese tech companies followed suit, including Baidu Inc (百度), which released China’s first equivalent to ChatGPT in March 2023, and the country’s most valuable Internet company, Tencent Holdings Ltd (騰訊).
Liang Wenfeng (梁文鋒), DeepSeek’s enigmatic founder, said in a rare interview with Chinese media outlet Waves in July last year that the start-up “did not care” about price wars and that achieving AGI (artificial general intelligence) was its main goal.
OpenAI defines AGI as autonomous systems that surpass humans in most economically valuable tasks.
Liang said he believed China’s largest tech companies might not be well suited to the future of the AI industry, contrasting their high costs and top-down structures with DeepSeek’s lean operation and loose management style.
“Large foundational models require continued innovation, tech giants’ capabilities have their limits,” he said.
NEW MARKET: The partnership opens up India to the Dutch company, which already has a strong hold in the semiconductor market of South Korea, Taiwan and China ASML Holding NV entered into a partnership agreement with Tata Electronics Pvt Ltd aimed at ramping up India’s goal to develop domestic chip-manufacturing capabilities. The Dutch company’s technology would help power Tata Electronics’ planned 300 millimeter (mm) semiconductor foundry in Gujarat, according to a joint statement from the two companies on Saturday. The signing of a memorandum of understanding coincides with a visit by Indian Prime Minister Narendra Modi to the Netherlands, which is looking to deepen bilateral relations with New Delhi. ASML, whose top customers include Taiwan Semiconductor Manufacturing Co (台積電) and Samsung Electronics Co, makes lithography machines that can print
ROUGH RECORDS: Bonds in Japan, as well is in New Zealand, Australia and the US, are seeing the effects of a nervy market as stock exchanges across Asia edge down A deepening slump in Japanese government bonds added fuel to the selloff in global debt markets as rising oil prices stoked inflation fears and pushed yields to multi-decade highs. Japan’s 30-year yield yesterday surged as much as 20 basis points to the highest level since the tenor’s debut in 1999, before paring some of the move. Shorter-maturity Japanese debt was also under pressure, underscored by weak demand at a sale of five-year notes that offered a record-high coupon of 2 percent. Concerns over inflation and government spending rippling through markets including the US, Australia and New Zealand are being amplified in Japan,
The US has cleared about 10 Chinese firms to buy Nvidia Corp’s second-most powerful artificial intelligence (AI) chip, the H200, but not a single delivery has been made so far, three people familiar with the matter said, leaving a major technology deal in limbo as chief executive officer Jensen Huang (黃仁勳) seeks a breakthrough in China this week. Huang, who was not initially listed in a White House delegation to Beijing, joined the trip after an invitation from US President Donald Trump, a source said. Trump picked him up in Alaska en route to a summit with Chinese President Xi Jinping
Wall Street is licking its chops over an unprecedented slate of massive initial public offerings (IPOs) set to arrive in the coming months, beginning with Elon Musk’s Space Exploration Technologies Corp (SpaceX) next month. That is expected to be followed by artificial intelligence (AI) rivals OpenAI and Anthropic PBC. The trio of mega listings, each eyeing valuations around US$1 trillion or more, constitutes a heady period of elevated risk and reward. SpaceX is targeting an IPO that would raise up to US$80 billion — about double the funds generated from all IPOs last year. OpenAI and Anthropic are eyeing IPOs raising US$60 billion. “We’re