As artificial intelligence (AI) sweeps across the world, many countries have begun to develop their own large language models, while various industries are evaluating the impact of AI usage on their business. Against this background, Academia Sinica recently released a beta version of its large language model-based chatbot, CKIP-Llama-2-7b, hoping to gather feedback from the public and to make the model a starting point for local AI chatbots so that Taiwan will not fall behind other countries.
However, the top-level research institute last week removed the traditional Chinese-language AI chatbot from its Web site after testing by netizens produced some disturbing answers to basic questions about matters such as Taiwan’s national day, anthem and leader. Moreover, netizens found that the content provided by Academia Sinica’s chat AI model was not localized enough and used inappropriate wording, as its datasets were provided by several Chinese research institutes, while its dialogue training materials were compiled in simplified Chinese characters.
Academia Sinica admitted the mistake, saying the questionable responses to netizens’ queries were due to the researchers’ use of Chinese datasets in CKIP-Llama-2-7b. As the researchers wanted to save time in developing a chat AI, they simply converted the datasets from simplified Chinese into traditional Chinese characters and put the model online for crowdsourced testing, Academia Sinica explained, adding that it had learned a lesson from the incident and vowed to set up a special task force to avoid repeating the mistakes.
Clearly, it is necessary to establish a Taiwan-based large language model using datasets collected locally, otherwise the content generated by a chat AI would be disputable and controversial on certain issues. Take CKIP-Llama-2-7b as an example: Academia Sinica had claimed its model could be used for academic and commercial purposes, copywriting, literary creation and question-and-answer systems, as well as customer service, language translation, text editing and teaching Chinese. But without datasets drawn from local language examples that reflect a Taiwanese context, any homegrown AI would be hard pressed to achieve its expected goals and might be ill-suited to local use, some AI experts have warned.
This raises the question of whether Taiwan is determined to develop local language datasets and large language models. Granted, doing so is extremely expensive, not only financially, but also in terms of time and the massive computing power required, and it challenges the government to draft the required budget proposal and obtain approval from legislators. It is also very unlikely that private enterprises would invest more resources in software development in addition to hardware upgrades. Nevertheless, it is a fundamental requirement, since large language model AIs need to be trained using massive datasets.
In addition to the localization issue, users of chat AIs, whether OpenAI’s ChatGPT or Google’s Bard, face a common problem: inaccuracy and factual errors. Until there is a major breakthrough in AI technology, users’ judgement and knowledge are essential to detect and counter any AI bias, which derives from the algorithms’ tendency to reflect the national, cultural and ideological biases of their creators. Take AI-assisted teaching in Taiwan’s classrooms as an example: Teachers’ ability to judge and correct questionable content generated by AIs is the key to teaching successfully with the technology, and it is what allows AI to greatly boost teaching efficiency.
The CKIP-Llama-2-7b incident serves as a reminder that the large-scale use of chat AI has national security implications that the government must address urgently. Moreover, this powerful tool has its own fundamental flaws, which require users to exercise the utmost discretion based on their own expertise and judgement, rather than blind trust.