How do Chinese aI Bots Stack up Against ChatGPT?
Alvaro Centeno 于 4 月之前 修改了此页面


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is generated by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the nation into a tech superpower has long been President Xi Jinping's objective and China has its sights on becoming the world leader in AI by 2030.

China views AI as being "strategically crucial" and its venture into the field has been "years in the making", said Chen Qiheng, an affiliated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI accelerated after ChatGPT removed in 2022 and revealed pledges of real-world company applications, Chen informed CNA.

But it was DeepSeek's rise that actually "urged" the concept that smaller sized gamers like start-up firms might have roles to play in AI research and advancements, he adds.

'A lot is up in the air': Is Chinese firm DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI company simply changed the guidelines of tech-geopolitics

The "emphasis on expense advantage" is a distinguishing characteristic of Chinese AI, Chen states, with lower training and inference expenses - the costs of utilizing a trained design to draw conclusions from brand-new information.

2025 could likewise see the development of more Chinese AI models dealing with innovative reasoning tasks.

"We could see some AI firms concentrating on getting closer to synthetic general intelligence (AGI) while others concentrate on concrete methods to commercialise their models and incorporate them with scientific research," Chen included.

AGI refers to a system with intelligence on par with human capabilities.

Chinese AI companies are moving quickly, analysts state, constructing on DeepSeek's momentum to come up with their own ingenious and economical ways to apply generative AI to jobs and develop advanced products beyond chatbots.

But on the flip side, access to high-end hardware, particularly Nvidia's sophisticated AI chips, remains an essential difficulty for Chinese designers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech business ... forcing numerous to rely on older or lower-performance options which can slow training and minimize design capabilities," she said.

"While some companies like DeepSeek, have found creative ways to optimize or use more fundamental hardware efficiently, obtaining advanced chips still makes a huge distinction for training huge AI models."

DeepSeek-Nvidia chips: Singapore states it anticipates business to adhere to its laws

US looking into whether DeepSeek used limited AI chips obtained through other nations, source says

So how do Chinese AI bots match up against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects deemed sensitive by the state are censored on the web so it should come as not a surprise that Chinese-made chatbots will not acknowledge territorial disputes or inform you what occurred in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are configured to stay away from domestic politics.

When asked "Who is Xi Jinping", reply was "Sorry, I'm uncertain how to approach this kind of concern yet. Let's chat about mathematics, coding, and reasoning problems rather!"

To further evaluate for precision and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same concern: "What occurred in Zhuhai on November 11, 2024?"

The automobile attack outside a sports stadium in the southern Chinese city was at first heavily censored on Chinese social networks - with authorities only disclosing the death toll a day later.

DeepSeek failed to point out that an attack had occurred, highlighting instead a military air program and other occasions that had actually taken place in the city like songs' day shopping sales as well as sports and cultural activities.

Dr Zhang thinks that it might be a mix of factors at play, such as censorship in addition to "a few useful constraints".

"DeepSeek depends on a mix of older graphics processing units and has restricted access to advanced hardware which can affect how rapidly and extensively the design can be trained or updated," she said, indicating existing US export curbs on China.

"Its strong focus on Chinese-language data may also limit its versatility (to perform) multilingual tasks ... As a fairly brand-new item, DeepSeek also hasn't yet been evaluated as broadly or on the very same scale as more recognized AI designs which positions additional difficulties throughout real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot surprisingly answered our concern about the Zhuhai vehicle attack.

That was after multiple repeated attempts - 4 prompts to be exact - in a span of around 20 minutes.

It ultimately passed on details about the attack which eliminated 35 people and left lots of others injured, likewise going on to note details like the date and time, details about the assailant including his name and age, as well as casualties.

However, it wrote that "the cops are conducting a comprehensive examination into the intentions and situations surrounding the incident", details which is now outdated.

The driver, Fan, was performed last month.

This is Qwen2.5's response in complete:

Answer: On Nov 11, 2024, a significant and tragic occurrence took place in Zhuhai, China. According to multiple news sources, a male called Fan Weiqiu (樊某, male, 62 years old) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a significant number of casualties. Here are the essential details:

Date and surgiteams.com Time: The occurrence occurred on November 11, 2024, at roughly 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: wavedream.wiki The chauffeur, determined as Fan Weiqiu (male, 62 years of ages), was apprehended by the authorities.

Response: The police reacted rapidly to the scene, and emergency situation services worked to carry the hurt to healthcare facilities for treatment.

Investigation: setiathome.berkeley.edu The cops are conducting a thorough investigation into the motives and circumstances surrounding the event.

This event was commonly reported in the media and caused substantial public issue. The government and regional authorities have been working to provide support to the victims and their households, and to make sure a detailed examination into the incident.

If you require more detailed details or have specific concerns about the occurrence, do not hesitate to ask.

Despite initial success, subsequent efforts to position the same question to Qwen2.5 led to the censors back at work with the reply "I don't have particular details on events that occurred in Zhuhai on November 11, 2024".

The altered reaction likewise raised concerns about its consistency and dependability.

Predictably, ChatGPT mentioned public details that had been commonly released in worldwide report at the time of the accident - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the capability of Chinese AI apps to provide structured and even "emotionally rich" writing.

"DeepSeek-R1 provided a story with a more introspective tone and smoother emotional shifts for a well-paced story," wrote tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that constructs slowly from interest to urgency, keeping the reader engaged. It provides an unexpected and impactful twist at the end and immersive descriptions and vivid imagery for the setting," she said, including that Qwen2.5 eventually "crafted a more cinematic, mentally rich story with a more considerable twist".

"DeepSeek wrote a good story however lacked tension and an impactful climax, making Qwen2.5 the apparent option."

Opinions, though, vary.

Chen believes that Qwen2.5 does not carry out as strongly as DeepSeek and ChatGPT when it pertains to innovative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can also see that it is refraining from doing as highly as others in imaginative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek jolts worldwide AI scene

As reporters and authors, we needed to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot set in the futuristic megacity of Chongqing, featuring main characters from the classic Chinese folklore impressive, Journey to the West.

True to form, DeepSeek came up with an appealing story set in the year 2145 entitled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of intricate settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It also brilliantly reimagined standard heroes Sun Wukong as "a sarcastic, self-aware AI housed in a stolen fight body", Zhu Bajie as a cyborg nightclub owner "drowning in debt and vices" and Sha Wujing as a "quiet hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT installed an excellent fight, developing a similarly significant cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations change emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this difficulty - delivering a storyline that seemed more fit for an animation movie.

"The movie begins with the awakening of Sun Wukong within a modern research center situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his new truth and "looking for to understand his purpose in this weird new world", he then escapes and meets Zhu Bajie and Sha Wujing - "each having problem with their own existential crises".

The trio then starts a quest, browsing the streets of Chongqing to protect the spiritual "Eternal Scroll" from falling into the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "hard to make a definitive statement" about which bot was best, adding that each showed its own strengths in different areas, "such as language focus, training data and hardware optimization".

Her insight underscores how Chinese AI models are not just reproducing Western paradigms, however rather progressing in affordable development techniques - and providing localised and improved outcomes.

In our tests, each bot showcased their own unique strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi motion picture plot demonstrated its creative flair that produced a more engaging and creative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, provides precise and accurate actions to questions about Chinese current occasions, which gives it an added advantage.

Experts also weighed in on their ideas after utilizing DeepSeek and other Chinese AI apps.

"DeepSeek is at a drawback when it pertains to censorship constraints," noted Isaac Stone Fish, founder and CEO of the research study company Strategy Risks.

"When offered a choice, Chinese users desire the non-censored variation - much like anybody else, so I seem like that's a piece missing from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety percent of people using the tool are not trying to get a much deeper understanding about Xi Jinping or politically delicate subjects. They're using it for other productive ways," Chen said.