综合一区欧美国产,99国产麻豆免费精品,九九精品黄色录像,亚洲激情青青草,久久亚洲熟妇熟,中文字幕av在线播放,国产一区二区卡,九九久久国产精品,久久精品视频免费

Global EditionASIA 中文雙語Fran?ais
Business
Home / Business / Technology

Nation's firms eye lightweight LLMs as AI race heats up

Smaller large models require fewer calculations, less powerful processors

By CHENG YU | CHINA DAILY | Updated: 2024-03-11 09:02
Share
Share - WeChat
An employee introduces an AI large model to a visitor (middle) during the 2nd Global Digital Trade Expo in Hangzhou, Zhejiang province. [ZHU HAIWEI/FOR CHINA DAILY]

More Chinese companies are developing lightweight large language models after US-based technology firm OpenAI launched a text-to-video model, Sora, last month, hiking the stakes in the global AI race.

The lightweight model, also known as a smaller large model, basically refers to those that require fewer parameters. This means they will have limited capacity to process and generate text compared to large models.

Simply put, these small models are like compact cars, while large models are like luxury sport utility vehicles.

In February, Chinese artificial intelligence startup ModelBest Inc launched its latest lightweight large model, generating much attention in the AI industry.

Dubbed as MiniCPM-2B, the model is embedded with a capacity of 2 billion parameters, much smaller than the 1.7 trillion parameters that OpenAI's massive GPT-4.0 can handle.

In December, US tech giant Microsoft released Phi-2, a small language model capable of common-sense reasoning and language understanding, although this packed 2.7 billion parameters.

Li Dahai, CEO of ModelBest, said the new model's performance is close to that of Mistral-7B from French AI company Mistral on open-sourced general benchmarks with better ability on Chinese, mathematics and coding. Its overall performance exceeds some peer large models with some 10-billion-level parameters, Li said.

"Both large and smaller large models have their advantages, depending on the specific requirements of a task and their constraints, but Chinese companies may find a way out to leverage small models amid an AI boom," said Li.

Zhou Hongyi, founder and chairman of 360 Security Technology, and a member of the 14th National Committee of the Chinese People's Political Consultative Conference at the ongoing two sessions, had also said previously in an interview that creating a universal large model that surpasses GPT-4.0 may be challenging at the moment.

Though GPT-4.0 currently "knows everything, it is not specialized", he said.

"If we can excel in a particular business domain by training a model with unique business data and integrating it with many business tools within that sector, such a model will not only have intelligence, but also possess unique knowledge, even hands and feet," he said.

Li said that if such a lightweight model can be applied to industries, its commercial value will be huge.

"If the model is compressed, it will require fewer calculations to operate, which also means less powerful processors and less time to complete responses," Li said.

"With the popularity of such end-side models, the inference cost of more electronic devices, such as mobile phones, will further decrease in the future," he added.

Top
BACK TO THE TOP
English
Copyright 1994 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
CLOSE
 
开原市| 东乌珠穆沁旗| 鄂伦春自治旗| 新泰市| 光泽县| 洪江市| 潼南县| 黄梅县| 罗城| 余干县| 东阳市| 杂多县| 永丰县| 大厂| 惠州市| 工布江达县| 奉贤区| 天峨县| 克什克腾旗| 得荣县| 宁城县| 轮台县| 乌拉特后旗| 方山县| 宁陕县| 东莞市| 邢台市| 会宁县| 广昌县| 临江市| 德钦县| 会宁县| 新泰市| 改则县| 溧水县| 乾安县| 军事| 贵南县| 罗山县| 繁峙县| 台北市|