
Chinese developer launches multimodal model unifying video, image, text

Xinhua | Updated: 2024-10-22 11:03

BEIJING -- The Beijing Academy of Artificial Intelligence (BAAI) on Monday released Emu3, a multimodal world model that unifies the understanding and generation of text, image, and video modalities with next-token prediction.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks, said Wang Zhongyuan, director of BAAI, in a press release.

"By tokenizing images, text, and videos into a discrete space, we train a single transformer from scratch on a mixture of multimodal sequences," Wang said, adding that Emu3 eliminates the need for diffusion or compositional approaches entirely.

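As a rough illustration of the idea Wang describes, the sketch below places text tokens and discrete visual codes in one shared vocabulary, joins them into a single sequence, and derives next-token prediction targets by shifting that sequence by one position. The vocabulary sizes, the `BOI`/`EOI` boundary markers, and the function names are all hypothetical choices made for this example and are not taken from Emu3's released code. A video could be handled the same way by concatenating the codes of its frames.

```python
from typing import List, Tuple

# Hypothetical vocabulary layout (an assumption for illustration only):
# text ids occupy [0, TEXT_VOCAB), visual codebook ids are offset after them,
# and two special markers delimit the visual span.
TEXT_VOCAB = 1000
VISUAL_CODEBOOK = 512
BOI = TEXT_VOCAB + VISUAL_CODEBOOK  # begin-of-image marker
EOI = BOI + 1                       # end-of-image marker


def visual_to_ids(codes: List[int]) -> List[int]:
    """Shift visual codebook indices into the shared vocabulary and wrap them in markers."""
    for code in codes:
        if not 0 <= code < VISUAL_CODEBOOK:
            raise ValueError(f"visual code out of range: {code}")
    return [BOI] + [TEXT_VOCAB + code for code in codes] + [EOI]


def build_sequence(text_ids: List[int], image_codes: List[int]) -> List[int]:
    """Concatenate text tokens and visual tokens into one flat token sequence."""
    return list(text_ids) + visual_to_ids(image_codes)


def next_token_pairs(seq: List[int]) -> Tuple[List[int], List[int]]:
    """Return (inputs, targets), where the target at position t is the token at t + 1."""
    return seq[:-1], seq[1:]


# Toy example: three text tokens followed by three visual codes.
seq = build_sequence([5, 42, 7], [0, 511, 3])
inputs, targets = next_token_pairs(seq)
```

Because every modality ends up as ids in the same vocabulary, a single transformer can in principle be trained on `inputs` and `targets` with an ordinary next-token objective, with no separate diffusion component.
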
Emu3 outperforms several well-established task-specific models in both generation and perception tasks, according to BAAI, which has open-sourced the key technologies and models of Emu3 to the international technology community.

Technology practitioners have said that Emu3 opens a new opportunity to explore multimodality through a unified architecture, without the need to combine complex diffusion models with large language models (LLMs).

"In the future, the multimodal world model will promote scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference," Wang said.

Copyright 1994 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.