男女羞羞视频在线观看,国产精品黄色免费,麻豆91在线视频,美女被羞羞免费软件下载,国产的一级片,亚洲熟色妇,天天操夜夜摸,一区二区三区在线电影
Global EditionASIA 中文雙語Fran?ais
China
Home / China / Innovation

Chinese developer launches multimodal model unifying video, image, text

Xinhua | Updated: 2024-10-22 11:03
Share
Share - WeChat

BEIJING -- The Beijing Academy of Artificial Intelligence (BAAI) on Monday released Emu3, a multimodal world model that unifies the understanding and generation of text, image, and video modalities with next-token prediction.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks, said Wang Zhongyuan, director of BAAI, in a press release.

"By tokenizing images, text, and videos into a discrete space, we train a single transformer from scratch on a mixture of multimodal sequences," Wang said, adding that Emu3 eliminates the need for diffusion or compositional approaches entirely.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, according to BAAI, which has open-sourced the key technologies and models of Emu3 to the international technology community.

Technology practitioners have said that a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models (LLMs).

"In the future, the multimodal world model will promote scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference," Wang said.

Top
BACK TO THE TOP
English
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
 
主站蜘蛛池模板: 西华县| 资源县| 辉县市| 宝清县| 溧阳市| 乡城县| 沐川县| 旺苍县| 马山县| 淮南市| 定兴县| 丰都县| 滨州市| 合水县| 武强县| 邹平县| 诸暨市| 临沭县| 隆德县| 南涧| 隆尧县| 开鲁县| 呼图壁县| 延津县| 原阳县| 屏南县| 安西县| 孟津县| 应城市| 甘德县| 谢通门县| 南投市| 普定县| 淮北市| 株洲县| 和田市| 富平县| 泉州市| 临安市| 汝南县| 留坝县| 庄浪县| 白水县| 利辛县| 太和县| 清苑县| 宁国市| 通渭县| 从江县| 广东省| 寿宁县| 浮山县| 山东| 赤水市| 麦盖提县| 辽源市| 郴州市| 新疆| 霍林郭勒市| 昭通市| 安宁市| 普兰县| 惠来县| 顺平县| 兴宁市| 紫云| 秭归县| 鄂州市| 阿克苏市| 图木舒克市| 夏河县| 甘孜县| 桦甸市| 红原县| 福鼎市| 池州市| 界首市| 红原县| 周宁县| 九江市| 杭锦旗| 广汉市|