男女羞羞视频在线观看,国产精品黄色免费,麻豆91在线视频,美女被羞羞免费软件下载,国产的一级片,亚洲熟色妇,天天操夜夜摸,一区二区三区在线电影
Global EditionASIA 中文雙語Fran?ais
China
Home / China / Innovation

Beijing Academy of AI unveils next-gen multimodal model Emu3

By DU JUAN | chinadaily.com.cn | Updated: 2024-10-24 15:37
Share
Share - WeChat

This week, the Beijing Academy of Artificial Intelligence unveiled a self-developed multimodal world model named Emu3, which achieves a unified understanding and generation of video, images and text.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks. In simple terms, it shows that predicting the next word or element in a sequence can be useful for models that handle both text and images, not just text alone.

Emu3 focuses on predicting the next part of a sequence, removing the necessity for complex methods like diffusion or composition. It converts images, text, and videos into a common format, teaching a single transformer model from the beginning on a mix of different types of sequences containing both text and images.

According to the academy, it has open-sourced Emu3's key technologies and models to the international tech community. Industry experts have expressed that for researchers, Emu3 signifies a new opportunity to explore multimodality through a unified architecture without the need to combine complex diffused models with large language models.

Wang Zhongyuan, director of the academy, said Emu3 has demonstrated high performance in multimodal tasks through next-token prediction, paving the way for the development of multimodal AGI.

"Emu3 has the potential to converge infrastructure development onto a single technical path, laying the foundation for large-scale multimodal training and inference," he said. "This simple architectural design will facilitate industrialization. In the future, multimodal world models will drive applications in scenarios such as robotic cognition, autonomous driving, multimodal conversations and reasoning."

Top
BACK TO THE TOP
English
Copyright 1994 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
 
主站蜘蛛池模板: 阿坝县| 游戏| 巢湖市| 芜湖县| 神池县| 上蔡县| 九龙城区| 探索| 精河县| 金门县| 曲沃县| 富顺县| 芦溪县| 深圳市| 盐城市| 孝昌县| 皋兰县| 社会| 株洲县| 贵港市| 宁远县| 镇远县| 苍梧县| 中江县| 从化市| 溧阳市| 奎屯市| 阳新县| 新乐市| 陆河县| 建平县| 泾阳县| 子洲县| 岢岚县| 西丰县| 谷城县| 开封县| 区。| 浦北县| 龙川县| 玉山县| 东阳市| 佳木斯市| 台东县| 隆子县| 昔阳县| 澄迈县| 家居| 乌兰察布市| 中阳县| 巴彦淖尔市| 江津市| 南康市| 赤水市| 安徽省| 绩溪县| 松原市| 西吉县| 隆昌县| 双流县| 孝感市| 新泰市| 靖江市| 广宁县| 宿松县| 尼勒克县| 洪泽县| 孟津县| 东乡族自治县| 小金县| 翼城县| 永吉县| 池州市| 宁海县| 白沙| 灵山县| 兰考县| 台南市| 柘荣县| 华池县| 凤山县| 南京市|