Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-12-17 15:44:35 | Author: Gong Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in applications such as robot brains, autonomous driving, and multimodal dialogue and inference.

Trained solely on next-token prediction, Emu3 demonstrates that this approach can be a powerful paradigm for multimodal models.

Existing multimodal AI models are mostly designed for specific tasks, each with its own architecture and methods. In video generation, for instance, many developers use the diffusion transformer (DiT) architecture, as Sora does. Stable Diffusion is used for text-to-image synthesis, Sora for text-to-video generation, and GPT-4V for image-to-text generation.

In contrast to these models, which combine isolated skills rather than offering an inherently unified ability, Emu3 eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI trained a single transformer from scratch.
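The idea of placing text and images in one discrete token space can be illustrated with a toy sketch. This is not Emu3's tokenizer: the word list, the codebook size, and the fixed-level quantizer below are all hypothetical stand-ins, whereas a real system would use a learned visual tokenizer and a far larger vocabulary. The sketch only shows how both modalities can end up in a single sequence of integer IDs, from which next-token training pairs are formed.

```python
from typing import List, Tuple

# Hypothetical toy vocabulary; a real model would use a learned text tokenizer.
TEXT_VOCAB = ["<bos>", "<eos>", "<img>", "</img>", "a", "red", "square"]
# Hypothetical codebook size; a stand-in for a learned visual codebook.
NUM_VISUAL_CODES = 16


def text_to_ids(words: List[str]) -> List[int]:
    """Map each word to its index in the toy text vocabulary."""
    return [TEXT_VOCAB.index(w) for w in words]


def quantize_patch(value: float) -> int:
    """Map an intensity in [0, 1] to one of NUM_VISUAL_CODES discrete levels."""
    clipped = min(max(value, 0.0), 1.0)
    return min(int(clipped * NUM_VISUAL_CODES), NUM_VISUAL_CODES - 1)


def image_to_ids(pixels: List[float]) -> List[int]:
    """Visual codes are offset so they share one ID space with the text tokens."""
    offset = len(TEXT_VOCAB)
    return [offset + quantize_patch(p) for p in pixels]


def build_sequence(words: List[str], pixels: List[float]) -> List[int]:
    """Concatenate text and image tokens into one flat sequence."""
    return (
        text_to_ids(["<bos>"] + words + ["<img>"])
        + image_to_ids(pixels)
        + text_to_ids(["</img>", "<eos>"])
    )


def next_token_pairs(seq: List[int]) -> List[Tuple[List[int], int]]:
    """Each training example is (all tokens so far, the token that follows)."""
    return [(seq[:i], seq[i]) for i in range(1, len(seq))]


if __name__ == "__main__":
    seq = build_sequence(["a", "red", "square"], [0.0, 0.5, 1.0])
    print(seq)
    for context, target in next_token_pairs(seq):
        print(context, "->", target)
```

Because text and image tokens live in the same ID space, a single sequence model can be trained with one objective over the whole sequence, with no separate diffusion component in this toy setup.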

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it has struggled to excel in multimodal tasks, which have been dominated by diffusion models such as Stable Diffusion and compositional approaches such as CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Emu3 has significant implications for AI development. Mansuy wrote on X that Emu3's success suggests several key insights: next-token prediction as a viable path to general multimodal AI; potential for simplified and more scalable model architectures; and a challenge to the dominance of diffusion and compositional approaches.

Editor: GONG Qian
