Doubao's Video Generation Model "VideoWorld" Can Understand the World Through Vision Alone | TMTPOST

Feb. 10, 2025

TMTPOST -- The experimental video generation model "VideoWorld" was jointly developed by the Doubao Large Model Team, Beijing Jiaotong University, and the University of Science and Technology of China. Unlike mainstream multimodal models such as Sora, DALL-E, and Midjourney, VideoWorld is the first in the industry to achieve world understanding purely through visual cognition, without relying on language models. The project's code and model have now been open-sourced.
