OpenAI has unveiled Sora, a video generation model capable of producing up to a minute of high-fidelity video, utilizing a diffusion transformer architecture that processes video and image data as spacetime patches. This approach allows Sora to handle variable durations, resolutions, and aspect ratios, aiming to create general-purpose simulators of the physical world. Concurrently, a new benchmark suite called WorldMark has been introduced to standardize the evaluation of interactive video world models, addressing the previous lack of comparable metrics across different models. AI
Summary written by None from 4 sources. How we write summaries →
RANK_REASON OpenAI released Sora, a frontier video generation model, alongside a technical report detailing its capabilities.