VideoWorldModel | CVPR 2026 Workshop

About

World models are systems that predict the next state of the world based on historical states and interactive action control. Among various approaches, Video World Models based on interactive video generation are particularly promising due to the photorealism and scalability of video data, as well as recent advances in video generation.

However, current video generation models face significant challenges in achieving the ideal Video World Model. This workshop aims to provide a platform for researchers from both academia and industry to discuss and address these challenges, foster collaboration, advance related academic research, and promote the practical application of Video World Models.

🎮

Interaction

Effective interaction with virtual worlds, including navigation and object manipulation

🧠

Memory

Maintaining consistency over long video sequences with causal reasoning

⚡

Efficiency

Real-time video generation with high quality, addressing throughput and latency

🤖

Applications

Robotics, Embodied AI, autonomous driving, and more

Schedule

Event	Time
Opening Remarks	8:30 AM - 8:35 AM
Invited Talk #1 (20 min + 5 min Q&A)	8:35 AM - 9:00 AM
Invited Talk #2 (20 min + 5 min Q&A)	9:00 AM - 9:25 AM
Invited Talk #3 (20 min + 5 min Q&A)	9:25 AM - 9:50 AM
Poster Session + Coffee Break + Best Paper Award	9:50 AM - 10:40 AM
Invited Talk #4 (20 min + 5 min Q&A)	10:40 AM - 11:05 AM
Invited Talk #5 (20 min + 5 min Q&A)	11:05 AM - 11:30 AM
Invited Talk #6 (20 min + 5 min Q&A)	11:30 AM - 11:55 AM
Closing Remarks	11:55 AM - 12:00 PM

Call for Papers

Topics of Interest

Interactive video generation and world simulation
Long-term memory and consistency in video generation
Efficient and real-time video generation methods
Video world models for robotics and embodied AI
Video world models for autonomous driving
Action-conditioned video prediction and generation
Evaluation and benchmarking of video world models
Novel architectures and training methods for video world models

Track 1: Proceedings

Submissions must present original, unpublished research. Manuscripts should be 4–8 pages (excluding references) using the CVPR 2026 template. Accepted papers will be published in the CVPR 2026 Workshop Proceedings.

Submission Deadline March 1, 2026 (AoE)
Notification to Authors March 20, 2026
Camera-Ready Deadline April 10, 2026

Submission Portal

Track 2: Non-Proceedings

A flexible, non-archival venue for sharing a broad range of contributions without restrictive publishing constraints, formatting requirements, or page limits. We warmly welcome:

Works-in-progress and preliminary results
Open Datasets
Technical Reports
Recent work submitted or published within the last year
Position papers and conceptual frameworks

Submission Deadline April 14, 2026 (AoE)
Notification to Authors May 4, 2026
Camera-Ready Deadline May 11, 2026

Submission Portal

Submission Guidelines

Proceedings Track papers must follow the official CVPR 2026 template and be anonymized for double-blind review
Supplementary materials (videos, code, etc.) are welcome

WorldArena Challenge

⚡ Open Now — Join the Challenge

An associated challenge of the Workshop on Video World Models, built on the WorldArena benchmark. The challenge evaluates embodied world models from two complementary perspectives: Track 1: Video Perception Quality and Track 2: Embodied Task Functionality. Submit your model, climb the leaderboard, and compete for awards at CVPR 2026.

Final Submission Deadline in

--Days

--Hours

--Mins

--Secs