World Models · University of Macau
PF-OPSD: When Should an MLLM Trust a World Model's Video?
PF-OPSD teaches a Qwen3.5-9B MLLM to decide when to simulate the future with a video world model, verify the rollout, and fold it into its answer, lifting accuracy +10.6 and +10.9 points on two new QA benchmarks.