SketchVerify: Physics-Savvy Planning for More Realistic AI Videos
Ever asked an AI model to make a video and gotten floating cups or jittery motion? SketchVerify is a training-free planning approach that makes AI videos more physically believable by vetting motion plans before you spend compute on full synthesis.
How it works:
- Given a prompt and reference image, it proposes many candidate object trajectories (plans).
- Each plan is rendered as a lightweight “video sketch” by compositing moving objects over a static background, so no heavy diffusion is needed at this stage.
- A vision-language verifier scores each sketch for instruction match and physical plausibility. The best plan is kept and refined iteratively until it is satisfactory.
- The chosen plan is then passed to the trajectory-conditioned generator, so the expensive final synthesis runs only once.
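The loop described in the steps above can be sketched in a few lines of Python. This is a toy illustration, not the authors' code: every name here (propose_plan, render_sketch, toy_verifier, plan_with_verification) is hypothetical, the "sketch" is a character grid rather than an image composite, and the verifier is a hand-written heuristic standing in for a vision-language model.

```python
import random

def propose_plan(start, n_frames, rng, seed_plan=None):
    """Hypothetical proposer: a random walk, or a small perturbation of seed_plan."""
    if seed_plan is None:
        plan = [start]
        for _ in range(n_frames - 1):
            x, y = plan[-1]
            plan.append((x + rng.uniform(-1.5, 1.5), y + rng.uniform(-1.5, 1.5)))
        return plan
    # Refinement: keep the first frame fixed and jitter the remaining positions.
    return [seed_plan[0]] + [
        (x + rng.uniform(-1.0, 1.0), y + rng.uniform(-1.0, 1.0))
        for x, y in seed_plan[1:]
    ]

def render_sketch(plan, width=8, height=8):
    """Toy 'video sketch': paste one object marker onto a blank static grid per frame."""
    frames = []
    for x, y in plan:
        grid = [["." for _ in range(width)] for _ in range(height)]
        col = min(max(int(round(x)), 0), width - 1)
        row = min(max(int(round(y)), 0), height - 1)
        grid[row][col] = "O"
        frames.append(grid)
    return frames

def toy_verifier(frames, target):
    """Stand-in for the vision-language verifier (assumption: a simple heuristic).

    Returns a score <= 0; higher is better. It penalizes missing the target
    in the last frame (instruction match) and large per-frame jumps
    (a crude proxy for physical plausibility).
    """
    positions = [
        (c, r)
        for grid in frames
        for r, row in enumerate(grid)
        for c, cell in enumerate(row)
        if cell == "O"
    ]
    fx, fy = positions[-1]
    goal_error = abs(fx - target[0]) + abs(fy - target[1])
    jump_penalty = sum(
        max(0, abs(x1 - x0) + abs(y1 - y0) - 2)
        for (x0, y0), (x1, y1) in zip(positions, positions[1:])
    )
    return -(goal_error + jump_penalty)

def plan_with_verification(start, target, n_frames=6, n_candidates=16,
                           max_rounds=5, good_enough=0, seed=0):
    """Sample candidate plans, score their sketches, keep the best, refine."""
    rng = random.Random(seed)
    best_plan, best_score = None, float("-inf")
    for _ in range(max_rounds):
        # The first round samples from scratch; later rounds perturb the best plan.
        candidates = [propose_plan(start, n_frames, rng, best_plan)
                      for _ in range(n_candidates)]
        for plan in candidates:
            score = toy_verifier(render_sketch(plan), target)
            if score > best_score:
                best_plan, best_score = plan, score
        if best_score >= good_enough:
            break
    # In the real pipeline, the expensive trajectory-conditioned generator
    # would be called here, once, with best_plan. That step is omitted.
    return best_plan, best_score

if __name__ == "__main__":
    plan, score = plan_with_verification(start=(0.0, 0.0), target=(5.0, 5.0))
    print(score, plan)
```

The point of the structure is that only cheap operations (sampling, sketch rendering, scoring) sit inside the loop, so raising n_candidates or max_rounds adds planning cost without adding any full video synthesis calls.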
Results: On WorldModelBench and PhyWorldBench, SketchVerify improves motion quality, physical realism, and long-term consistency while being more efficient, because candidate plans are scored as cheap sketches rather than as fully synthesized videos. Sampling more candidate trajectories consistently improves results.
Paper: https://arxiv.org/abs/2511.17450v1
Authors: Yidong Huang et al.
#AI #VideoGeneration #GenerativeAI #ComputerVision #Physics #Planning #Diffusion #Research